clustering-triplets-with-tangles

Master Thesis

Setup

Install the requirements with pip install -r requirements.txt. Alternatively, you can use Anaconda to create the environment from a .yaml file using conda env create -f environment.yaml (tested on Ubuntu 20.04) or conda env create -f environment-mac.yaml (tested on an M1 Mac running macOS Monterey).

Additionally, you will have to install cblearn, ComparisonHC and tangles by hand. cblearn can be installed with:

git clone https://github.com/dekuenstle/cblearn 
pip install ./cblearn

ComparisonHC can be installed by following the instructions at https://github.com/mperrot/ComparisonHC. You will have to run:

git clone https://github.com/mperrot/ComparisonHC.git
cd ComparisonHC
python setup.py install

The tangles package is provided in a local submodule. To install, run

git submodule update --init
pip install ./tangles
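
After the three manual installs, a quick check that the packages are visible to the current Python environment can save a confusing import error later. The following is a minimal sketch, not part of this repository. The import names cblearn, comparisonhc and tangles are assumptions based on the package names, so adjust them if your installation exposes different module names.

```python
from importlib.util import find_spec

# Assumed import names for the three manually installed packages.
# These are guesses based on the project names, not confirmed by this README.
ASSUMED_MODULES = ["cblearn", "comparisonhc", "tangles"]


def missing_modules(names):
    """Return the module names that cannot be found in the current environment."""
    return [name for name in names if find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules(ASSUMED_MODULES)
    if missing:
        print("Not importable:", ", ".join(missing))
    else:
        print("All three packages are importable.")
```

If a package is reported as not importable, check that pip and python belong to the same environment you created above.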

Organization

The code written directly for the master thesis is in the top-level folder. Reusable functions live in Python files, and all experiments are run in .ipynb notebooks. The Python files form a small library for tangles-triplet experiments that can be reused on its own. If you just want to cluster triplets with tangles, take a look at estimators.py. If you also need the triplet generation methods and related utilities, take a look at triplets.py and questionnaire.py. All functions and files are documented; if you need further instructions on how to use them, take a look at the .ipynb files.
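
For readers new to triplet data, the sketch below illustrates what a set of triplet comparisons looks like. It is a standalone illustration using only the standard library and does not use or reproduce the functions in triplets.py. The function name random_triplets and the convention that a triplet (a, b, c) means "a is closer to b than to c" are assumptions made for this example; the repository may use a different ordering.

```python
import math
import random


def random_triplets(points, n_triplets, seed=0):
    """Sample triplets (a, b, c) of point indices where a is at least as close to b as to c.

    Hypothetical helper for illustration only; the triplet convention is assumed.
    """
    rng = random.Random(seed)
    triplets = []
    for _ in range(n_triplets):
        # Draw three distinct indices: an anchor a and two candidates b, c.
        a, b, c = rng.sample(range(len(points)), 3)
        # Order the candidates so that the closer one comes second.
        if math.dist(points[a], points[b]) > math.dist(points[a], points[c]):
            b, c = c, b
        triplets.append((a, b, c))
    return triplets


if __name__ == "__main__":
    # Two well-separated groups on a line: {0, 1} and {2, 3}.
    points = [(0.0, 0.0), (1.0, 0.0), (5.0, 0.0), (6.0, 0.0)]
    for triplet in random_triplets(points, 5):
        print(triplet)
```

Triplets like these carry only ordinal information (which of two items is closer to an anchor), which is the kind of input that the clustering methods in this repository work with.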

We step through the important files and folders:

  • latex: Contains all the LaTeX sources for the thesis. paper.tex is the main .tex file, which has include directives for all the subchapters (methods.tex, ...).
  • tests: Test files that can be run via pytest tests (you have to install pytest first).
  • tangles-rust: Abandoned proof of concept of running tangles with Rust.
  • results: Holds all figures and the raw data (.csv) from the .ipynb files.
  • experiments: Contains .ipynb files that tested things in earlier phases of the thesis. Not all of them still work, but you can check out older versions of this repository to get them back (use git tag to list the old versions, which correspond to experiments).
  • tangles: Repository of a modified tangles version, which I used for the thesis. Contains some plotting changes etc.
  • thesis_simulations.ipynb: Contains all code for the "Simulations" figures in the thesis.
  • thesis_real_data.ipynb: Contains all code for the "Real Data" figures in the thesis.
  • thesis.py: Contains helper functions for thesis plotting and experiment orchestration.
