gyasmeen-ml / multidiv-teachers

This project forked from arkaung/multidiv-teachers

Research repo for Active Learning from Divergent Multiple Teachers

Python 85.77% Jupyter Notebook 14.23%

Active Learning from Divergent Multiple Teachers

The idea behind "Active Learning from Divergent Multiple Teachers" is to train multiple teacher neural networks with different architectures, each of which may produce a different softmax output for an input $X$. KL divergence is then computed over all pairwise combinations of these softmax outputs, and the mean of those KL divergences is taken. The samples with the highest mean KL divergence are the prime candidates to be sent to the oracle for labelling. Samples with low mean KL divergence are considered confident, and the teachers' softmax probability distributions for them are treated as multiple soft labels for the student network, which learns from this multi-label distribution. At inference time, the student network looks for the outputs with the highest agreement.
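The scoring step described above can be sketched in a few lines of dependency-free Python. This is only an illustration under stated assumptions: the function names `kl_divergence` and `mean_pairwise_kl`, the `eps` clipping constant, and the choice to evaluate each unordered teacher pair in one direction only are not taken from the repository, which may use a symmetrised KL or a tensor library instead.

```python
import math
from itertools import combinations


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for two discrete distributions given as sequences of floats."""
    # Clip to eps (an assumed constant) so zero probabilities do not cause log(0).
    return sum(
        max(pi, eps) * math.log(max(pi, eps) / max(qi, eps))
        for pi, qi in zip(p, q)
    )


def mean_pairwise_kl(teacher_probs):
    """Mean KL divergence over all unordered pairs of teachers for one sample.

    teacher_probs: list of N softmax vectors, one per teacher.
    """
    pairs = list(combinations(teacher_probs, 2))
    if not pairs:
        raise ValueError("need at least two teachers")
    return sum(kl_divergence(p, q) for p, q in pairs) / len(pairs)


# Toy example with three hypothetical teachers and three classes.
disagreeing = [[0.8, 0.1, 0.1], [0.1, 0.8, 0.1], [0.1, 0.1, 0.8]]
agreeing = [[0.9, 0.05, 0.05]] * 3

score_disagreeing = mean_pairwise_kl(disagreeing)  # large: candidate for the oracle
score_agreeing = mean_pairwise_kl(agreeing)        # zero: teachers are confident
```

In this toy case the disagreeing sample scores far higher than the agreeing one, which is the signal used to decide what goes to the oracle.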

Pipeline

  • Train $N$ neural networks on the training data. Note: add a temperature layer right before the softmax layer.
  • Use those networks to make predictions on the unlabelled data
  • Get the softmax values of the unlabelled data from those networks
  • Use multi-teacher knowledge distillation to train a student network for faster inference, using the teacher networks' softmax outputs on the training data
  • Calculate the mean divergence among all possible pairs of softmax outputs from the $N$ networks. There will be $\binom{N}{2} = N(N-1)/2$ pairs
  • Get the top $100$ data points with the highest divergence to be sent to the oracle. Note: this could be refined into a better selection process, for example by selecting the data points whose divergence is above a certain threshold.
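Two of the pipeline steps, the temperature layer and the selection of samples for the oracle, can be sketched as follows. The names `softmax_with_temperature` and `select_for_oracle`, the `top_k` and `threshold` parameters, and the rule that every sample not sent to the oracle counts as confident are assumptions made for illustration, not the repository's actual code.

```python
import math


def softmax_with_temperature(logits, temperature=1.0):
    """Softmax of logits / temperature; a higher temperature gives a softer distribution."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [z / temperature for z in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(z - m) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def select_for_oracle(divergences, top_k=100, threshold=None):
    """Split sample indices into (to_oracle, confident) by divergence score.

    With threshold=None, the top_k highest-divergence samples go to the oracle.
    With a threshold, every sample scoring strictly above it goes to the oracle.
    """
    order = sorted(range(len(divergences)), key=lambda i: divergences[i], reverse=True)
    if threshold is None:
        to_oracle = order[:top_k]
    else:
        to_oracle = [i for i in order if divergences[i] > threshold]
    chosen = set(to_oracle)
    # Assumption: everything not sent to the oracle is treated as confident.
    confident = [i for i in range(len(divergences)) if i not in chosen]
    return to_oracle, confident


# Toy example: the same logits at two temperatures.
soft = softmax_with_temperature([2.0, 1.0, 0.1], temperature=4.0)
sharp = softmax_with_temperature([2.0, 1.0, 0.1], temperature=1.0)

# Toy divergence scores for five unlabelled samples.
scores = [0.02, 1.45, 0.30, 0.01, 0.90]
to_oracle, confident = select_for_oracle(scores, top_k=2)
by_threshold, _ = select_for_oracle(scores, threshold=0.25)
```

A fixed `top_k` keeps the labelling budget constant per round, while a threshold lets the number of oracle queries shrink as the teachers start to agree; which of the two suits the project is left open in the note above.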
