GithubHelp home page GithubHelp logo

guixianjin / some-research-on-noisy-labels Goto Github PK

View Code? Open in Web Editor NEW
7.0 3.0 1.0 887 KB

This is a record of current published experiment results on some realistic noisy data sets

License: MIT License

some-research-on-noisy-labels's Introduction

This is a record of current experiment result on some realistic noisy label datasets

1. Clothing1M dataset

from: CVPR-15: Learning from Massive Noisy Labeled Data for Image Classification

  • 14 classes: T-shirt, Shirt, Knitwear, Chiffon, Sweater, Hoodie, Windbreaker, Jacket, Down Coat, Suit, Shawl, Dress, Vest, and Underwear
  • noisy labeled training dataset ($D_\eta$): $10^6$
  • clean train data($D_c$): 47,570
  • clean validation set: 14,313
  • clean test set: 10,526

Noise confusion matrix

It's not column-diagonally dominant, thus small-loss trick may not work. But if examples in noisy class 3 and noisy class 5 have been swapped, it may become column-diagonally dominant, in which case small-loss trick may work.

some result

some resutl

some result

some result

69.9(only use noisy training data) -> 79.9(fine-tuning)

some result

some result

some result

some result

some result

some result

about 71%

some result

some result

some result

some result

some result

some result

some result

19. [ICLR-20: underview: DivideMiX: Learning with noisy labels as semi-supervised learning]

from: Arxiv17: Webvision database: Visual learning and understanding from web data

  • 1,000 classes: concepts in ImageNet ILSVRC12
  • noise rate: 20-40%

use all data or only use the first 50 classes of Google image subset

use all 1000 classes

only use the first 50 classes of Google image subset

only use the randomly selected 100 classes

from: CVPR-18: CleanNet: Transfer Learning for Scalable Image Classifier Training With Label Noise

  • 101 food classes
  • 310k image, 55k verification,
  • noise rate: 20%
  not use Food101N created by cleanNet paper, but use Food101 and inject 20% noise

from: ICML-19: SELFIE: Refurbishing Unclean Samples for Robust Deep Learning

some-research-on-noisy-labels's People

Contributors

guixianjin avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

Forkers

zhanghaoxin1994

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.