Fashion-MNIST

This notebook explores the Fashion-MNIST dataset (https://github.com/zalandoresearch/fashion-mnist) and trains two neural-network classifiers on it.

The tested models are

  1. ResNet-18
  2. MobileNet-v2

The models are trained for 7 epochs, with the following parameter changes along the way:

  1. Learning rate reduced from 1e-4 to 1e-5 at epoch 4
  2. Batch size increased from 64 to 256 at epoch 5
  3. Augmentations turned on at epoch 6
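
The schedule above can be written down as a small lookup. This is only a sketch, not the notebook's code: the names `EpochConfig` and `config_for_epoch` are hypothetical, and it assumes epochs are numbered from 1 and that each change takes effect at the start of the epoch named in the list.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class EpochConfig:
    learning_rate: float
    batch_size: int
    augment: bool


def config_for_epoch(epoch: int) -> EpochConfig:
    """Return the training settings for a 1-indexed epoch in 1..7.

    Assumption: a change listed "at epoch N" applies from the start of epoch N.
    """
    if not 1 <= epoch <= 7:
        raise ValueError("epoch must be between 1 and 7")
    return EpochConfig(
        learning_rate=1e-4 if epoch < 4 else 1e-5,  # reduced at epoch 4
        batch_size=64 if epoch < 5 else 256,        # increased at epoch 5
        augment=epoch >= 6,                         # turned on at epoch 6
    )


if __name__ == "__main__":
    for epoch in range(1, 8):
        print(epoch, config_for_epoch(epoch))
```

Keeping the schedule in one function makes it easy to try the alternatives listed under potential improvements without touching the training loop.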

The augmentations are:

  1. random shift 1-2 pixels
  2. random scale
  3. random horizontal flip for the relevant classes
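
A minimal sketch of the shift and class-conditional flip, using plain Python lists so it runs without any framework. It is not the notebook's implementation. The set `FLIPPABLE` is an assumption (the list above does not say which classes are "relevant"), the shift here is drawn from -2..2 pixels and so may be zero, and random scaling is left out for brevity.

```python
import random


def shift(image, dx, dy, fill=0):
    """Shift a 2D image (list of rows) right by dx and down by dy, padding with fill."""
    height, width = len(image), len(image[0])
    out = [[fill] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            src_y, src_x = y - dy, x - dx
            if 0 <= src_y < height and 0 <= src_x < width:
                out[y][x] = image[src_y][src_x]
    return out


def hflip(image):
    """Mirror a 2D image left to right."""
    return [row[::-1] for row in image]


# Assumption: only flip classes that look roughly symmetric
# (Fashion-MNIST labels 5, 7 and 9 are footwear and are excluded here).
FLIPPABLE = frozenset({0, 1, 2, 3, 4, 6, 8})


def augment(image, label, rng, max_shift=2, flippable=FLIPPABLE):
    """Apply a random shift, then a random horizontal flip if the class allows it."""
    dx = rng.randint(-max_shift, max_shift)
    dy = rng.randint(-max_shift, max_shift)
    out = shift(image, dx, dy)
    if label in flippable and rng.random() < 0.5:
        out = hflip(out)
    return out


if __name__ == "__main__":
    rng = random.Random(0)
    tiny = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]
    for row in augment(tiny, label=0, rng=rng, max_shift=1):
        print(row)
```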

ResNet-18 takes 32 minutes to train and reaches a test accuracy of 92.41%. MobileNet-v2 takes 23 minutes to train and reaches a test accuracy of 91.43%.

Training hardware: NVIDIA GeForce GTX 960M GPU and Intel Core i7-6700HQ CPU.

Potential improvements:

  1. Better training schedule: use the augmentations from the first epoch, use a learning rate schedule (for example cosine annealing), and train for longer
  2. Augmentations: cutout, mixup
  3. Loss function: label smoothing or focal loss may be promising
  4. TTA (test-time augmentation)
  5. Ensembling: average predictions from several networks
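
Three of these ideas are simple enough to sketch in a few lines. None of this was tried in the notebook. The function names and default values below are hypothetical, and the formulas are the commonly used textbook forms: cosine annealing between a maximum and minimum learning rate, label smoothing that mixes the one-hot target with a uniform distribution, and prediction averaging, which covers both ensembling and TTA.

```python
import math


def cosine_lr(step, total_steps, lr_max=1e-4, lr_min=1e-5):
    """Cosine annealing: lr_max at step 0, decaying smoothly to lr_min at total_steps."""
    progress = step / total_steps
    return lr_min + 0.5 * (lr_max - lr_min) * (1.0 + math.cos(math.pi * progress))


def smooth_labels(label, num_classes=10, eps=0.1):
    """Label smoothing: (1 - eps) * one_hot + eps / num_classes for every class."""
    target = [eps / num_classes] * num_classes
    target[label] += 1.0 - eps
    return target


def average_predictions(predictions):
    """Element-wise mean of several probability vectors.

    The vectors can come from different networks (ensembling) or from one
    network applied to several augmented copies of the same image (TTA).
    """
    count = len(predictions)
    return [sum(values) / count for values in zip(*predictions)]


if __name__ == "__main__":
    print([round(cosine_lr(s, 7), 7) for s in range(8)])
    print(smooth_labels(0))
    print(average_predictions([[0.7, 0.3], [0.5, 0.5]]))
```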

As a sanity check, we give the model a photograph of a T-shirt, and it reasonably estimates 66% for class "T-shirt/top" and 33% for class "Shirt".
