muhammadnabil / resnet-cifar10-caffe

This project forked from ethanhe42/resnet-cifar10-caffe

ResNet-20/32/44/56/110 on CIFAR-10 with Caffe

Home Page: https://arxiv.org/pdf/1512.03385.pdf

License: MIT License

Python 95.31% Shell 4.69%

resnet-cifar10-caffe's Introduction

ResNet 20/32/44/56/110 for CIFAR10 with caffe

Testing

~/caffe/build/tools/caffe test -gpu 0 -iterations 100 -model resnet-20/trainval.prototxt -weights resnet-20/snapshot/solver_iter_64000.caffemodel

Model        Acc       Claimed Acc
ResNet-20    91.4%     91.25%
ResNet-32    92.48%    92.49%
ResNet-44    -         92.83%
ResNet-56    92.9%     93.03%
ResNet-110   -         93.39%

Citation

If you find the code useful in your research, please consider citing:

@InProceedings{He_2017_ICCV,
author = {He, Yihui and Zhang, Xiangyu and Sun, Jian},
title = {Channel Pruning for Accelerating Very Deep Neural Networks},
booktitle = {The IEEE International Conference on Computer Vision (ICCV)},
month = {Oct},
year = {2017}
}

Training

# build Caffe first
git clone https://github.com/yihui-he/resnet-cifar10-caffe
cd resnet-cifar10-caffe
./download_cifar.sh
./train.sh [GPUs] [NET]
# e.g., ./train.sh 0 resnet-20
# logs are written to resnet-20/logs

Visualization

Specify the Caffe path in cfgs.py, then use plot.py to generate beautiful loss plots.

python plot.py PATH/TO/LOGS
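
For readers who want to inspect a log without plot.py, here is a minimal sketch of pulling (iteration, loss) pairs out of Caffe solver output. It is not the code in plot.py; the function name parse_losses and the sample lines are made up for illustration, and the pattern assumes the usual "Iteration N, loss = X" solver lines (with or without the timing info that newer Caffe builds print in parentheses).

```python
import re

# Matches Caffe solver lines such as
#   "... solver.cpp:228] Iteration 100, loss = 2.30259"
# and the variant that carries timing info in parentheses.
LOSS_RE = re.compile(r"Iteration (\d+)(?: \([^)]*\))?, loss = ([0-9.eE+-]+)")


def parse_losses(lines):
    """Return a list of (iteration, loss) pairs found in the log lines."""
    points = []
    for line in lines:
        match = LOSS_RE.search(line)
        if match:
            points.append((int(match.group(1)), float(match.group(2))))
    return points


# Hypothetical log excerpt, written by hand for this example.
sample_log = [
    "I0101 00:00:01.000000  1 solver.cpp:228] Iteration 0, loss = 2.30259",
    "I0101 00:00:02.000000  1 solver.cpp:244]     Train net output #0: loss = 2.30259",
    "I0101 00:00:09.000000  1 solver.cpp:218] Iteration 100 (12.5 iter/s, 8s/100 iters), loss = 1.75",
]
print(parse_losses(sample_log))  # [(0, 2.30259), (100, 1.75)]
```

The resulting pairs can be passed to any plotting library; per-output lines such as "Train net output #0" are skipped on purpose so each iteration appears once.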

Results are consistent with the original paper. There seems to be little difference between resnet-20 and plain-20. However, the second plot shows that plain-110 has difficulty converging.

How I generate prototxts:

Use net_generator.py to generate solver.prototxt and trainval.prototxt. You can generate a ResNet or a plain net of depth 20/32/44/56/110, or even deeper if you want; you only need to change n according to depth = 6n + 2.
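
As a quick check of the depth = 6n + 2 relation, the sketch below solves for n at each listed depth. The helper name blocks_per_stage is hypothetical and is not part of net_generator.py; how that script actually accepts n is not shown here.

```python
def blocks_per_stage(depth):
    """Solve depth = 6n + 2 for n and reject depths that do not fit the formula."""
    if depth < 8 or (depth - 2) % 6 != 0:
        raise ValueError("depth must satisfy depth = 6n + 2 with n >= 1")
    return (depth - 2) // 6


for depth in (20, 32, 44, 56, 110):
    print(depth, blocks_per_stage(depth))
# 20 3
# 32 5
# 44 7
# 56 9
# 110 18
```

A deeper net follows the same rule: for example, n = 27 gives depth 164.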

How I generate lmdb data:

./create_cifar.sh

Create the 4-pixel-padded training and testing LMDBs, then create a soft link (ln -s cifar-10-batches-py) in this folder.

  • Get the Python version of CIFAR-10.
  • Use data_utils.py to generate 4-pixel-padded training and testing data. Horizontal flips and random crops are performed on the fly during training.
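
The padding and on-the-fly augmentation described above can be sketched in NumPy as follows. This is an illustration only, not the code in data_utils.py: the function names, the (N, C, H, W) layout, and zero-valued padding are assumptions. In Caffe itself, the crop and flip are typically handled by the data layer's transform_param (crop_size and mirror) rather than by Python code.

```python
import numpy as np


def pad_images(images, pad=4):
    """Zero-pad a batch shaped (N, C, H, W) by `pad` pixels on each spatial side."""
    return np.pad(images, ((0, 0), (0, 0), (pad, pad), (pad, pad)), mode="constant")


def random_crop_flip(padded, size=32, rng=None):
    """Take one random size x size crop per image and flip it horizontally half the time."""
    rng = np.random.default_rng() if rng is None else rng
    n, c, h, w = padded.shape
    out = np.empty((n, c, size, size), dtype=padded.dtype)
    for i in range(n):
        top = rng.integers(0, h - size + 1)    # inclusive range 0..h-size
        left = rng.integers(0, w - size + 1)
        crop = padded[i, :, top:top + size, left:left + size]
        if rng.random() < 0.5:
            crop = crop[:, :, ::-1]            # reverse the width axis
        out[i] = crop
    return out


# Dummy batch standing in for CIFAR-10 images (assumed 3 x 32 x 32).
batch = np.zeros((2, 3, 32, 32), dtype=np.uint8)
padded = pad_images(batch)
print(padded.shape)                    # (2, 3, 40, 40)
print(random_crop_flip(padded).shape)  # (2, 3, 32, 32)
```

Padding once offline and cropping at training time means each epoch sees a slightly different translation of every image without enlarging the stored dataset beyond 40 x 40 per image.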

Other models in Caffe

ResNet-ImageNet-Caffe
Xception-Caffe
