Comments (5)
@idimitriadis That's my fault.
In the earlier model, I did not follow the paper's methodology exactly: I used neither weight regularization nor He et al. initialization. On top of these mistakes, I also built the initial convolution with a bias, which the paper does not do.
That is why the error is occurring. The model assumes only convolution weights will be provided (no bias), but the weights file contains the bias weights as well. Simply removing the bias term from the weights file is not an option, since the prediction accuracy drops dramatically.
So I am retraining the DenseNet 40-12 model on CIFAR-10 with the correct implementation. Please wait a few days for the weights.
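To make the mismatch concrete, here is a minimal sketch in plain Python. It is not the repository's loading code: the helper names are hypothetical, and the 16-filter 3x3 first convolution on RGB input is an assumption taken from the paper's basic CIFAR setup. It only shows why a layer built without a bias cannot accept a file that stores a kernel plus a bias.

```python
# Hypothetical sketch: helper names and the 16-filter first layer are assumptions.

def conv_weight_shapes(kernel_size, in_channels, filters, use_bias):
    """Return the shapes of the arrays a 2D convolution layer stores."""
    shapes = [(kernel_size, kernel_size, in_channels, filters)]  # kernel
    if use_bias:
        shapes.append((filters,))  # one bias value per filter
    return shapes


def check_compatible(expected, saved):
    """Raise ValueError when the saved arrays do not match the layer."""
    if len(expected) != len(saved):
        raise ValueError(
            f"layer expects {len(expected)} weight array(s), "
            f"but the file provides {len(saved)}"
        )
    for exp, got in zip(expected, saved):
        if exp != got:
            raise ValueError(f"shape mismatch: expected {exp}, got {got}")


# Old weights file: the initial convolution was saved with a bias.
saved = conv_weight_shapes(3, 3, 16, use_bias=True)
# Corrected model: the same convolution, built without a bias.
expected = conv_weight_shapes(3, 3, 16, use_bias=False)

try:
    check_compatible(expected, saved)
    mismatch = None
except ValueError as err:
    mismatch = str(err)

print(mismatch)
```

Dropping the extra array from the file would make the counts agree, but the remaining kernels were trained together with that bias, which is consistent with the accuracy drop described above.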
from densenet.
Thank you for your quick response!
Keep up the good work!
from densenet.
@idimitriadis Just uploaded the final model. It gets 94.45% accuracy, compared to 94.76% reported in the paper for the DenseNet 40-12 model.
I used Adam with a low learning rate of 1e-4 and trained for only 150 epochs, since each epoch takes over 10 minutes for me. That may be why it scores slightly lower.
I've also begun implementing the faster version of DenseNet that the authors are using now. It speeds up training significantly (6 minutes per epoch), but I still have to train it on CIFAR-10 to check that the implementation is correct.
Edit:
I trained the fast DenseNet model as well. While each epoch is faster, it needs many more epochs to reach its best performance. Perhaps this is due to an implementation issue (unlike Lasagne, Keras has no dedicated ScaleLayer and BiasLayer). After approximately 250 epochs, it scored 94.29% accuracy.
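As a rough check on the trade-off, the figures quoted in this thread can be turned into total training time. This is a back-of-the-envelope sketch, assuming the 10-minute figure is per epoch and treating "over 10 minutes" and "approximately 250 epochs" as exact values.

```python
def total_hours(epochs, minutes_per_epoch):
    """Total wall-clock training time in hours."""
    return epochs * minutes_per_epoch / 60


# Figures quoted in the thread, rounded as described in the lead-in.
original_hours = total_hours(150, 10)  # original model, 94.45% accuracy
fast_hours = total_hours(250, 6)       # fast model, 94.29% accuracy

print(original_hours, fast_hours)  # 25.0 25.0
```

Under these assumptions both runs take about 25 hours, so the faster epochs are offset by the larger number of epochs needed.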
from densenet.
Hi, have you tried training on the ImageNet dataset?
from densenet.
@MrXu No, I haven't tried training on ImageNet. I don't have the GPU processing power for it.
However, there are now parameters for creating DenseNet-BC models, the variant that was used to train DenseNet on ImageNet.
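For context, the sketch below shows how the BC options (bottleneck layers and compression) change the channel counts in the three-block CIFAR configuration. It follows my reading of the paper, not this repository's code: the function and argument names are hypothetical, and the initial filter counts (16 for the basic model, twice the growth rate for BC) and the compression factor of 0.5 are assumptions taken from the paper's setup. The ImageNet models use four blocks of different sizes and are not covered here.

```python
import math


def dense_block_channels(depth, growth_rate, bottleneck, compression,
                         initial_filters, num_blocks=3):
    """Channel count at the output of each dense block (hypothetical helper)."""
    layers_per_block = (depth - 4) // num_blocks
    if bottleneck:
        # Each BC layer is a 1x1 convolution followed by a 3x3 convolution,
        # so only half of the counted layers add new feature maps.
        layers_per_block //= 2

    channels = initial_filters
    outputs = []
    for block in range(num_blocks):
        # Every layer in the block appends `growth_rate` feature maps.
        channels += layers_per_block * growth_rate
        outputs.append(channels)
        if block < num_blocks - 1:
            # The transition layer keeps a `compression` fraction of channels.
            channels = math.floor(channels * compression)
    return outputs


basic_40_12 = dense_block_channels(40, 12, bottleneck=False,
                                   compression=1.0, initial_filters=16)
bc_100_12 = dense_block_channels(100, 12, bottleneck=True,
                                 compression=0.5, initial_filters=24)

print(basic_40_12)  # [160, 304, 448]
print(bc_100_12)    # [216, 300, 342]
```

Despite being much deeper, the BC model ends with fewer channels than DenseNet 40-12, which is part of why it is the cheaper choice for larger datasets.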
from densenet.