Comments (8)
The performance may indeed be unstable. This is a common problem for object detectors, which solve a different problem at training time than at test time. NMS is not robust to small changes in box scores either.
What do you mean by «step=4000»?
from blitznet.
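To make the point about NMS and small score changes concrete, here is a minimal sketch in plain Python. It is not BlitzNet's NMS code: the `iou` and `greedy_nms` helpers, the two boxes, the scores, and the 0.5 threshold are all made up for illustration. Two heavily overlapping boxes swap places in the ranking after a 0.01 score change, and a different box survives.

```python
def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    return inter / (area_a + area_b - inter)


def greedy_nms(boxes, scores, iou_threshold=0.5):
    """Generic greedy NMS: visit boxes by descending score and keep a box
    only if it does not overlap an already kept box above the threshold."""
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep = []
    for i in order:
        if all(iou(boxes[i], boxes[k]) <= iou_threshold for k in keep):
            keep.append(i)
    return keep


# Hypothetical data: two boxes with an IoU of about 0.68.
boxes = [(0, 0, 10, 10), (1, 1, 11, 11)]
kept_a = greedy_nms(boxes, [0.80, 0.79])  # box 0 ranks first and suppresses box 1
kept_b = greedy_nms(boxes, [0.79, 0.80])  # scores swapped by 0.01: box 1 wins
print(kept_a, kept_b)
```

Since the surviving box is the one matched against ground truth, a tiny score difference between two checkpoints can flip a detection between correct and incorrect at a given IoU threshold.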
@dvornikita Thanks for your reply! There is a typo in <step=4000>; it should be <step=40000>. Is a variation of ~1% in object detection performance reasonable when using your code? I have also observed performance variations for semantic segmentation with BlitzNet. Is this related to 'shuffle_batch'?
--step is a parameter used in testing (how often you evaluate checkpoints). See config.py for details.
What you meant is probably --lr_decay (see the code snippets on how to train the model). The learning rate is decreased twice during training. For details, check the original paper.
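As a rough illustration of a learning rate that is decreased twice, here is a sketch of a piecewise-constant schedule. It is not taken from the BlitzNet code: the function name `learning_rate_at`, the base rate `1e-4`, the decay factor `0.1`, and the choice to apply the drop exactly at the boundary step are all assumptions. Only the boundaries 25000 and 35000 come from this thread.

```python
def learning_rate_at(step, base_lr=1e-4, boundaries=(25000, 35000), factor=0.1):
    """Piecewise-constant schedule: multiply base_lr by `factor` once for
    every boundary that `step` has reached.

    base_lr and factor are placeholder values, not BlitzNet's settings.
    """
    drops = sum(1 for b in boundaries if step >= b)
    return base_lr * factor ** drops


for step in (0, 24999, 25000, 35000, 40000):
    print(step, learning_rate_at(step))
```

With two boundaries the rate takes three values over the run, which matches "decreased twice".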
Thanks for your reply! I set the configuration for voc2012 to (lr_decay 25000 35000) for training, as in your paper. Did you evaluate all or some of the checkpoints (e.g. --step=2) when you reported the performance in your paper? If so, is the reported result the average or the maximum?
I see what you mean now.
We simply took the last checkpoint for each model; usually its performance was the best or among the best (within a ~0.3% range). If you train the model several times and none of the results is close enough to the reported ones, it may indicate a problem, especially if it happens consistently in other settings as well. Otherwise, it's possible to observe performance fluctuating around the reported values.
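To compare the "last checkpoint" convention with the average or maximum over checkpoints, one could summarize per-checkpoint scores as in the sketch below. The helper `summarize_checkpoints` and the numbers are hypothetical; they are not results from BlitzNet or from the paper.

```python
from statistics import mean


def summarize_checkpoints(results):
    """Summarize a metric over checkpoints.

    results: dict mapping checkpoint step -> metric value (e.g. mAP in %).
    """
    steps = sorted(results)
    values = [results[s] for s in steps]
    last = results[steps[-1]]  # the convention described above
    best = max(values)
    return {
        "last": last,
        "best": best,
        "mean": mean(values),
        "gap_last_to_best": best - last,
    }


# Made-up numbers, for illustration only.
example = {55000: 74.5, 60000: 75.0, 65000: 74.8}
summary = summarize_checkpoints(example)
print(summary)
```

A small `gap_last_to_best` across several runs would be consistent with the last checkpoint being among the best.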
Thanks a lot! I am trying to find the training problem.
Did you use warmup_lr in your training? I see that warmup_lr in the config is set to 1e-5, but warmup_step defaults to 0.
If you used warmup_lr, how many steps did you set for training the network?
Thanks a lot!
@tjulyz warmup was meant to help with some other experiments. In the end, we didn’t use it in the reported experiments (that’s why it’s set to 0).
What do you mean by “training problem”?
Thanks for your kind reply!
Because I haven't reproduced the performance, I am trying to find out whether there are problems in how I train the network.
I will close this issue soon. If I find the answer, I will discuss it again.
Thanks a lot!