GithubHelp home page GithubHelp logo

Unstable performance about blitznet HOT 8 CLOSED

dvornikita avatar dvornikita commented on July 24, 2024
Unstable performance

from blitznet.

Comments (8)

dvornikita avatar dvornikita commented on July 24, 2024

The performance may be unstable indeed. This is a common problem of object detectors that have to solve a different problem in training and test time. NMS is not robust to small changes in boxes scores either.
Why do you mean by «step=4000 »?

from blitznet.

tjulyz avatar tjulyz commented on July 24, 2024

@dvornikita Thanks for your reply! There is a typo for <step=4000>. It should be <step=40000>. Is it reasonable (~1%) for object detection performance using yours code? I have also observed the performance variations for semantic segmentation using BlitzNet. Is it related to 'shuffle_batch'?

from blitznet.

dvornikita avatar dvornikita commented on July 24, 2024

--step is a parameter used in testing (how often you evaluate checkpoints). See config.py for details.
What you meant is probably --lr_decay (see code snippets on how to train the model). The learning rate is decreased twice during the training. For details check the original paper.

from blitznet.

tjulyz avatar tjulyz commented on July 24, 2024

Thanks for your reply! I set the configurations for voc2012 as (lr_decay 25000 35000) for training as that in your paper. Have you evaluated all/some checkpoints (e.g. --step=2) when you reported your performance in your paper? If so, is the result calculated by average or max?

from blitznet.

dvornikita avatar dvornikita commented on July 24, 2024

I see what you mean now.
We simply took the last checkpoint for each model, usually, its performance was the best or among the best (in ~0.3% range). If you train the model several times and none of the results if close enough to the reported ones it may indicate a problem, especially if it happens consistently in other settings as well. Otherwise, it's possible to observe performance fluctuating around the reported values.

from blitznet.

tjulyz avatar tjulyz commented on July 24, 2024

Thanks a lot! I am trying to find the training problem.
Have you used warmup_lr in your training? I find the warmup_lr in the config is set as 1e-5, but the warmup_step is set to 0 as default.
If you use warmup_lr, how many steps have you set to train the network?
Thanks a lot!

from blitznet.

dvornikita avatar dvornikita commented on July 24, 2024

@tjulyz warmup was meant to help with some other experiments. In the end, we didn’t use it in the experiments reported (that’s why it’s set to 0)
What do you mean by “training problem”?

from blitznet.

tjulyz avatar tjulyz commented on July 24, 2024

Thanks for your kind reply!
Because I haven't reproduced the performance, I am trying to find out if there are some problems when I training the network.
I will close this issues soon. If I find the answer, I will discuss it again.
Thanks a lot!

from blitznet.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.