GithubHelp home page GithubHelp logo

Comments (15)

ujsyehao avatar ujsyehao commented on June 26, 2024 1

Here is a link 链接: https://pan.baidu.com/s/1vZONIe2pBkxjo-s5wP3zAg 密码: 3ip6.

  • step 1: unzip the file in $caffeSSD/models directory

  • step 2: modify batch size in pelee_voc/train.prototxt, I modify batch size 32 to 20 because I only have 8 GB memory, you can revert batch size to 32

  • step 3:
    cd $caffeSSD
    ./build/tools/caffe train -solver=models/pelee_voc/solver.prototxt

from pelee.

ujsyehao avatar ujsyehao commented on June 26, 2024

@MrWhiteHomeman I am trying to do it, response to you later

from pelee.

MrWhiteHomeman avatar MrWhiteHomeman commented on June 26, 2024

@ujsyehao It is so nice of you for your reply !!! I have another question about the code, in the feature_extractor.py ,the 30th line, why are there two 'stage4_tb/ext/pm2/res/relu' in the Pelee.mbox_source_layers? Can you give me some advices? Thank you!!!

from pelee.

ujsyehao avatar ujsyehao commented on June 26, 2024

You can use (http://ethereon.github.io/netscope/#/editor) to view peleenet-ssd network structure, you will find stage4_tb/ext/pm2/res layer is used twice to generate ext/pm1_mbox_loc layer and ext/pm2_mbox_loc layer(conf layer/priorbox layer is also the case).
The reason is that peleenet drops 38x38 feature map(you can view pelee paper) and just use the remaining 5 feature extracted layer(19x19, 10x10, 5x5, 3x3, 1x1), but SSD merges 6 layers' prior boxes, so author use 19x19 feature map(also known as stage4_tb/ext/pm2/res) twice to predict two conf/loc/priorbox layers.

from pelee.

ujsyehao avatar ujsyehao commented on June 26, 2024

mobile-net ssd also follows this design pattern, I will update later.

from pelee.

MrWhiteHomeman avatar MrWhiteHomeman commented on June 26, 2024

@ujsyehao Hello,I have a question about the batchsize, in this paper , the batch-size is 32, if I change the batch-size to 64, will I get a better result about testing ? I would appreciate it if you could give me some advices.
Thank you!!!

from pelee.

ujsyehao avatar ujsyehao commented on June 26, 2024

No, batch size can affect training time and has no direct relation with model performance.

from pelee.

MrWhiteHomeman avatar MrWhiteHomeman commented on June 26, 2024

@ujsyehao So,I always have a question about batchsize, if the batch size is too big, will it have a bad result? And I know the DetNet(旷世科技) , it set the batch size to 256, and get a greatest result...

from pelee.

ujsyehao avatar ujsyehao commented on June 26, 2024

You can look it

from pelee.

RainFrost1 avatar RainFrost1 commented on June 26, 2024

Could you please share the prototxt again? The link [链接: https://pan.baidu.com/s/1vZONIe2pBkxjo-s5wP3zAg 密码: 3ip6.] failed now.
Thank you very much~~~ @MrWhiteHomeman @ujsyehao

from pelee.

foralliance avatar foralliance commented on June 26, 2024

@ujsyehao
所谓的batch size大小不会影响model performance,应该仅限于模型中的BN层参数固定的情况吧.
如果BN层的参数在训练过程中也进行微调,那么batch size大小还是会影响model performance的吧.

from pelee.

ujsyehao avatar ujsyehao commented on June 26, 2024

@foralliance model performance 依赖于模型本身,batch size只是一个超参而已,如果你修改了batch size,再选定其它合适的超参比如base_lr,它最终的效果是一样的,一般而言,使用一个大batch只是训练的更快,更容易出结果,并不会从根本上决定模型的性能

from pelee.

foralliance avatar foralliance commented on June 26, 2024

@ujsyehao HI
这是之前和一位作者关于batch size的讨论

from pelee.

ujsyehao avatar ujsyehao commented on June 26, 2024

@foralliance 我看了这个回答,默认accum_batch_size固定,对于无BN层/BN层参数固定情况下,batch size不影响模型性能这个观点我是认同的

from pelee.

EvaneSnow avatar EvaneSnow commented on June 26, 2024

各位大佬好,你们谁晓得Pelee训练目标检测 + 车道线,谢谢各位

from pelee.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.