GithubHelp home page GithubHelp logo

Comments (15)

davheld avatar davheld commented on August 23, 2024 2

Try reducing the learning rate? The oscillations in that graph look fairly large. Although if you are using the default learning rate then this is unusual that you should need to change it. Also, make sure that all convolutional layers are fixed (e.g. in both streams of the network).

from goturn.

Ouya-Bytes avatar Ouya-Bytes commented on August 23, 2024 1

yet, i only use your code and prototxt( run the train.cpp, keep params of solver.prototxt and tracker.prototxt) to re-train the network, i don't change anymore. the convolutional layer is from CaffeNet, and lr_mult is set 0 no change.

from goturn.

davheld avatar davheld commented on August 23, 2024
  1. I use the pre-trained CaffeNet architecture, which is available from Caffe:
    http://caffe.berkeleyvision.org/model_zoo.html

  2. I think it probably will - if you do this then I recommend using a much smaller architecture.

  3. As mentioned above, I do not pre-train the convolutional layers myself but I take the layers pre-trained from Caffe. The two sets of convolutional layers have identical weights.

from goturn.

Ouya-Bytes avatar Ouya-Bytes commented on August 23, 2024

@davheld I according to your code(tracker.prototxt and solver.prototxt ,iteration 500000) and use the train.cpp to train the network use dataset(2014_DET and ALOV300+), the train loss value is not convergent, and oscillation between on 20 ~ 50 finally.so that have a very pool tracking performance. Can you give me some advise? Thks

from goturn.

davheld avatar davheld commented on August 23, 2024

It sounds like you are overfitting. Just to be sure - I don't train the conv layers at all, those are pre-trained using CaffeNet.

from goturn.

davheld avatar davheld commented on August 23, 2024

How do you create the pre-trained network?

from goturn.

Ouya-Bytes avatar Ouya-Bytes commented on August 23, 2024

the pretrian param from your offer address http://cs.stanford.edu/people/davheld/public/GOTURN/weights_init/tracker_init.caffemodel, i dont change the prototxt, i only want to run train.cpp code to get tracker_iter_500000.caffemodel, then can test the tracker

from goturn.

davheld avatar davheld commented on August 23, 2024

That's odd, not sure.

from goturn.

Jiangfeng-Xiong avatar Jiangfeng-Xiong commented on August 23, 2024

I have the same problem, changing val_ratio from 0.2 to 0 in "loader/loader_alov.cpp" may help, but still , model trained by myself doesn't perform as good as pre-train model.

from goturn.

Ouya-Bytes avatar Ouya-Bytes commented on August 23, 2024

not convergent? oscillation?

from goturn.

davheld avatar davheld commented on August 23, 2024

The oscillation is normal and simply occurs because the training evaluation is occurring on mini-batches which are randomly sampled at each iteration. However, the numbers that you listed seem lower than what I remember so I believe that you are overfitting, although I am not sure why.

from goturn.

Jiangfeng-Xiong avatar Jiangfeng-Xiong commented on August 23, 2024

train loss is like this train loss, range from 20 to 90 @OuYag

from goturn.

ujsyehao avatar ujsyehao commented on August 23, 2024

@OuYag your words"lr_mult is set 0 no change." I don't think it is right, lr_mult set to 0 means no learning rate.
Caffe says that we will set the weight learning rate to be the same as the learning rate given by the solver during runtime

from goturn.

freescar avatar freescar commented on August 23, 2024

@Jiangfeng-Xiong @OuYag do you solve the issue? I have the same problem, the test loss value is between 10 and 20. I guess it is overfitting, however changing lr or batchsize cannot reduce losses

from goturn.

wendianwei avatar wendianwei commented on August 23, 2024

Hi, I want to know how to evaluate your performance?

from goturn.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.