
Comments (5)

mansimov commented on June 17, 2024

Hey

Interesting. Last time I tried running it I had no problem converging and managed to train within a day or two on Titan/Titan Black GPUs.

The UCF-101 patches I uploaded here are a very small dataset due to memory constraints (the one used in the paper was much larger), so as far as I remember I managed to heavily overfit the training set.

I will try finding weights but I can't promise anything :)

from unsupervised-videos.

b3nk4n commented on June 17, 2024

Thank you for your fast answer!

I just realized I was not that precise.
Up to now I have only tried to train it on MovingMNIST. On the training data (where the ground truth is used as input to the future predictor), it is actually converging quite well. I think the training loss was down to about 170, but the validation loss (where the previously predicted frames are used as input to the future predictor) stays at about 2600. As a consequence, it can predict the first frame very well, but it does not seem to learn the overall motion that well.
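
The two modes described above (ground truth as input during training, the model's own predictions as input during validation) can be sketched roughly as follows. This is only a toy illustration: `rollout`, `step_fn`, and the scalar "frames" are hypothetical names and data, not code from unsupervised-videos.

```python
def rollout(step_fn, first_input, targets, use_ground_truth):
    """Predict len(targets) future frames with a one-step predictor.

    step_fn is a hypothetical one-step model (frame -> next frame).
    use_ground_truth=True feeds the true previous frame at each step;
    use_ground_truth=False feeds the model's own previous prediction.
    """
    preds = []
    frame = first_input
    for target in targets:
        pred = step_fn(frame)
        preds.append(pred)
        frame = target if use_ground_truth else pred
    return preds


# Toy setup: the true motion adds 1.0 per step, the imperfect model adds 0.9.
targets = [1.0, 2.0, 3.0, 4.0]
imperfect_step = lambda x: x + 0.9

with_truth = rollout(imperfect_step, 0.0, targets, use_ground_truth=True)
with_preds = rollout(imperfect_step, 0.0, targets, use_ground_truth=False)

# Per-step absolute errors for both modes.
err_truth = [abs(p - t) for p, t in zip(with_truth, targets)]
err_preds = [abs(p - t) for p, t in zip(with_preds, targets)]
```

In this toy case the first predicted frame is identical in both modes, and only the rollout that feeds back its own predictions accumulates error over later steps.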


mansimov commented on June 17, 2024

Hm, I don't remember having this issue of validation loss being way worse compared to training loss.


jbboin commented on June 17, 2024

Hi,

I'm a bit late to the party but it seems that I found the reason why this problem occurred. It was actually caused by the bug fixing that occurred in commit fae11d6. During testing, for each time step the sigmoid was applied to the inputs of the LSTM (which are the outputs of the previous stages). But in the end, the sigmoid was applied again to all outputs. This caused it to be incorrectly applied twice for the outputs at t = 0, …, T-2. My pull request (#26) fixes this problem.
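
A minimal sketch of the double application described above, assuming NumPy. The function names and the list-of-arrays layout are hypothetical and only mimic the described behaviour; this is not the repository's code or the actual change in the pull request.

```python
import numpy as np


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def outputs_with_double_sigmoid(logits):
    """Mimic the described bug on a list of T pre-sigmoid outputs."""
    outputs = [np.asarray(l, dtype=float) for l in logits]
    T = len(outputs)
    for t in range(T - 1):
        # The output at step t is squashed so it can be fed back as the
        # LSTM input for step t + 1 (stored in place, as described).
        outputs[t] = sigmoid(outputs[t])
    # The sigmoid is then applied to all outputs at the end, so the
    # steps t = 0, ..., T-2 are squashed a second time.
    return [sigmoid(o) for o in outputs]


def outputs_with_single_sigmoid(logits):
    """Apply the sigmoid exactly once to every output."""
    return [sigmoid(np.asarray(l, dtype=float)) for l in logits]


# Hypothetical pre-sigmoid outputs for T = 3 time steps.
logits = [np.array([2.0, -3.0]) for _ in range(3)]
buggy = outputs_with_double_sigmoid(logits)
fixed = outputs_with_single_sigmoid(logits)
```

Since a sigmoid of a value in (0, 1) always lies between 0.5 and about 0.73, the doubly squashed frames can never get close to 0 or 1, which would inflate a cross-entropy loss on near-binary images such as MovingMNIST.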

Running the default training mode (1 layer LSTM) on MovingMNIST now gives me a validation cross entropy of around 335 after 120k iterations, which is much more consistent with the values reported in the paper.


mansimov commented on June 17, 2024

Thanks for noticing and fixing this bug, @jbboin!

