GithubHelp home page GithubHelp logo

Comments (6)

Johnson-yue avatar Johnson-yue commented on August 19, 2024

Hi, I know this nll loss compute is from Eq (6) in paper ,but where is L2 regularization?

And Question 2:
I run your experiments/mnist_minimal_example/train.py, and then run eval.py .Both run with default configure and plot those pictures...
Screenshot from 2019-07-12 14-16-18
I think the result is very bad . and how can I reimplement your experiments like your paper?

Validation loss:
-3.348240375518799

from freia.

ardizzone avatar ardizzone commented on August 19, 2024

Hi, thank you for bringing this to our attention!

Concerning the L2 regularization: this is included in the pytorch optimizer, so we do not have to implement it by hand.

About the quality of results: You are right, they do not look good. I will investigate, and get back to you in a few days!

from freia.

ardizzone avatar ardizzone commented on August 19, 2024

I could reproduce the issue on my end, and I also found the problem:
The amplitude for the noise augmentation was a factor of 10 off.
I just pushed commit e6bc8fd to fix it.
I also tweaked some other hyper-parameters, it now trains in half the time (15 minutes, 60 epochs).

Thank you again for bringing our attention to the issue!

If you want even better results, you can also increase the number of coupling blocks to 24, and train for 480 epochs (settings used in the paper, but takes several hours).

2019-07-12_16-45-00

I will close the issue, but feel free to reopen if there are further problems.

from freia.

Johnson-yue avatar Johnson-yue commented on August 19, 2024

@ardizzone Hi, I also test for style transfer experiments like Fig.(7), so I modify the eval.py into style_transfer.py and result like this:
source-vs-rec

I think in this configuration the cINN could not decompose into 10 essentially separate subnetworks.. because in this cause, some label is not generate image

from freia.

ardizzone avatar ardizzone commented on August 19, 2024

Hi,

very interesting! For most of your samples it works fine, but as you said, some don't work at all.
This is just and issue of the training (and the model).

I just pushed the actual research code for the mnist task, used to produce the paper (some documentation and comments were still missing): 8ea0165

I also included a model checkpoint (see the readme in the experiments/mnist_cINN folder).

The model from the paper is slightly larger and trained for longer, you can test the style transfer by running eval.py in the experiments/mnist_cINN folder.

I am confident it can be made much smaller and more efficient using convolutional layers, instead of all fully connected, I may update in the future (or you can experiment yourself).

from freia.

Johnson-yue avatar Johnson-yue commented on August 19, 2024

@ardizzone Ok,thank you very much . when I test mnist experiments, I will check convolutional layers experiments like : colorization_minimal_experiment and colorization_cINN.

Thanks for your sharing again. and I will learn more for this.

from freia.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.