GithubHelp home page GithubHelp logo

Comments (2)

trougnouf avatar trougnouf commented on May 29, 2024

I can reproduce this early in training:
the side_string bytes are evaluated as above after 400 iterations:

1.400:  mse loss: 0.08312108367681503;  bpp: 0.25097793340682983;       combined loss: 0.25180914998054504
eval: entropy_bottleneck.bits(z, training=False)[0]/8: 1778.6331787109375
train: entropy_bottleneck.bits(z, training=True)[0]/8: 1778.98876953125
compress: len(entropy_bottleneck.compress(z).numpy()[0]): 2104

The bits() function decreases a bit while the length of compress is pretty much constant

Some process with optimizer2.apply_gradients(...) commented out to make sure that the entropy_bottleneck was being optimized:

1.400:  mse loss: 0.072761669754982;    bpp: 0.3175785541534424;        combined loss: 0.31830617785453796                                                                
eval: 2095.109130859375                                                                                                                                                   
train: 2095.53662109375                                                                                                                                                   
compress: 2096   

Here the overall bpp decreases but the size of the side_strings is never optimized.

from compression.

trougnouf avatar trougnouf commented on May 29, 2024

There doesn't seem to be an issue if I instantiate a new ContinuousBatchedEntropyModel with compression=True using a reference to the same prior, and save that model for inference, as is suggested in the code comments.

from compression.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.