GithubHelp home page GithubHelp logo

xyq7 / invcompress Goto Github PK

View Code? Open in Web Editor NEW
117.0 117.0 20.0 7.79 MB

[ACMMM 2021 Oral] Enhanced Invertible Encoding for Learned Image Compression

License: Apache License 2.0

Makefile 0.02% Python 3.65% C++ 0.24% Dockerfile 0.01% Batchfile 0.01% Jupyter Notebook 96.07%
pytorch

invcompress's People

Contributors

felixcheng97 avatar xyq7 avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

invcompress's Issues

Need for speed !

Hi,

Great work ! I'm currently using it in my phd research project.

If I understand the code correctly, the context model (ctx_p) is significantly slowing down the entropy coding/decoding computation...

Is there any way to make this much faster ?

Thanks.

Fail to load pretrained model

Hi, I try to load the pre-trained model and fine-tune it in my own dataset. However, there exists a bug in loading the model:

net = architectures[args.model].from_state_dict(checkpoint['state_dict'])

In this line, there has an KeyError: 'state_dict'.

It seems that the model is not saved as train.py in line 501:

save_checkpoint(
{
"epoch": epoch + 1,
"state_dict": net.state_dict(),
"loss": loss,
"optimizer": optimizer.state_dict(),
"aux_optimizer": aux_optimizer.state_dict(),
"lr_scheduler": lr_scheduler.state_dict(),
},
is_best,
os.path.join('../experiments', args.experiment, 'checkpoints', "checkpoint_%03d.pth.tar" % (epoch + 1))
)

Total patches

Hello,

Great work and awesome code. Thank you very much for sharing.
I see that total 20,745 images are present in Flickr dataset out of which 200 random images are taken away for validation, and then generated patches of size 256*256.
If I use the generate patch script, it results huge number of patches. How many patches are present in your training dataset?

Regards
Priyanka

Small bug of the network architecture

Hi Yueqi, thanks for your great work!

I am wondering about a small bug of the network architecture.
According to the paper, for the SSIM models, the para N for the first two lambda values is 128, while that for the last three lambda values is 192.
However, according to the code, the para N for the first four lambda values is 128, while that for the last four lambda values is 192.

I want to make sure that if I want to train the all five SSIM models, my training code should be:

python examples/train.py ../ -exp exp_ssim_q1_01 -m invcompress -d ../data/flickr2w --epochs 600 -lr 1e-4 --batch-size 8 -n 8 --cuda --gpu_id 7 -q 1 --lambda 6 --metrics ms-ssim --save --seed 7

python examples/train.py ../ -exp exp_ssim_q2_01 -m invcompress -d ../data/flickr2w --epochs 600 -lr 1e-4 --batch-size 8 -n 8 --cuda --gpu_id 6 -q 1 --lambda 12 --metrics ms-ssim --save --seed 7

python examples/train.py ../ -exp exp_ssim_q3_01 -m invcompress -d ../data/flickr2w --epochs 600 -lr 1e-4 --batch-size 8 -n 8 --cuda --gpu_id 5 -q 5 --lambda 40 --metrics ms-ssim --save --seed 7

python examples/train.py ../ -exp exp_ssim_q4_01 -m invcompress -d ../data/flickr2w --epochs 600 -lr 1e-4 --batch-size 8 -n 8 --cuda --gpu_id 4 -q 5 --lambda 120 --metrics ms-ssim --save --seed 7

python examples/train.py ../ -exp exp_ssim_q5_01 -m invcompress -d ../data/flickr2w --epochs 600 -lr 1e-4 --batch-size 8 -n 8 --cuda --gpu_id 3 -q 5 --lambda 220 --metrics ms-ssim --save --seed 7

Is it correct? Thank you.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.