GithubHelp home page GithubHelp logo

patch-dm's Issues

Training time

Hi, I have a question regarding the training on natural image dataset. How long did you guys trained on the natural images dataset? Thank you!

How to train LSUN Church dataset on latent diffusion model?

Thanks to your great work! I did experiment on LSUN Church dataset.Following on your paper, global conditions of this dataset is provided by a U-Net's encoder. After that, train another latent diffusion model for unconditional image synthesis by the pretrained model. But the loading model code is for your own landscape images not for low resolution images.I wonder how to train LSUN Church dataset on latent diffusion model?Sincerely looking forward to your reply.
image

Estimated Number of Epoch Required For Training

Hi Zheng and Mengqi,

Thank you for your amazing work and making your code public! I wonder if you could kindly provide some insights on my experiments below?

I am training on the truncated CelebA dataset, which only has 5000 256x256 images. And I am using CLIP embedding. My batch-size is 96.

  1. How many epoch would it take for the train.py to converge?
  2. How many epoch would it take for the train_latent.py to converge?
  3. Should I run train.py longer or train_latent.py longer when the generated images have grid-like artifacts?

train_latent error

image

I try to run train_latent on my custom dataset but I run into NameError: free variable 'patch_num_x' referenced before assignment in enclosing scope

training image size

Thank you so much for sharing the amazing work! I have a question regarding training image resolution, in img2lmdb.py I only see it converting size up to 1024. Do you support even higher resolutions such as 2k or 4k and patchify? My dataset mostly 3k resolution images. Thank you!

data_path in initialize.py

Is data_path in initialize.py the same as lmdb_path in img2lmdb.py? I use lmdb_path in img2lmdb.py as data_path in initialize.py. The error message is as follows. After a long time of debugging, the error still occurs.Could you please help me answer my question? Thank you!

PIL.UnidentifiedImageError: cannot identify image file <_io.BytesIO object at 0x7f642ed3a5e0>

Border artifacts

Hello, thank you for presenting your work.I have a question about border artifacts during training.

  • My understanding is the grid-like artifact will gradually gets better during training iterations, but still remain obvious in early training stage, is that the case during your expreiment?
  • And do you think it helps to constrain the consistency between feature collagre output (output shift) and image collage output (otuput nonshift)

Questions about position embedding

Thanks for your great work! But I have some questions about code. Is the BeatGANsAutoencModel in unet_autoenc.py counterpart with the encoder of UNet? But the tensors of pos_emb and pos_emb_new seem not to be used in the model. Does it not use the position embedding or I misunderstand? Sincerely looking forward to your answer!

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.