Training scripts about dit (CLOSED)

GeGehub commented on August 17, 2024 20
Training scripts

from dit.

Comments (5)

phcerdan commented on August 17, 2024 4

@hamzafar You can hit the Subscribe button on the right to follow this thread.

wpeebles commented on August 17, 2024 3

Hi everybody. We just added a brand new DiT PyTorch training script (train.py) to the repo. Note that you'll need to update to the latest version of the repo to use it. Sorry for the delay!

The script is not super well-tested currently; we only tried training a 256x256 DiT-XL/2 model from scratch for 90K steps on an A100 node (8x GPUs), but the loss curve looks correct (at least up to that point), and FID-50K at 50K steps is very similar to the JAX version's. If you encounter any bugs, please open a new issue and I'll try my best to take a look.

wpeebles commented on August 17, 2024 2

Update: since the training script was released, I've trained a few XL/2 and B/4 models. In all experiments the PyTorch-trained models perform very similarly to the JAX ones (sometimes better, actually). I added a bunch of info to the README. Just make sure you update to the latest version of the repo.

hamzafar commented on August 17, 2024

Following this post.

YTEP-ZHI commented on August 17, 2024

@wpeebles Thanks for sharing your incredible work.
