grimig / stabletuner

This project forked from devilismyfriend/stabletuner

Finetuning SD in style.

License: GNU Affero General Public License v3.0

Python 98.60% Batchfile 1.40%

stabletuner's Introduction



Stable Tuner, Fine-tune your SD

Join the Discord for training and chill ;)

Stable Tuner wants to be the easiest and most complete Stable Diffusion tuner :)

Features

  • For End Users - ST was made to provide a solution that is convenient but powerful on Windows. If you want to try fine-tuning, there's no better option. For Linux folks, a bash script will be added at a later date if there's enough interest.
  • Easy Installation - ST makes installation convenient. Using a bat file, ST sets up an environment ready for work and installs all the necessary components to get your training started fast!
  • Friendly GUI - ST features a full GUI to configure training runs, import and export settings, view tooltips for options, test your new model in the playground, convert the model to CKPT, and more!
  • Better Performance - Using Diffusers, Xformers, CUDNN 8.6, and Bitsandbytes along with latent caching allows for higher batch sizes and faster speeds. Higher batch sizes generally mean a better quality model!
  • A Toolbox - Use Caption Buddy to quickly generate and edit captions for your dataset in one streamlined tool. ST is building a toolbox of must-have tools for training models.
  • Fine-Tuning Mindset - ST is built to fine-tune. Unlike Dreambooth, ST is meant to fine-tune a model, providing tools and settings to make the most of your 3090/4090s. Dreambooth is still an option.
  • Filename/Caption/Token based learning - You can train using the individual file names as captions, a caption txt file, or a single token, Dreambooth (DB) style. For fine-tuning, file names and captions work best.
  • Aspect Ratio Bucketing - With aspect ratio bucketing you can use any aspect ratio or resolution for your training images. Images are sorted into buckets and resized to your chosen target resolution. Supports resolutions up to 1024!
  • Remote monitoring using Telegram - Want to keep tabs on your training? Set up a bot in Telegram and receive samples and notifications as you train.
  • Better Sampling controls - Sampling is important for gauging how your model is doing. To that end, ST lets you add sample prompts as you see fit, set the number of images to produce per prompt, send a controlled seed prompt (to gauge how a seed changes), or even use random aspect ratios to see how buckets are changing your generations!
  • Better Dataset Handling - Use dataset balancing to even out multiple concepts so they don't overpower each other, and add class images to the dataset to train on them directly. Settings can be overridden per dataset if necessary.
  • Quality of life - Many options to tune the experience to your liking: save the latent cache to avoid regenerating latents at every run, use high batch sizes to maximize training speed and performance, and use epochs instead of steps to gauge progress better!
  • Built for Diffusers - ST uses HF's Diffusers library to allow the best and fastest implementations going forward. As of now, training on SD 1.4, 1.5, 2, and 2-768 works great.
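
To make the aspect ratio bucketing idea above more concrete, here is a minimal sketch of how images could be assigned to buckets. This is not ST's actual implementation: the function names (make_buckets, assign_bucket, bucket_images), the 64-pixel step, and the 256-pixel minimum side are assumptions chosen for illustration. Only the 1024 maximum comes from the feature list.

```python
import math
from collections import defaultdict


def make_buckets(target=512, step=64, min_side=256, max_side=1024):
    """Enumerate (width, height) buckets whose area is at most target * target.

    All parameter defaults except max_side are illustrative assumptions.
    """
    max_area = target * target
    buckets = set()
    for width in range(min_side, max_side + 1, step):
        # Largest height (a multiple of step) that keeps the area within budget.
        height = min(max_side, (max_area // width) // step * step)
        if height >= min_side:
            buckets.add((width, height))
            buckets.add((height, width))  # mirrored bucket for the other orientation
    return sorted(buckets)


def assign_bucket(width, height, buckets):
    """Pick the bucket whose aspect ratio is closest (in log space) to the image's."""
    log_ratio = math.log(width / height)
    return min(buckets, key=lambda b: abs(math.log(b[0] / b[1]) - log_ratio))


def bucket_images(sizes, buckets):
    """Group image names by bucket; sizes maps name -> (width, height)."""
    groups = defaultdict(list)
    for name, (width, height) in sizes.items():
        groups[assign_bucket(width, height, buckets)].append(name)
    return dict(groups)


if __name__ == "__main__":
    buckets = make_buckets(target=512)
    print(assign_bucket(1920, 1080, buckets))  # -> (640, 384)
    print(assign_bucket(512, 512, buckets))    # -> (512, 512)
```

Each image would then be resized (and cropped if needed) to its bucket's resolution, and batches would be drawn from a single bucket at a time so that every tensor in a batch has the same shape.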

Installation

Download and install Anaconda or Miniconda, clone this repo, and run install_stabletuner.bat. When it has finished, start the app with the StableTuner.cmd file.

CUDNN 8.6

NOTICE - As of this writing, this step is no longer necessary: the installer will download CUDNN by itself. The instructions are kept here for now.

Due to the file size, I can't host the DLLs needed for CUDNN 8.6 on GitHub. I strongly advise you to download them for a speed boost in sample generation (almost 50% on a 4090). You can download them from here: CUDNN 8.6

To install, simply unzip the directory and place it in the same directory as StableTuner.cmd, then run install_stabletuner.bat and you're good to go!

Usage

Refer to the tooltips in the GUI for more information. If you have any questions, feel free to ask in the Discord.

Kudos

  • Shivam - For the original code and inspiration - Apache 2.0 License
  • Diffusers - For the latest and greatest implementations - Apache 2.0 License
  • Everydream - For the Aspect Ratio bucketing - MIT License
  • Sygil.dev - For the environment setup - AGPLv3 License
  • sd_dreambooth_extension - For the bitsandbytes files and install script
  • StabilityAI - For the latest and greatest models
  • The whole SD community - For making this possible

What's next?

  • Linux support
  • More models
  • Advanced model mixing
  • And more! :D
  • Support me on Ko-Fi and come hang out in Discord to help me decide what's next :)

stabletuner's People

Contributors

devilismyfriend, grimig
