GithubHelp home page GithubHelp logo

awesome-stable-diffusion / awesome-stable-diffusion Goto Github PK

View Code? Open in Web Editor NEW
1.4K 39.0 71.0 424 KB

Curated list of awesome resources for the Stable Diffusion AI Model.

License: Mozilla Public License 2.0

stable-diffusion ai awesome awesome-list diffusion esrgan

awesome-stable-diffusion's Introduction

Awesome Stable-Diffusion

Awesome

This is a list of software and resources for the Stable Diffusion AI model.

  • πŸ–ŠοΈ marks content that requires sign-up or account creation for a third party service outside GitHub.
  • ❓ marks content with unclear licensing conditions (e.g. lack of a license on Github)
  • πŸ’΅ marks Non-Free content: commercial content that may require any kind of payment.

Due to the fast-moving nature of the topic, entries in the list may be removed at an expedited rate until the ecosystem matures.

See Contributing.

TL;DR

The easiest way to get started for most people is to pick one of the available GUIs based on the desired platform and follow it's installation instructions.

Alternatively, most of the more developed forks (such as InvokeAI) come with their own user interfaces.

To get up and running, one usually needs:

  • A GUI or CLI based installation on a supported platform (usually NVIDIA GPU but support for Apple Silicon, AMD GPUS and even CPU exists in some branches)
  • The weights (.ckpt files) for the model downloaded from Huggingface
  • Sufficent RAM/VRAM available on the system.

Official Resources

Actively Maintained Forks and Containers

All forks listed here add additional features and optimisations and are generally faster than the original release, as they keep the model in memory rather than reloading it after every prompt. Most forks seem to remove the Safety Classifier which may present a risk if used to provide public-facing services, such as Discord bots. The two most popular forks at this point are Automatic1111 and InvokeAI

Potentially Stale/Less active branches

These branches have not been updated in more than a months and are kept here for historic purposes

  • neonsecret/stable-diffusion - Fork focusing on bigger resolutions with less vram at the expense of speed, automatically adjusting to the GPUs abilities. Also includes upscaling, facial restoration via CodeFormer and custom UI

Checkpoints and Weights

Checkpoints (.ckpt files) must be separately downloaded and are required to run Stable Diffusion. The latest stability ai release is 2.1. The last model containing NSFW concepts was 1.5 released by RunwayML.

For 2.x releases, there is a 768x768px resolution capable model trained off the base model (512x512 pixels). You need one of these models to use stable diffusion and generally want to chose the latest one that fits your needs.

Online Demos and Notebooks

Complementary Models and Tools

Tools and models for use in conjuction with Stable Diffusion

  • Civitai Model Repository - Repository for custom models and textual inversion embeddings.
  • Prompt to Prompt - Unofficial Implementation of Cross-attention-control for prompt to prompt image editing.
  • sd-prompt-graph - This is a React-based curve editor GUI for prompt interpolation animations made with Stable Diffusion.
  • DAAM - Diffusion attention attribution maps, generating heatmaps modelling the impact of specific terms and tokens in the prompt on the final diffusion result.
  • sd-concepts-library - A library of user created textual-inversion embeddings to add new concepts to stable diffusion

Training

  • Dreambooth-Stable-Diffusion - Implementation of Google's DreamBooth for stable diffusion, allowing fine-tuning of the model for specific concepts.

  • LORA - LORA (Low-rank Adaptation) for Faster Text-to-Image Diffusion Fine-tuning (UNET+CLIP), a faster, more efficient Fine tuning method requiring less RAM than Dreambooth. Huggingface demo here.

  • Stable Tuner - Stable Tuner, an easy to install Dreambooth trainer with a very comfortable user interface.

  • Stable Diffusion Trainer - Stable Diffusion trainer with scalable dataset size and hardware usage. Requires 10G of VRAM.

  • textual-inversion - Addition of personalized content to Stable Diffusion without retraining the model (Paper, Paper2).

  • Stable Dreamfusion - Text to 3D dreamfusion implementation based on stable diffusion.

GUIS

Most of these GUIS, unless mentioned otherwise in their documentation, include stable-diffusion.

  • πŸ–ŠοΈπŸ’΅ Auto SD Workflow - A UI for lstein/stable-diffusion's dream.py with optimized UX for large-scale/production workflow around image synthesis. Video Walkthrough.

  • Carefree Creator (local version) - User friendly GUI with a creator/artist centric workflow.

  • cmdr2/stable-diffusion-ui - Another, simple to use UI for windows and Linux.

  • DiffusionBee - Self contained binary app for MacOS.

  • πŸ–ŠοΈ DiffusionUI - web UI made with Vue.js inspired by Dall-e using diffusers, perfect for inpainting. Video demo

  • πŸ–ŠοΈ KIRI.ART (formerly SD-MUI) - mobile-first PWA with multiple models (incl. waifu diffusion). Run free locally or use free & paid credits on the live site. Built with React + MaterialUI. (Source Code) MIT License TypeScript

  • πŸ’΅ NMKD GUI - Windows UI, fully featured. Closed source. Pick your own price.

  • sd-webui/stable-diffusion-webui - Very active fork with optional, highly featureful Gradio UI and support for txt2img, img2img inpainting, GFPGAN, ESRGAN, weighted prompts, optimized low memory version, optional textual-inversion and more.

  • Stable Diffusion GRisk GUI - Windows GUI binary for SD. Closed source so use at your own risk.

  • Stable Diffusion Infinity - A proof of concept for outpainting with an infinite canvas interface. (requires powerful GPU).

  • Unstable Fusion - A Stable Diffusion desktop frontend with inpainting, img2img and more

  • stable-diffusion-webui-docker - A docker based frontend integrating the most popular forks.

  • πŸ–ŠοΈ Synesthetic - Prompt search engine and generator focused on remixing

Upscaling

  • BasicSR - Open-source upscaling and restoration toolbox supporting several models.
  • BSRGAN - BSRGANβ€”another upscaling solution specialized in upscaling degraded images.
  • Cupscale - GUI for running various upscaling models, including ESRGAN and RealESRGAN.
  • RealESRGAN - ESRGAN Upscaling (2x, 4x) and content restoration. Python: pip install realesrgan
  • jquesnelle/txt2imghd - A port of the GOBIG mode from progrockdiffusion, providing high quality upscaling on top of txt2img.
  • Upscale Wiki Model Database - Wiki and model database for checkpoints compatible with ESRGAN et al.

Content Restoration

  • lama-cleaner - Content aware AI inpainting tool useful for removing unwanted objects or defects from images. Python: pip install lama-cleaner
  • GFPGAN - Face Restoration GAN included in several forks for automatically fixing the face deformation commonly found in SD output.
  • CodeFormer - Another Face Restoration model (Paper).

Task Chaining

  • chaiNNer - Graphical node-based editor for chaining image processing tasks.
  • ai-art-generator - AI art generation suite combining Stable Diffusion and other models for high volume art generation.
  • dfserver distributed backend AI pipeline server for building self-hosted distributed GPU cluster to run the Stable Diffusion and various AI image or prompt building model.

Prompt Building

Prompts are the instructions given to diffusion models to manipulate their output.

  • Stable diffusion prompt book - OpenAI's stable diffusion prompt book, a very comprehensive resource on prompt engineering.

  • ai-art.com/modifiers - A visual reference guide for keywords.

  • aipromptguide.com - Visual Database of styles, modifier, artists and persons

  • πŸ–ŠοΈ aiTransformer Prompt Builder - Help build text prompts intuitively using an Image To Text tool to generate detailed text descriptions from image, and 800+ special Textual Inversion terms and often-used generic terms with sample images.

  • krea.ai - Prompt search engine that also recommends similar prompts to the one that you click on.

    • πŸ–ŠοΈ With account creation, you can like and save prompts in your own collections.
  • Lexica.art - A searchable, visual database of images and the prompts settings used to create them.

  • pharmapsychotic/clip-interrogator - Jupyter notebook uses CLIP models to suggest a prompt for images similar to a given image (Demo).

  • πŸ–ŠοΈPhraser - A visual prompt builder drawing on a database of examples. (Requires account creation)

  • πŸ–ŠοΈPrompthero = Another visual prompt builder and reference library.

  • PromptoMania - A visual prompt construction tool.

  • rom1504/clip-retrieval - Searches for prompt keywords in the datasets used in training Stable Diffusion and other models (Online GUI). Some GUIS like Automatic1111 include this functionality.

  • Stable Diffusion Prompt Generator - Gives suggestions for improving a given text prompt.

  • PromptSearch - Yet another Stable Diffusion search engine but with public API

  • Same Energy - A visual search engine that returns images that have the same 'energy'.

  • PublicPrompts - Collection of PublicPrompts

Specialized Usecases

  • dream-textures - A blender addon leveraging stable diffusion for texture creation.
  • stable-dreamfusion - An open source implementation of Google's text-to-3D dreamfusion paper with imagegen replaced by stable diffusion.

Tutorials and Comparisons

Tutorials and high quality educational resources

Getting Up and Running

Learning and Mastering

Studies

Extending Functionality

Community Resources

  • 1 week of Stable Diffusion - A curated list of Stable Diffusion services, adaptations, user interfaces and integrations.
  • pharmapsychotic.com/tools - A curated list of Tools and Resources for AI Art, including but not limited to Stable Diffusion.
  • Stable Diffusion Resources - A thorough resource for answering pressing questions about Stable Diffusion, including guides, tutorials, and best software.

Social Media

Plugins for third party apps

Commercial SaaS and apps implementing Stable Diffusion

  • πŸ–ŠοΈπŸ’΅ AI Art Generator (IOS) - iOS App to generate art using Stable Diffusion.
  • aiimagegenerator.org - Free online art generator for generating images using Stable Diffusion. Supports txt2img and img2img generation and inpainting.
  • πŸ–ŠοΈ aiTransformer - Free online image/video transformation tools, including Super Stylizer based on Stable Diffusion, supports txt2img, img2img and img2txt (clip-interrogator).
  • πŸ–ŠοΈπŸ’΅ Barium.ai - Generate PBR (physics based rendering) textures from text. Free and paid plans.
  • πŸ–ŠοΈπŸ’΅ Canva text-to-image - Text-to-image (beta) service from Canva
  • πŸ–ŠοΈπŸ’΅ Dream Studio - Online art generation service by StabilityAI, the creators of Stable Diffusion. Similar to services like DALL-E or Midjourney, this operates on a credit model with a free allowance of credits given to signed up users on a monthly basis.
  • πŸ–ŠοΈπŸ’΅ dream.ai - Online art generation service by Wombo.ai (mobile apps available).
  • πŸ–ŠοΈπŸ’΅ GimmeAPIs - Third party REST API Stable Diffusion service.
  • πŸ–ŠοΈπŸ’΅ Image Computer - Easy-to-use service aimed at non-technical people (comes with free trial credits)
  • πŸ–ŠοΈπŸ’΅ Neural.love - Another online art generator with generous free credits as of Oct 2022.
  • πŸ–ŠοΈπŸ’΅ replicate.com stable diffusion - Another SaaS offering for Stable Diffusion.
  • πŸ–ŠοΈπŸ’΅ Starry AI (IOS) - Another IOS app offering stable diffusion with preset art styles.
  • πŸ–ŠοΈ Stable Horde - Distributed stable diffusion cluster (think folding@home) with web, discord and telegram interfaces where joining with your GPU gives you priority.
  • πŸ–ŠοΈπŸ’΅ Stable Diffusion as API - Third party REST API into table Diffusion service.

The Attic - previous links, retired for inactivity

  • lowfuel/progrock-stable - Fork with optional Web GUI and a different approach to upscaling (GoBIG/ESRGAN)
    • txt2imghd - Fork of progrock diffusion that creates detailed, higher-resolution images by first generating an image from a prompt, upscaling it, then running img2img on smaller pieces of the upscaled image, and blending the results back into the original image.

awesome-stable-diffusion's People

Contributors

19wolf avatar artsparkai avatar asciidiego avatar audreyaudz avatar brycedrennan avatar capableweb avatar dymil avatar fkulla avatar gadicc avatar georgzoeller avatar h4rk8s avatar huo-ju avatar jtoy avatar justcaptcha avatar leszekhanusz avatar marckohlbrugge avatar moulai avatar nicoweio avatar ozgurshn avatar rsandx avatar sonoroboto avatar stacksize avatar theycallmeloki avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

awesome-stable-diffusion's Issues

sd-webui is a fork of https://github.com/AUTOMATIC1111/stable-diffusion-webui

As the title says, sd-webui is a fork of https://github.com/AUTOMATIC1111/stable-diffusion-webui

The Automatic version has things that the fork does not have. Like loopback and outpainting. The sd-webui fork might also have a legal problem. The original repo has no license yet. And so the fork is not on safe ground. No license means the whole code is currently under copyright. Without the right to modify and redistribute it.

Deforum notebook

May I suggest the addition of Deforum? It's a Colab notebook for Stable Diffusion with 2D, 3D, Video Input, and Interpolation animations. Includes inpainting, prompt batching, and other nice features too. Under active development with latest version 0.4 from this past weekend.
https://github.com/deforum/stable-diffusion

It's worth noting that AUTOMATIC1111's UI is NOT Free software.

The readme links to the wikipedia page for "Free Software," but auto's ui, while great, is "non-free."

from the wikipedia page:

Free software or libre software,[1][2] infrequently known as freedom-respecting software, is computer software distributed under terms that allow users to run the software for any purpose

auto's repo doesn't have any license and isn't CC, so it's wholly auto's property and not "Free Software."

Multiple updates

Sorry I am on mobile and can't do a PR.

One question:

Wouldn't it make sense to have sd-webui and AUTOMATIC1111's forks both in the same section?

And then some updates that I didn't find in your list yet:

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    πŸ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. πŸ“ŠπŸ“ˆπŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❀️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.