Stable diffusion command-line interface
For Debian-based Linux distributions
Powered by Diffusers
Version 0.7.7 alpha
Python 3.11 / 3.12
Torch 2.2.2+cu121
Features
Installation
Help
Known Issues
FAQ
- Update config.json
- Remove true/false from boolean arguments (--arg is enough)
- Update websockets server and add node_module for node.js clients
- New curses terminal menu (supports arrow keys)
- Experimental client (webui) has been removed
- New python venv package installer
- Add support for python 3.12
- Update pytorch to 2.2.2+cu121
- Update all python dependencies
- Electron removed in favor of a plain HTTP address
- New --pipeline argument (txt2img, img2img, inpaint...)
- Fix latent preview for sdxl
- Simplified seed generation for maximum compatibility
- New progress callback
- New Real-ESRGAN downloader with progress bar
- Bug fixes and code cleanup
- Fast startup
- Command-line interface
- Websockets server
- Curses terminal menu
- SD 1.5, SDXL, SDXL-Turbo
- VAE, TAESD, LoRA
- Text-to-Image, Image-to-Image, Inpaint
- Latent preview for SD/SDXL (bmp/webp)
- Upscaler Real-ESRGAN x2/x4/x4anime
- Read and write PNG with metadata
- Optimized for low specs
- CPU and GPU support
- HTTP proxy support
- JSON config file
- Extras: ComfyUI Bridge for VS Code
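The "Read and write PNG with metadata" feature stores generation parameters in PNG text chunks. Below is a stdlib-only sketch of how such tEXt chunks are written and read; the key name "parameters" is an illustrative assumption, not necessarily the key MD uses.

```python
import struct
import zlib

def png_chunk(ctype: bytes, data: bytes) -> bytes:
    """Build one PNG chunk: length, type, data, CRC of type+data."""
    return (struct.pack(">I", len(data)) + ctype + data
            + struct.pack(">I", zlib.crc32(ctype + data)))

def make_png_with_text(key: str, value: str) -> bytes:
    """Create a minimal 1x1 grayscale PNG carrying one tEXt chunk."""
    ihdr = struct.pack(">IIBBBBB", 1, 1, 8, 0, 0, 0, 0)  # 1x1, 8-bit grayscale
    idat = zlib.compress(b"\x00\x00")  # one scanline: filter byte + pixel
    text = key.encode("latin-1") + b"\x00" + value.encode("latin-1")
    return (b"\x89PNG\r\n\x1a\n"
            + png_chunk(b"IHDR", ihdr)
            + png_chunk(b"tEXt", text)
            + png_chunk(b"IDAT", idat)
            + png_chunk(b"IEND", b""))

def read_text_chunks(data: bytes) -> dict:
    """Walk the chunk list and collect tEXt key/value pairs."""
    out, pos = {}, 8  # skip the 8-byte PNG signature
    while pos + 8 <= len(data):
        length = struct.unpack(">I", data[pos:pos + 4])[0]
        ctype = data[pos + 4:pos + 8]
        if ctype == b"tEXt":
            key, _, val = data[pos + 8:pos + 8 + length].partition(b"\x00")
            out[key.decode("latin-1")] = val.decode("latin-1")
        pos += length + 12  # 4 length + 4 type + data + 4 crc
    return out

png = make_png_with_text("parameters", "a raccoon priest, 20 steps")
print(read_text_chunks(png))  # {'parameters': 'a raccoon priest, 20 steps'}
```

In practice MD's `-meta` flag does this extraction for you; the sketch only shows the chunk layout the feature relies on.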
- Install Python 3.11 / 3.12
- Install Python packages in a venv (see requirements.txt)
- Terminal menu requirements: nano, node, xdg-open
git clone https://github.com/nimadez/mental-diffusion.git
source install.sh
nano src/config.json
SD15: mdx.py -p "prompt" -c /sd15.safetensors -st 20 -g 7.5 -f img_{seed}
SDXL: mdx.py -p "prompt" -mode xl -c /sdxl.safetensors -w 1024 -h 1024 -st 30 -g 8.0 -f img_{seed}
Img2Img: mdx.py -p "prompt" -pipe img2img -i image.png -sr 0.5
Inpaint: mdx.py -p "prompt" -pipe inpaint -i image.png -m mask.png
Upscale: mdx.py -upx4 image.png
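The example invocations above can also be assembled from a script. A minimal sketch that builds a txt2img command line from the documented flags; the interpreter path is an assumption based on the install.sh venv, so adjust it if yours differs.

```python
import subprocess
from pathlib import Path

PYTHON = Path.home() / ".venv/bin/python3"  # assumed venv path from install.sh
MDX = "src/mdx.py"

def build_cmd(prompt, checkpoint, steps=20, guidance=7.5, filename="img_{seed}"):
    """Assemble an mdx.py txt2img command from the documented flags."""
    return [str(PYTHON), MDX,
            "-p", prompt,
            "-c", checkpoint,
            "-st", str(steps),
            "-g", str(guidance),
            "-f", filename]

cmd = build_cmd("a raccoon priest", "/sd15.safetensors")
print(" ".join(cmd))
# subprocess.run(cmd, check=True)  # uncomment to actually generate
```

For repeated generations, prefer `--batch` or the server mode so the checkpoint is loaded only once.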
~/.venv/bin/python3 src/mdx.py
~/.venv/bin/python3 src/mdx.py -serv
xdg-open http://localhost:port
nano tests/ws-client.js && node tests/ws-client.js
openai/clip-vit-large-patch14 (hf-cache)
laion/CLIP-ViT-bigG-14-laion2B-39B-b160k (hf-cache)
madebyollin/taesd (hf-cache)
madebyollin/taesdxl (hf-cache)
RealESRGAN_x2plus.pth (optional, src/models/realesrgan)
RealESRGAN_x4plus.pth (optional, src/models/realesrgan)
RealESRGAN_x4plus_anime_6B.pth (optional, src/models/realesrgan)
--help show this help message and exit
--server -serv start websockets server (port: config.json)
--upscaler -upx4 str /path-to-image.png, upscale image x4
--metadata -meta str /path-to-image.png, extract metadata from PNG
--model -mode str sd/xl, set checkpoint model type (def: config.json)
--pipeline -pipe str txt2img/img2img/inpaint, define pipeline (def: txt2img)
--checkpoint -c str checkpoint .safetensors path (def: config.json)
--vae -v str optional vae .safetensors path (def: null)
--lora -l str optional lora .safetensors path (def: null)
--lorascale -ls float 0.0-1.0, lora scale (def: 1.0)
--scheduler -sc str ddim, ddpm, lcm, pndm, euler_anc, euler, lms (def: config.json)
--prompt -p str positive prompt text input (def: sample)
--negative -n str negative prompt text input (def: empty)
--width -w int width value must be divisible by 8 (def: config.json)
--height -h int height value must be divisible by 8 (def: config.json)
--seed -s int seed number, -1 to randomize (def: -1)
--steps -st int steps from 1 to 100+ (def: 20)
--guidance -g float 0.0-20.0+, how closely the output follows the prompt (def: 8.0)
--strength -sr float 0.0-1.0, how strongly the input image is transformed; lower values stay closer to the original (def: 1.0)
--image -i str PNG file path or base64 PNG (def: null)
--mask -m str PNG file path or base64 PNG (def: null)
--base64 -64 do not save the image to a file, get base64 only
--filename -f str filename prefix (no png extension)
--batch -b int enter number of repeats to run in batch (def: 1)
--preview -pv enable latent preview (stepping is slower with preview enabled)
--upscale -up str auto-upscale x2, x4, x4anime (def: null)
[http server requests]
curl http://localhost:port/preview --output preview.bmp
curl http://localhost:port/replay --output preview.webp
curl http://localhost:port/progress (return { step: int, timestep: int })
curl http://localhost:port/interrupt
curl http://localhost:port/config (return config.json)
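The /progress endpoint can also be polled from code. A stdlib-only sketch, assuming the server returns the { step, timestep } JSON shown above and that the port matches config.json:

```python
import json
from urllib.request import urlopen

def parse_progress(payload: bytes):
    """Parse the { step, timestep } JSON returned by /progress."""
    data = json.loads(payload)
    return data["step"], data["timestep"]

def poll_progress(port: int):
    # One-shot query; wrap in a loop with time.sleep() to watch a whole run.
    with urlopen(f"http://localhost:{port}/progress", timeout=5) as resp:
        return parse_progress(resp.read())

# Example with a canned payload (no running server needed):
print(parse_progress(b'{ "step": 12, "timestep": 481 }'))  # (12, 481)
```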
* --server or --batch is recommended because the checkpoint does not need to be reloaded for each image
* add "{seed}" to --filename; it is replaced by the actual seed when the image is saved
* ~/ and $USER are accepted in file and directory paths
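The "{seed}" placeholder and ~/ expansion in the notes above behave roughly like this sketch; resolve_filename is a hypothetical helper, not MD's actual code.

```python
import os

def resolve_filename(template: str, seed: int, outdir: str) -> str:
    # "{seed}" in --filename is replaced by the actual seed;
    # ~/ in the output directory expands to the home directory.
    name = template.replace("{seed}", str(seed))
    return os.path.join(os.path.expanduser(outdir), name + ".png")

print(resolve_filename("img_{seed}", 1234, "~/img_output"))
```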
[config.json]
use_cpu = use cpu instead of gpu
http_proxy = bypass censorship (e.g. http://localhost:8118)
host = localhost, 192.168.1.10, 0.0.0.0
port = a valid port number
output = image output directory (e.g. ~/img_output)
onefile = whether to also keep the original image when auto-upscale is enabled
interrupt_save = whether to save the partial image after an interrupt
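Putting the keys above together, a plausible config.json might look like the fragment below. All values are illustrative only, and the real file may contain additional keys (e.g. default checkpoint, scheduler, or dimensions, which the help text says default to config.json).

```json
{
    "use_cpu": false,
    "http_proxy": null,
    "host": "localhost",
    "port": 8011,
    "output": "~/img_output",
    "onefile": false,
    "interrupt_save": true
}
```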
The experimental websockets client has been removed; see "tests/ws-client.js" for an example.
* Juggernaut Aftermath, TRCVAE, World of Origami
* A cinematic shot of a baby raccoon wearing an intricate Italian priest robe.
:: Stepping is slower with preview enabled
The BMP format is used for previews because it has no compression overhead.
Reminder: one workaround is to set "pipe._guidance_scale" to 0.0 after 40% of the steps.
:: Interrupt operation does not work
You have to wait for the current step to finish;
the interrupt is applied at the end of each step.
:: The server does not start from the terminal menu
If you are running MD for the first time and the
huggingface cache is missing, start the server using:
~/.venv/bin/python3 src/mdx.py -serv
:: CUDA out of memory error
The Linux nvidia-driver does not support system memory fallback.
:: How to load SDXL with 3.5 GB VRAM?
To load SDXL, you need at least 16 GB RAM and a swap file/partition.
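To check whether a machine meets this RAM-plus-swap requirement before loading SDXL, here is a small stdlib sketch that reads /proc/meminfo on Linux; the 16 GB threshold comes from the answer above, and the helper names are hypothetical.

```python
def parse_meminfo(text: str) -> dict:
    """Parse /proc/meminfo lines like 'MemTotal: 16316412 kB' into GiB."""
    out = {}
    for line in text.splitlines():
        key, _, rest = line.partition(":")
        parts = rest.split()
        if parts and parts[0].isdigit():
            out[key] = int(parts[0]) / (1024 * 1024)  # kB -> GiB
    return out

def enough_for_sdxl(meminfo: dict, need_gib: float = 16.0) -> bool:
    # RAM plus swap together must cover the SDXL loading peak.
    return meminfo.get("MemTotal", 0) + meminfo.get("SwapTotal", 0) >= need_gib

if __name__ == "__main__":
    try:
        info = parse_meminfo(open("/proc/meminfo").read())
        print("RAM+swap OK for SDXL:", enough_for_sdxl(info))
    except FileNotFoundError:
        print("/proc/meminfo not found (not Linux?)")
```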
:: How to download HuggingFace models in a specific path?
Use a symlink or export the path:
ln -s /path-to-huggingface-cache ~/.cache/huggingface
export HF_HOME=/path-to-huggingface-cache
↑ Migration to linux environment
↑ Back to the roots (diffusers)
↑ Ported to VS Code
↑ Switch from Diffusers to ComfyUI
↑ Upgrade from sdkit to Diffusers
↑ Undiff renamed to Mental Diffusion
↑ Undiff started with "sdkit"
↑ Created for my personal use
"AI will take us back to the age of terminals."
Code released under the MIT license.