In this example we will train a custom Stable Diffusion image-generation model on our own data, monitor its performance, and serve it as a REST API.
First, define a shell alias for the project so the commands below stay short:
alias STD_NBX="nbx projects -id 'a394e541'"
First we need to gather some training data. Create a folder (`mkdir data`) and put some images in it. Then edit the manifest.json file to list every image you want to train on along with its caption. To upload both:
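The exact manifest schema depends on what train_text_to_image_lora.py expects; the field names below (`file_name`, `text`) follow the common Hugging Face caption-dataset convention and are an assumption for illustration:

```json
{"file_name": "data/img_001.png", "text": "purple hand holding a red heart, style: @wrinkledot"}
{"file_name": "data/img_002.png", "text": "planets and stars against a black universe background"}
```

One JSON object per line, one line per training image.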
STD_NBX - artifact put_to manifest.json manifest.json
STD_NBX - artifact put_to ./data ./
Next, to fine-tune the model we will use a GPU-backed NimbleBox Job. To create the job and trigger it:
STD_NBX - run train_text_to_image_lora:main \
--resource_cpu="1000m" \
--resource_memory="6000Mi" \
--resource_disk_size="20Gi" \
--resource_gpu="nvidia-tesla-t4" \
--manifest manifest.json \
--train_batch_size=3 \
--gradient_accumulation_steps=4 \
--resolution=512 \
--mixed_precision="fp16" \
--max_train_steps=10000 \
--learning_rate=1e-04 \
--max_grad_norm=1 \
--lr_scheduler="cosine" \
--lr_warmup_steps=0 \
--checkpointing_steps=1000 \
--validation_steps 500 \
--validation_prompt 'Beautiful purple hand holding a red heart with planets, stars and black universe in the background, style: @wrinkledot'
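A quick sanity check on the hyperparameters above: with `train_batch_size=3` and `gradient_accumulation_steps=4`, each optimizer step effectively sees 12 images, so `max_train_steps=10000` corresponds to roughly 120,000 image samples. A sketch of that arithmetic:

```python
# Effective batch size under gradient accumulation: the optimizer only
# steps after `gradient_accumulation_steps` micro-batches have been seen.
train_batch_size = 3             # images per micro-batch (per GPU)
gradient_accumulation_steps = 4
max_train_steps = 10_000         # optimizer steps, not micro-batches

effective_batch_size = train_batch_size * gradient_accumulation_steps
images_seen = effective_batch_size * max_train_steps

print(effective_batch_size)  # 12
print(images_seen)           # 120000
```

If you change either knob, keep their product in mind: it is the batch size the learning rate was tuned against.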
The only difference between running your script on NimbleBox and on your local machine is that you define the resources once:
- python3 train_text_to_image_lora.py \
+ nbx projects --id '<project_id>' - run train_text_to_image_lora:main \
# only once
+ --resource_cpu="1000m" \
+ --resource_memory="6000Mi" \
+ --resource_disk_size="20Gi" \
+ --resource_gpu="nvidia-tesla-t4" \
+ --resource_gpu_count="1" \
The job will create a Relic called "dreambooth" and put all the output files there. (Coming soon) use nbox.Lmao
to monitor the model in production with a live dashboard.
To serve the model behind a public endpoint we will use GPU-backed NimbleBox Serving. To create the deployment and trigger it:
STD_NBX - serve op_server:prompt \
--resource_cpu="1000m" \
--resource_memory="6000Mi" \
--resource_disk_size="20Gi" \
--resource_gpu="nvidia-tesla-t4" \
--server_type="fastapi"
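Once the deployment is live you can call it like any other REST API. The endpoint URL, auth header, and request-body field names below are assumptions for illustration; check your deployment's page in the NimbleBox dashboard for the real values:

```python
import json
import urllib.request

# Hypothetical values -- replace with your deployment's actual URL and token.
ENDPOINT = "https://api.nimblebox.ai/<deployment_id>/prompt"
TOKEN = "<your-workspace-token>"

def build_payload(prompt: str, steps: int = 50) -> dict:
    """Request body for the served `prompt` op (field names are assumed)."""
    return {"prompt": prompt, "num_inference_steps": steps}

def generate(prompt: str) -> bytes:
    """POST the prompt to the deployment and return the raw response body."""
    req = urllib.request.Request(
        ENDPOINT,
        data=json.dumps(build_payload(prompt)).encode(),
        headers={"Content-Type": "application/json", "NBX-KEY": TOKEN},
        method="POST",
    )
    with urllib.request.urlopen(req) as resp:
        return resp.read()

if __name__ == "__main__":
    # Network call omitted here; just show the payload we would send.
    payload = build_payload("purple hand holding a red heart, style: @wrinkledot")
    print(sorted(payload))  # ['num_inference_steps', 'prompt']
```

Because `--server_type="fastapi"` was used, the server also exposes interactive docs at the deployment URL's `/docs` path by FastAPI convention.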