Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/saladtechnologies/dreambooth
A docker container for training dreambooth LoRAs, with automatic checkpointing and resuming to s3-compatible storage
- Host: GitHub
- URL: https://github.com/saladtechnologies/dreambooth
- Owner: SaladTechnologies
- License: MIT
- Created: 2024-02-09T13:57:57.000Z (9 months ago)
- Default Branch: main
- Last Pushed: 2024-05-02T16:21:52.000Z (7 months ago)
- Last Synced: 2024-05-03T02:44:54.719Z (7 months ago)
- Language: Python
- Size: 57.6 KB
- Stars: 0
- Watchers: 2
- Forks: 0
- Open Issues: 0
Metadata Files:
- Readme: readme.md
- License: LICENSE
README
# Dreambooth LoRA Training
## Train on Salad
This repo includes a script, `train_on_salad.py`, which you can customize to run SDXL Dreambooth LoRA training jobs on Salad.
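As a starting point for that customization, here is a minimal sketch of the kind of per-job overrides you might collect before submitting a job. The variable names come from the Environment Variables table in the next section; the values are placeholders, and the exact way `train_on_salad.py` attaches these to a Salad container group is an assumption rather than something documented here.

```python
# Hypothetical per-job overrides for train_on_salad.py. Variable names are from
# the Environment Variables table below; all values are placeholders, and how
# the script consumes this dict is an assumption about this repo.
job_env = {
    "PROMPT": "photo of my subject",                  # placeholder subject prompt
    "MODEL_NAME": "stabilityai/stable-diffusion-xl-base-1.0",
    "MAX_TRAIN_STEPS": "500",
    "CHECKPOINTING_STEPS": "50",
    "LEARNING_RATE": "1e-4",
    # Optional webhook so you are notified when training finishes
    "COMPLETE_WEBHOOK_URL": "https://example.com/hooks/complete",  # placeholder
    "COMPLETE_WEBHOOK_AUTH_HEADER": "Authorization",               # placeholder
    "COMPLETE_WEBHOOK_AUTH_VALUE": "Bearer YOUR_TOKEN",            # placeholder
}
```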
## Environment Variables
| Variable Name | Default Value | Description |
| ---------------------------- | ---------------------------------------- | ----------------------------------------- |
| LOG_LEVEL | INFO | Log level configuration |
| MODEL_NAME | stabilityai/stable-diffusion-xl-base-1.0 | Huggingface Hub Model Name or Path |
| INSTANCE_DIR | /images | Directory where training data is stored |
| OUTPUT_DIR | /output | Directory where training output is stored |
| VAE_PATH | madebyollin/sdxl-vae-fp16-fix | VAE model name or path |
| PROMPT | photo of timberdog | Prompt for training |
| DREAMBOOTH_SCRIPT | train_dreambooth_lora_sdxl.py | Dreambooth training script path |
| RESOLUTION | 1024 | Resolution of the images |
| MAX_TRAIN_STEPS | 500 | Total number of training steps |
| CHECKPOINTING_STEPS | 50 | Save a checkpoint after every N steps |
| LEARNING_RATE | 1e-4 | Learning rate |
| GRADIENT_ACCUMULATION_STEPS | 4 | Gradient accumulation steps |
| LR_WARMUP_STEPS | 0 | LR warmup steps |
| MIXED_PRECISION | fp16 | Mixed precision training |
| TRAIN_BATCH_SIZE | 1 | Train batch size |
| LR_SCHEDULER | constant | Learning rate scheduler |
| USE_8BIT_ADAM                | None                                     | Use the 8-bit Adam optimizer               |
| TRAIN_TEXT_ENCODER | None | Train text encoder |
| GRADIENT_CHECKPOINTING | None | Gradient checkpointing |
| WITH_PRIOR_PRESERVATION      | None                                     | Enable prior preservation loss             |
| PRIOR_LOSS_WEIGHT | 1.0 | Prior loss weight |
| CHECKPOINT_BUCKET_NAME | None | S3 bucket name for storing checkpoints |
| CHECKPOINT_BUCKET_PREFIX | None | Prefix for storing checkpoints in S3 |
| DATA_BUCKET_NAME | None | S3 bucket name for storing training data |
| DATA_BUCKET_PREFIX | None | Prefix for storing training data in S3 |
| WEBHOOK_URL | None | Webhook URL |
| PROGRESS_WEBHOOK_URL | None | Webhook URL for progress |
| COMPLETE_WEBHOOK_URL | None | Webhook URL for completion |
| WEBHOOK_AUTH_HEADER | None | Authentication header for webhook |
| PROGRESS_WEBHOOK_AUTH_HEADER | None | Auth header for progress webhook |
| COMPLETE_WEBHOOK_AUTH_HEADER | None | Auth header for completion webhook |
| WEBHOOK_AUTH_VALUE | None | Authentication value for webhook |
| PROGRESS_WEBHOOK_AUTH_VALUE | None | Auth value for progress webhook |
| COMPLETE_WEBHOOK_AUTH_VALUE | None | Auth value for completion webhook |
| SALAD_MACHINE_ID | None | Salad Machine ID |
| SALAD_CONTAINER_GROUP_ID | None | Container Group ID for Salad |
| SALAD_CONTAINER_GROUP_NAME | None | Container Group name for Salad |
| SALAD_ORGANIZATION_NAME | None | Organization name for Salad |
| SALAD_PROJECT_NAME           | None                                     | Project name for Salad                     |

Additionally, if you are using S3-compatible storage for checkpointing, you will also need to provide AWS configuration environment variables.
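For local testing outside of Salad, the same variables can be passed directly to the container. The sketch below uses the Docker SDK for Python; the image tag, bucket names, prefixes, and endpoint URL are placeholders (a published image name is not documented here), and `AWS_ENDPOINT_URL` is an assumption about how an S3-compatible endpoint would be supplied. `AWS_ACCESS_KEY_ID`, `AWS_SECRET_ACCESS_KEY`, and `AWS_DEFAULT_REGION` are the standard AWS SDK credential variables.

```python
# Minimal local-run sketch, not taken from this repo: image tag, bucket names,
# and credentials are placeholders; AWS_ENDPOINT_URL is an assumption about how
# the container is pointed at a non-AWS S3-compatible provider.
import docker

client = docker.from_env()

container = client.containers.run(
    "saladtechnologies/dreambooth:latest",  # placeholder image tag
    detach=True,
    environment={
        # Training configuration (see the table above)
        "PROMPT": "photo of my subject",
        "MAX_TRAIN_STEPS": "500",
        "CHECKPOINTING_STEPS": "50",
        # Checkpoint and training-data locations in S3-compatible storage
        "CHECKPOINT_BUCKET_NAME": "my-checkpoint-bucket",
        "CHECKPOINT_BUCKET_PREFIX": "dreambooth/run-001/",
        "DATA_BUCKET_NAME": "my-training-data",
        "DATA_BUCKET_PREFIX": "dreambooth/images/",
        # Standard AWS SDK credentials; endpoint URL assumed for S3-compatible providers
        "AWS_ACCESS_KEY_ID": "YOUR_ACCESS_KEY",
        "AWS_SECRET_ACCESS_KEY": "YOUR_SECRET_KEY",
        "AWS_DEFAULT_REGION": "us-east-1",
        "AWS_ENDPOINT_URL": "https://s3.example.com",
    },
    # Expose all host GPUs to the container
    device_requests=[docker.types.DeviceRequest(count=-1, capabilities=[["gpu"]])],
)
print(container.id)
```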