Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/IrisRainbowNeko/DreamArtist-sd-webui-extension
DreamArtist for Stable-Diffusion-webui extension
https://github.com/IrisRainbowNeko/DreamArtist-sd-webui-extension
Last synced: 2 months ago
JSON representation
DreamArtist for Stable-Diffusion-webui extension
- Host: GitHub
- URL: https://github.com/IrisRainbowNeko/DreamArtist-sd-webui-extension
- Owner: IrisRainbowNeko
- License: apache-2.0
- Created: 2022-11-12T12:17:15.000Z (about 2 years ago)
- Default Branch: master
- Last Pushed: 2023-11-08T15:39:19.000Z (about 1 year ago)
- Last Synced: 2024-05-19T16:35:21.871Z (8 months ago)
- Language: Python
- Size: 58.2 MB
- Stars: 687
- Watchers: 14
- Forks: 51
- Open Issues: 30
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- awesome-stable-diffusion-webui - DreamArtist-sd-webui-extension - Diffusion-webui extension that introduces new artistic tools and functions into the existing WebUI, enhancing its capabilities and providing a more immersive experience. (GitHub projects)
README
# DreamArtist++
***DreamArtist++ for training lora with just one image has been released, try it now:***[HCP-Diffusion](https://github.com/7eu7d7/HCP-Diffusion)
All future updates of the DreamArtist series will be released in this new framework.
# DreamArtist (webui Eextension)
Paper: [![arXiv](https://img.shields.io/badge/arXiv-2211.11337-b31b1b.svg)](https://arxiv.org/abs/2211.11337)
This repo is the official ***Stable-Diffusion-webui extension version** implementation of ***"DreamArtist: Towards Controllable One-Shot Text-to-Image Generation via Contrastive Prompt-Tuning"***
with [Stable-Diffusion-webui](https://github.com/AUTOMATIC1111/stable-diffusion-webui).Standalone version: [DreamArtist](https://github.com/7eu7d7/DreamArtist-stable-diffusion)
Everyone is an artist. Rome wasn't built in a day, but your artist dreams can be!
With just ***one*** training image DreamArtist learns the content and style in it, generating diverse high-quality images with high controllability.
Embeddings of DreamArtist can be easily combined with additional descriptions, as well as two learned embeddings.![](imgs/exp1.jpg)
![](imgs/exp_text1.jpg)
![](imgs/exp_text2.jpg)
![](imgs/exp_text3.jpg)# Setup and Running
Clone this repo to extension folder.
```bash
git clone https://github.com/7eu7d7/DreamArtist-sd-webui-extension.git extensions/DreamArtist
```## Training and Usage
First create the positive and negative embeddings in ```DreamArtist Create Embedding``` Tab.
![](imgs/create.jpg)### Preview Setting
After that, the ```names``` of the positive and negative embedding (```{name}``` and ```{name}-neg```) should be filled into the
```txt2img Tab``` with some common descriptions. This will ensure a correct preview image.
![](imgs/preview.png)### Train
Then, select positive embedding and set the parameters and image folder path in the ```DreamArtist Train``` Tab to start training.
The corresponding negative embedding is loaded automatically.
If your VRAM is low or you want save time, you can uncheck the ```reconstruction```.[Recommended parameters](https://github.com/7eu7d7/DreamArtist-sd-webui-extension#pre-trained-embeddings)
***better to train without filewords***
![](imgs/train.jpg)Remember to check the option below, otherwise the preview is wrong.
![](imgs/fromtxt.png)### Inference
Fill the trained positive and negative embedding into txt2img to generate with DreamArtist prompt.
![](imgs/gen.jpg)### Attention Mask
Attention Mask can strengthen or weaken the learning intensity of some local areas.
Attention Mask is a grayscale image whose grayscale values are related to the learning intensity show in the following table.| grayscale | 0% | 25% | 50% | 75% | 100% |
|-----------|----|-----|------|------|------|
| intensity | 0% | 50% | 100% | 300% | 500% |The Attention Mask is in the same folder as the training image and its name is the name of the training image + "_att".
You can choose whether to enable Attention Mask for training.
![](imgs/att_map.jpg)Since there is a self-attention operation in VAE, it may change the distribution of features.
In the ***Process Att-Map*** tab, it can superimpose the attention map of self-attention on the original Att-Map.### Dynamic CFG
Dynamic CFG can improve the performance, especially when the data set is large (>20).
For example, linearly from 1.5 to 3.0 (1.5-3.0), or with a 0-π/2 cycle of cosine (1.5-3.0:cos), or with a -π/2-0 cycle of cosine (1.5-3.0:cos2).
Or you can also customize non-linear functions, such as 2.5-3.5:torch.sqrt(rate), where rate is a variable from 0-1.## Tested models (need ema version):
+ Stable Diffusion v1.4
+ Stable Diffusion v1.5
+ animefull-latest
+ Anything v3.0
+ momoko-eEmbeddings can be transferred between different models of the same dataset.
## Pre-trained embeddings:
[Download](https://github.com/7eu7d7/DreamArtist-stable-diffusion/releases/tag/embeddings_v2)
| Name | Model | Image | embedding length
(Positive, Negative) | iter | lr | cfg scale |
|------------|------------------|--------------------------------------------------------------------|--------------------------------------------|-------|--------|-----------|
| ani-nahida | animefull-latest | | 3, 6 | 8000 | 0.0025 | 3 |
| ani-cocomi | animefull-latest | | 3, 6 | 8000 | 0.0025 | 3 |
| ani-gura | animefull-latest | | 3, 6 | 12000 | 0.0025 | 3 |
| ani-g | animefull-latest | | 3, 10 | 1500 | 0.003 | 5 |
| asty-bk | animefull-latest | | 3, 6 | 5000 | 0.003 | 3 |
| asty-gc | animefull-latest | | 3, 10 | 1000 | 0.005 | 5 |
| real-dog | sd v1.4 | | 3, 3 | 1000 | 0.005 | 5 |
| real-sship | sd v1.4 | | 3, 3 | 3000 | 0.003 | 5 |
| sty-cyber | sd v1.4 | | 3, 5 | 15000 | 0.0025 | 5 |
| sty-shuimo | sd v1.4 | | 3, 5 | 15000 | 0.0025 | 5 |# Style Clone
![](imgs/exp_style.jpg)# Prompt Compositions
![](imgs/exp_comp.jpg)# Comparison on One-Shot Learning
![](imgs/cmp.jpg)# Other Results
![](imgs/cnx.jpg)
![](imgs/cnx2.jpg)