Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/donahowe/Theatergen
TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation
https://github.com/donahowe/Theatergen
image-generation
Last synced: 3 months ago
JSON representation
TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation
- Host: GitHub
- URL: https://github.com/donahowe/Theatergen
- Owner: donahowe
- Created: 2024-04-25T07:38:36.000Z (9 months ago)
- Default Branch: main
- Last Pushed: 2024-06-20T03:04:34.000Z (7 months ago)
- Last Synced: 2024-08-01T18:37:16.479Z (6 months ago)
- Topics: image-generation
- Language: Python
- Homepage: https://howe140.github.io/theatergen.io/
- Size: 13.1 MB
- Stars: 47
- Watchers: 3
- Forks: 2
- Open Issues: 4
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
- awesome-diffusion-categorized - [Code
README
## Theatergen: Character Management with LLM for Consistent Multi-turn Image Generation
[π[Paper](https://arxiv.org/abs/2404.18919)] β [π©[Project Page](https://howe140.github.io/theatergen.io/)]
![Teaser figure](scripts/more.png)
## Model Architecture
![Teaser figure](scripts/model.png)## Introduction
We propose Theatergen, a tuning-free method for consistent multi-turn image generation. The key idea is to utilize LLM for character management with `layout` and `id` and customize each `character` to avoid attention leakage. We further propose the `CMIGBench` for evaluating the consistency in multi-turn image generation.## TODO
- [ ] Deployment with GPT interface
- [x] Release Benchmark
- [x] Release code## :fire: News
* **[2024.04.26]** We have released our code and benchmark## Setup
### π§ RequirementsTo install requirements:
```
pip install -r requirements.txt
```### π Generate
Generate with `CMIGBench` or replace with your own demo```
python generate.py --task story --sd_version '1.5' --dataset_path CMIGBench
```### π§ͺ Evaluate
Prepare the output in the following formatβββ output_dir
| βββ dialogue 1
| βββ turn1.png
| βββ turn2.png
| βββ turn3.png
| βββ turn4.png
| βββ dialogue 2
| ...Evalutate the generated results of `CMIGBench`
```
python CMIGBench/eval/eval.py
python CMIGBench/eval/eval_extra.py
```## π Contact Us
If you have any questions, please feel free to email us at [email protected].πππ(I am an undergraduate student actively seeking opportunities for a Ph.D. program in 25 fall.)πππ
## π‘Acknowledgement
Our work is based on [stable diffusion](https://github.com/Stability-AI/StableDiffusion), [Grounded-SAM](https://github.com/IDEA-Research/Grounded-Segment-Anything), [T2I-Adapter](https://github.com/TencentARC/T2I-Adapter), and [IP-Adapter](https://github.com/tencent-ailab/IP-Adapter). We appreciate their outstanding contributions.## Citation
If you found this code helpful, please consider citing:
~~~
@article{cheng2024theatergen,
title={TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation},
author={Cheng, Junhao and Yin, Baiqiao and Cai, Kaixin and Huang, Minbin and Li, Hanhui and He, Yuxin and Lu, Xi and Li, Yue and Li, Yifei and Cheng, Yuhao and others},
journal={arXiv preprint arXiv:2404.18919},
year={2024}
}
~~~