https://github.com/potato-kitty/objectadd

The code for our paper "ObjectAdd: Adding Objects into Image via a Training-Free Diffusion Modification Fashion"

Host: GitHub
URL: https://github.com/potato-kitty/objectadd
Owner: potato-kitty
License: mit
Created: 2024-04-26T07:07:46.000Z (over 1 year ago)
Default Branch: main
Last Pushed: 2025-01-30T04:32:27.000Z (10 months ago)
Last Synced: 2025-01-30T05:23:54.429Z (10 months ago)
Language: Python
Homepage:
Size: 146 KB
Stars: 10
Watchers: 4
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE


# ObjectAdd
This is the official implementation of our paper ["ObjectAdd: Adding Objects into Image via a Training-Free Diffusion Modification Fashion"](https://arxiv.org/abs/2404.17230#).

0. To try our example, run by_reference.ipynb

1. If you have trouble downloading the pre-trained SD model with "StableDiffusionPipeline.from_pretrained", download it manually and place it under the "CompVis" folder.

2. Before running the code, copy the three files in the "replace_diffusers" folder to ".../site-packages/diffusers/models/", replacing the original files. Remember to back up the replaced files in case you want to run other projects in this environment.

3. You can prepare your own data following the format of the txt files in the "mask_info" folder. The first line of each file is the x-coordinate of the top-left corner of the drawn box, the second is its y-coordinate, and the next two are the box's width and height. The last line is the prompt for the object you want to add. By default, our code requires the object word to be the last word of the prompt; for example, to add a running cat, write the prompt as 'A running cat' instead of 'A cat running'.
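
The file layout described in step 3 can be sketched as a small parser. This is only an illustration, not code from this repository: the names `MaskInfo`, `parse_mask_info`, and `load_mask_info` are hypothetical, and it assumes each file holds exactly five non-empty lines with numeric box values.

```python
from dataclasses import dataclass
from pathlib import Path


@dataclass
class MaskInfo:
    """One mask_info entry (hypothetical helper, not part of the repository)."""
    x: float        # x-coordinate of the box's top-left corner
    y: float        # y-coordinate of the box's top-left corner
    width: float    # box width
    height: float   # box height
    prompt: str     # prompt describing the object to add

    @property
    def object_word(self) -> str:
        # Under the default setting, the object word is the last word of the prompt.
        return self.prompt.split()[-1]


def parse_mask_info(text: str) -> MaskInfo:
    """Parse the five-line format: x, y, width, height, prompt."""
    lines = [ln.strip() for ln in text.splitlines() if ln.strip()]
    if len(lines) != 5:
        raise ValueError(f"expected 5 non-empty lines, got {len(lines)}")
    # Assumption: the four box values are plain numbers, one per line.
    x, y, width, height = (float(v) for v in lines[:4])
    return MaskInfo(x, y, width, height, lines[4])


def load_mask_info(path) -> MaskInfo:
    """Read and parse a single txt file from the mask_info folder."""
    return parse_mask_info(Path(path).read_text(encoding="utf-8"))
```

For a file containing `100`, `150`, `200`, `180`, and `A running cat` on successive lines, `parse_mask_info` yields a box at (100, 150) of size 200 by 180, and `object_word` returns `cat`. Checking `object_word` before running is a quick way to catch prompts such as 'A cat running', where the object is not in the last position.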

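The backup-and-replace procedure in step 2 could be scripted roughly as follows. This is a minimal sketch under stated assumptions: `replace_with_backup` and `restore_backup` are hypothetical helpers that do not exist in this repository, and the caller supplies all three directories.

```python
import shutil
from pathlib import Path


def replace_with_backup(src_dir, dst_dir, backup_dir):
    """Copy every file in src_dir over its namesake in dst_dir,
    saving the original to backup_dir first. Hypothetical helper."""
    src_dir, dst_dir, backup_dir = Path(src_dir), Path(dst_dir), Path(backup_dir)
    backup_dir.mkdir(parents=True, exist_ok=True)
    replaced = []
    for src in sorted(p for p in src_dir.iterdir() if p.is_file()):
        target = dst_dir / src.name
        backup = backup_dir / src.name
        # Keep only the first backup, so re-running never overwrites
        # the pristine original with an already patched file.
        if target.exists() and not backup.exists():
            shutil.copy2(target, backup)
        shutil.copy2(src, target)
        replaced.append(src.name)
    return replaced


def restore_backup(dst_dir, backup_dir):
    """Copy the backed-up originals from backup_dir back into dst_dir."""
    dst_dir, backup_dir = Path(dst_dir), Path(backup_dir)
    restored = []
    for backup in sorted(p for p in backup_dir.iterdir() if p.is_file()):
        shutil.copy2(backup, dst_dir / backup.name)
        restored.append(backup.name)
    return restored
```

One way to find the destination directory is `Path(diffusers.__file__).parent / "models"` after `import diffusers`; `src_dir` would then be the "replace_diffusers" folder, and `backup_dir` any location outside site-packages. Calling `restore_backup` with the same directories returns the environment to its original state for other projects.
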
# Reference
Parts of our code are based on the following two projects:
1. [prompt-to-prompt](https://github.com/google/prompt-to-prompt)
```bibtex
@article{hertz2022prompt,
title = {Prompt-to-Prompt Image Editing with Cross Attention Control},
author = {Hertz, Amir and Mokady, Ron and Tenenbaum, Jay and Aberman, Kfir and Pritch, Yael and Cohen-Or, Daniel},
journal = {arXiv preprint arXiv:2208.01626},
year = {2022},
}
```
2. [layout-guidance](https://github.com/silent-chen/layout-guidance)
```bibtex
@article{chen2023trainingfree,
title={Training-Free Layout Control with Cross-Attention Guidance},
author={Minghao Chen and Iro Laina and Andrea Vedaldi},
journal={arXiv preprint arXiv:2304.03373},
year={2023}
}
```
# Citation
Please cite our paper if you find this code helpful. Thanks!
```bibtex
@article{zhang2024objectadd,
title={ObjectAdd: Adding Objects into Image via a Training-Free Diffusion Modification Fashion},
author={Zhang, Ziyue and Lin, Mingbao and Ji, Rongrong},
journal={arXiv preprint arXiv:2404.17230},
year={2024}
}
```
