https://github.com/Meaoxixi/FilterPrompt

FilterPrompt: Guiding Image Transfer in Diffusion Models
https://github.com/Meaoxixi/FilterPrompt

Last synced: 8 months ago
JSON representation

FilterPrompt: Guiding Image Transfer in Diffusion Models

Host: GitHub
URL: https://github.com/Meaoxixi/FilterPrompt
Owner: Meaoxixi
Created: 2024-03-16T07:53:30.000Z (over 1 year ago)
Default Branch: main
Last Pushed: 2024-06-11T09:34:19.000Z (over 1 year ago)
Last Synced: 2024-08-01T18:37:59.760Z (over 1 year ago)
Homepage: https://meaoxixi.github.io/FilterPrompt/
Size: 132 MB
Stars: 5
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

awesome-diffusion-categorized - [Code

README

# ___***FilterPrompt: A Simple yet Efficient Approach to Guide Image Appearance Transfer in Diffusion Models***___
**Xi Wang**, Yichen Peng, Heng Fang, Yilin Wang, Haoran Xie, Xi Yang, Chuntao Li

We propose FilterPrompt, an approach to enhance the model control effect. It can be universally applied to any diffusion model, allowing users to adjust the representation of specific image features in accordance with task requirements, thereby facilitating more precise and controllable generation outcomes. In particular, our designed experiments demonstrate that the FilterPrompt optimizes feature correlation, mitigates content conflicts during the generation process, and enhances the model's control capability.

![arch](https://raw.githubusercontent.com/Meaoxixi/FilterPrompt/gh-pages/resources/method_diagram.png)

---
## 📝 Changelog
- [x] 2024.04.20: The arXiv paper of [FilterPrompt](https://arxiv.org/abs/2404.13263) is online.
- [x] 2024.05.01: [Project-Page](https://meaoxixi.github.io/FilterPrompt/) of FilterPrompt.
- [ ] Release the code.
- [ ] Public [Demo](https://huggingface.co/spaces/Meaowangxi/FilterPrompt-demo) for users to try FilterPrompt online.

---
## Prerequisites
- We recommend running this repository using [Anaconda](https://docs.anaconda.com/anaconda/install/).
- NVIDIA GPU (Available memory is greater than 20GB)
- CUDA CuDNN (version ≥ 11.1, we actually use 11.7)
- Python 3.11.3 (Gradio requires Python 3.8 or higher)
- PyTorch: [Find the torch version that is suitable for the current cuda](https://pytorch.org/get-started/previous-versions/)
- 【example】:`pip install torch==2.0.0+cu117 torchvision==0.15.1+cu117 torchaudio==2.0.1+cu117 --extra-index-url https://download.pytorch.org/whl/cu117`

## Installation
Specifically, inspired by the concept of decoupled cross-attention in [IP-Adapter](https://ip-adapter.github.io/), we apply a similar methodology.
Please follow the instructions below to complete the environment configuration required for the code:
- Cloning this repo
```
git clone --single-branch --branch main https://github.com/Meaoxixi/FilterPrompt.git
```
- Dependencies

All dependencies for defining the environment are provided in `requirements.txt`.
```
cd FilterPrompt
conda create --name fp_env python=3.11.3
conda activate fp_env
pip install torch==2.0.0+cu117 torchvision==0.15.1+cu117 torchaudio==2.0.1+cu117 --extra-index-url https://download.pytorch.org/whl/cu117
pip install -r requirements.txt
```
- Download the necessary modules in the relative path `models/` from the following links

| Path | Description |
|:---------------------------------------------------------------------------------------------------------------------|:------------------------------------------------------------------------------------------------------------------------|
| `models/` | root path |
|     ├── `ControlNet/` | Place the pre-trained model of [ControlNet](https://huggingface.co/lllyasviel) |
|            ├── `control_v11f1p_sd15_depth ` | [ControlNet_depth](https://huggingface.co/lllyasviel/control_v11f1p_sd15_depth/tree/main) |
|            └── `control_v11p_sd15_softedge` | [ControlNet_softEdge](https://huggingface.co/lllyasviel/control_v11p_sd15_softedge/tree/main) |
|     ├── `IP-Adapter/` | [IP-Adapter](https://huggingface.co/h94/IP-Adapter/tree/main/models) |
|            ├── `image_encoder ` | image_encoder of IP-Adapter |
|            └── `other needed configuration files` | |
|     ├── `sd-vae-ft-mse/` | Place the model of [sd-vae-ft-mse](https://huggingface.co/stabilityai/sd-vae-ft-mse/tree/main) |
|     ├── `stable-diffusion-v1-5/` | Place the model of [stable-diffusion-v1-5](https://huggingface.co/runwayml/stable-diffusion-v1-5) |
|     ├── `Realistic_Vision_V4.0_noVAE/` | Place the model of [Realistic_Vision_V4.0_noVAE](https://huggingface.co/SG161222/Realistic_Vision_V4.0_noVAE/tree/main) |

## Demo on Gradio

After installation and downloading the models, you can use `python app.py` to perform code in gradio. We have designed four task types to facilitate you to experience the application scenarios of FilterPrompt.

## Citation
If you find [FilterPrompt](https://arxiv.org/abs/2404.13263) helpful in your research/applications, please cite using this BibTeX:
```bibtex
@misc{wang2024filterprompt,
title={FilterPrompt: Guiding Image Transfer in Diffusion Models},
author={Xi Wang and Yichen Peng and Heng Fang and Haoran Xie and Xi Yang and Chuntao Li},
year={2024},
eprint={2404.13263},
archivePrefix={arXiv},
primaryClass={cs.CV}
}
```

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/Meaoxixi/FilterPrompt

Awesome Lists containing this project

README