## Source Prompt Disentangled Inversion for Boosting Image Editability with Diffusion Models

>[Ruibin Li](https://github.com/leeruibin)<sup>1</sup> | [Ruihuang Li](https://scholar.google.com/citations?user=8CfyOtQAAAAJ&hl=zh-CN)<sup>1</sup> | [Song Guo](https://scholar.google.com/citations?user=Ib-sizwAAAAJ&hl=en)<sup>2</sup> | [Lei Zhang](https://www4.comp.polyu.edu.hk/~cslzhang/)<sup>1*</sup>

><sup>1</sup>The Hong Kong Polytechnic University, <sup>2</sup>The Hong Kong University of Science and Technology.

>In ECCV2024

## 🔎 Framework overview

Pipelines of different inversion methods in text-driven editing. (a) DDIM inversion inverts a real image into a latent noise code, but the inverted code often leads to a large reconstruction gap $D_{Rec}$ under higher CFG parameters. (b) NTI optimizes the null-text embedding to narrow the reconstruction gap $D_{Rec}$, while NPI further improves the speed of NTI. (c) DirectInv records the differences between the inversion features and the reconstruction features, and merges them back to achieve high-quality reconstruction. (d) Our SPDInv instead minimizes the noise gap $D_{Noi}$ rather than $D_{Rec}$, which reduces the influence of the source prompt on the editing process and thus alleviates the artifacts and inconsistent details encountered by previous methods.

![SPDInv](figures/methods.png)
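
The idea in (d) can be viewed as a damped fixed-point iteration on each inversion step. The snippet below is a minimal sketch of that idea, not the authors' implementation: `eps_model`, the iteration count, and the step size are placeholders, and a real run would use the Stable Diffusion UNet conditioned on the source prompt.

```
# Minimal sketch (not the official code) of the fixed-point refinement
# behind SPDInv: at each DDIM inversion step, refine z_t so that it
# satisfies the inversion equation with the noise predicted at z_t itself,
# shrinking the noise gap D_Noi.
import torch

def ddim_inv_target(z_prev, eps, alpha_prev, alpha_t):
    # DDIM inversion update: express z_t from z_{t-1} and the predicted noise.
    a = (alpha_t / alpha_prev) ** 0.5
    return a * z_prev + ((1 - alpha_t) ** 0.5 - a * (1 - alpha_prev) ** 0.5) * eps

def spdinv_step(eps_model, z_prev, t, alpha_prev, alpha_t, n_iters=15, step=0.5):
    with torch.no_grad():
        # Initialise with the standard DDIM inversion step (noise taken at z_prev).
        z_t = ddim_inv_target(z_prev, eps_model(z_prev, t), alpha_prev, alpha_t)
        # Damped fixed-point refinement: pull z_t toward the target computed
        # with noise predicted at z_t itself, i.e. shrink ||z_t - f(z_t)||.
        for _ in range(n_iters):
            target = ddim_inv_target(z_prev, eps_model(z_t, t), alpha_prev, alpha_t)
            z_t = z_t + step * (target - z_t)
    return z_t

# Toy usage with a stand-in noise predictor; a real run would call the UNet
# conditioned on the source prompt instead.
if __name__ == "__main__":
    eps_model = lambda z, t: 0.1 * z     # placeholder for UNet(z, t, prompt)
    z0 = torch.randn(1, 4, 64, 64)       # Stable Diffusion latent shape
    z1 = spdinv_step(eps_model, z0, t=1, alpha_prev=0.999, alpha_t=0.997)
    print(z1.shape)                      # torch.Size([1, 4, 64, 64])
```

Since the target is held fixed inside each refinement step, no gradients flow through the noise predictor, keeping the per-step cost close to plain DDIM inversion.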

## ⚙️ Dependencies and Installation
```
# clone this repository
git clone https://github.com/leeruibin/SPDInv.git
cd SPDInv

# create an environment with python >= 3.8
conda env create -f environment.yaml
conda activate SPDInv
```
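
To sanity-check the install, a quick version probe (assuming the environment pins both PyTorch and diffusers) is:

```
python -c "import torch, diffusers; print(torch.__version__, diffusers.__version__)"
```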

## 🚀 Quick Inference

#### Run P2P with SPDInv

```
python run_SPDInv_P2P.py --input xxx --source [source prompt] --target [target prompt] --blended_word "word1 word2"
```
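
For instance, a hypothetical invocation (the image path, prompts, and blended words below are illustrative, not shipped with the repo) could look like:

```
python run_SPDInv_P2P.py \
    --input images/cat.png \
    --source "a cat sitting on the grass" \
    --target "a tiger sitting on the grass" \
    --blended_word "cat tiger"
```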

#### Run MasaCtrl with SPDInv
```
python run_SPDInv_MasaCtrl.py --input xxx --source [source prompt] --target [target prompt]
```

#### Run PNP with SPDInv
To run PNP, you should first upgrade diffusers to 0.17.1:

```
pip install diffusers==0.17.1
```
Then you can run:
```
python run_SPDInv_PNP.py --input xxx --source [source prompt] --target [target prompt]
```

#### Run ELITE with SPDInv
For ELITE, you should first download the pre-trained [global_mapper.pt](https://drive.google.com/drive/folders/1VkiVZzA_i9gbfuzvHaLH2VYh7kOTzE0x?usp=sharing) checkpoint provided by the ELITE authors and put it into the `checkpoints` folder.
```
python run_SPDInv_ELITE.py --input xxx --source [source prompt] --target [target prompt]
```
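
Based on the instructions above, the expected layout would be the following (the exact path is our assumption; check the script's default if it differs):

```
SPDInv/
└── checkpoints/
    └── global_mapper.pt
```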

## 📷 Editing cases with P2P, MasaCtrl, PNP, ELITE

#### Editing cases with P2P

*(figure: editing results with SPDInv + P2P)*

#### Editing cases with MasaCtrl

*(figure: editing results with SPDInv + MasaCtrl)*

#### Editing cases with PNP

*(figure: editing results with SPDInv + PNP)*

#### Editing cases with ELITE

*(figure: editing results with SPDInv + ELITE)*

## Citation

```
@inproceedings{li2024source,
  title={Source Prompt Disentangled Inversion for Boosting Image Editability with Diffusion Models},
  author={Li, Ruibin and Li, Ruihuang and Guo, Song and Zhang, Lei},
  booktitle={European Conference on Computer Vision},
  year={2024}
}
```

## Acknowledgements

This code is built on the [diffusers](https://github.com/huggingface/diffusers/) implementation of [Stable Diffusion](https://github.com/CompVis/stable-diffusion).

Meanwhile, the code draws heavily on [Prompt-to-Prompt](https://github.com/google/prompt-to-prompt), [Null-Text Inversion](https://github.com/google/prompt-to-prompt), [MasaCtrl](https://github.com/TencentARC/MasaCtrl), [ProxEdit](https://github.com/phymhan/prompt-to-prompt), [ELITE](https://github.com/csyxwei/ELITE), [Plug-and-Play](https://github.com/MichalGeyer/plug-and-play), and [DirectInversion](https://github.com/cure-lab/PnPInversion). Thanks to all the contributors!