https://github.com/zewei-Zhang/GoodDrag

Ofiicial GoodDrag implementation.
https://github.com/zewei-Zhang/GoodDrag

Last synced: 3 months ago
JSON representation

Ofiicial GoodDrag implementation.

Host: GitHub
URL: https://github.com/zewei-Zhang/GoodDrag
Owner: zewei-Zhang
License: apache-2.0
Created: 2024-04-02T19:33:00.000Z (over 1 year ago)
Default Branch: main
Last Pushed: 2024-04-17T14:20:50.000Z (over 1 year ago)
Last Synced: 2024-08-09T14:31:38.577Z (about 1 year ago)
Language: Python
Size: 113 MB
Stars: 83
Watchers: 4
Forks: 6
Open Issues: 3
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

awesome-diffusion-categorized - [Code

README

GoodDrag: Towards Good Practices for Drag Editing with Diffusion Models

Zewei Zhang

Huan Liu

Jun Chen

Xiangyu Xu

## 📢 Latest Updates
- **2024.04.17** - Updated DAI (Dragging Accuracy Index) and GScore (Gemini Score) evaluation methods. Please check the evaluation file. GScore is modified from [Generative AI](https://github.com/GoogleCloudPlatform/generative-ai/tree/main).
- **2025.03.01** - Our paper is accepted by ICLR2025!

## 1. Getting Started with GoodDrag

Before getting started, please make sure your system is equipped with a CUDA-compatible GPU and Python 3.9 or higher. We provide three methods to directly run GoodDrag:
### 1️⃣ Automated Script for Effortless Setup

- **Windows Users:** Double-click **webui.bat** to automatically set up your environment and launch the GoodDrag web UI.
- **Linux Users:** Run **webui.sh** for a similar one-step setup and launch process.

### 2️⃣ Manual Installation via pip
1. Install the necessary dependencies:

```bash
pip install -r requirements.txt
```
2. Launch the GoodDrag web UI:

```bash
python gooddrag_ui.py
```

### 3️⃣ Quick Start with Colab
For a quick and easy start, access GoodDrag directly through Google Colab. Click the badge below to open a pre-configured notebook that will guide you through using GoodDrag in the Colab environment:

### Runtime and Memory Requirements
GoodDrag's efficiency depends on the image size and editing complexity. For a 512x512 image on an A100 GPU: the LoRA phase requires ~17 seconds, and drag editing takes around 1 minute. GPU memory requirement is below 13GB.

## 2. Parameter Description

We have predefined a set of parameters in the GoodDrag WebUI. Here are a few that you might consider adjusting:
| Parameter Name | Description |
| -------------- | ------------------------------------------------------------ |
| Learning Rate | Influences the speed of drag editing. Higher values lead to faster editing but may result in lower quality or instability. It is recommended to keep this value below 0.05. |
| Prompt | The text prompt for the diffusion model. It is suggested to leave this empty. |
| End time step | Specifies the length of the time step during the denoise phase of the diffusion model for drag editing. If good results are obtained early in the generated video, consider reducing this value. Conversely, if the drag editing is insufficient, increase it slightly. It is recommended to keep this value below 12. |
| Lambda | Controls the consistency of the non-dragged regions with the original image. A higher value keeps the area outside the mask more in line with the original image. |

## 3. Acknowledgments
Part of the code was based on [DragDiffusion](https://github.com/Yujun-Shi/DragDiffusion) and [DragGAN](https://github.com/XingangPan/DragGAN). Thanks for the great work!

## 4. BibTeX
```bibtex
@inproceedings{zhang2025gooddrag,
title={GoodDrag: Towards Good Practices for Drag Editing with Diffusion Models},
author={Zewei Zhang and Huan Liu and Jun Chen and Xiangyu Xu},
booktitle={The Thirteenth International Conference on Learning Representations},
year={2025},
}
```

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/zewei-Zhang/GoodDrag

Awesome Lists containing this project

README

GoodDrag: Towards Good Practices for Drag Editing with Diffusion Models