Official PyTorch Implementation for Readout Guidance, CVPR 2024
- Host: GitHub
- URL: https://github.com/google-research/readout_guidance
- Owner: google-research
- License: apache-2.0
- Created: 2024-02-07T17:15:21.000Z (12 months ago)
- Default Branch: main
- Last Pushed: 2024-10-03T07:04:48.000Z (4 months ago)
- Last Synced: 2025-01-13T00:07:53.117Z (13 days ago)
- Language: Jupyter Notebook
- Homepage: https://readout-guidance.github.io
- Size: 22.7 MB
- Stars: 136
- Watchers: 4
- Forks: 9
- Open Issues: 5
Metadata Files:
- Readme: README.md
- Contributing: CONTRIBUTING.md
- License: LICENSE
Awesome Lists containing this project
- awesome-diffusion-categorized
README
# 🔮 Readout Guidance: Learning Control from Diffusion Features
**Grace Luo, Trevor Darrell, Oliver Wang, Dan B Goldman, Aleksander Holynski**

This repository contains the PyTorch implementation of Readout Guidance: Learning Control from Diffusion Features.
*This is not an officially supported Google product.*
[[`Project Page`](https://readout-guidance.github.io)][[`arXiv`](https://arxiv.org/abs/2312.02150)]
## Releases
- 🚀 2024/04/26: Additional code for pose estimation with readout heads in the [readout_pose](readout_pose) directory.
- 🚀 2024/01/31: Initial codebase release with demos for drag-based manipulation and spatial control, as well as readout head training code. Includes weights for SDXL and SDv1-5 readout heads for appearance, correspondence, depth, edge, and pose.

## Setup
This code was tested with Python 3.8. To install the necessary packages, please run:
```
conda env create -f environment.yml
conda activate readout
```
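After activating the environment, a quick way to confirm the install is to check the PyTorch build and GPU visibility. This is only a sanity-check sketch; it assumes the environment installs PyTorch with optional CUDA support, which the demos and guidance scripts rely on in practice:
```
# Minimal sanity check after `conda activate readout`
# (assumes the environment installs PyTorch, which the demos require).
import torch

print("PyTorch:", torch.__version__)
print("CUDA available:", torch.cuda.is_available())
```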
## Readout Heads

All model weights can be found on our [HuggingFace page](https://huggingface.co/g-luo/readout-guidance/tree/main/weights). To automatically download the weights, run:
```
./download_weights.sh
```

| Readout Head Type | SDv1-5 | SDXL |
|----------|----------|----------|
| Pose Head | [download](https://huggingface.co/g-luo/readout-guidance/resolve/main/weights/readout_sdv15_spatial_pose.pt?download=true) | [download](https://huggingface.co/g-luo/readout-guidance/resolve/main/weights/readout_sdxl_spatial_pose.pt?download=true) |
| Depth Head | [download](https://huggingface.co/g-luo/readout-guidance/resolve/main/weights/readout_sdv15_spatial_depth.pt?download=true) | [download](https://huggingface.co/g-luo/readout-guidance/resolve/main/weights/readout_sdxl_spatial_depth.pt?download=true) |
| Edge Head | [download](https://huggingface.co/g-luo/readout-guidance/resolve/main/weights/readout_sdv15_spatial_edge.pt?download=true) | [download](https://huggingface.co/g-luo/readout-guidance/resolve/main/weights/readout_sdxl_spatial_edge.pt?download=true) |
| Correspondence Feature Head | [download](https://huggingface.co/g-luo/readout-guidance/resolve/main/weights/readout_sdv15_drag_correspondence.pt?download=true) | [download](https://huggingface.co/g-luo/readout-guidance/resolve/main/weights/readout_sdxl_drag_correspondence.pt?download=true) |
| Appearance Similarity Head | [download](https://huggingface.co/g-luo/readout-guidance/resolve/main/weights/readout_sdv15_drag_appearance.pt?download=true) | [download](https://huggingface.co/g-luo/readout-guidance/resolve/main/weights/readout_sdxl_drag_appearance.pt?download=true) |
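If you only need a single checkpoint, you can also pull it directly from the Hub instead of running the script. The sketch below is not part of the repository's tooling; it simply uses `huggingface_hub` with the repo id and file path that appear in the links above (swap in any other entry from the table):
```
# Sketch: download one readout head checkpoint directly from HuggingFace.
# The repo id and filename are taken from the table links above.
from huggingface_hub import hf_hub_download

ckpt_path = hf_hub_download(
    repo_id="g-luo/readout-guidance",
    filename="weights/readout_sdv15_spatial_pose.pt",
)
print(ckpt_path)  # local path to the cached checkpoint
```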
## Demos

Note that the generation process is non-deterministic, even without Readout Guidance, so re-running the same cell or script with the exact same settings can yield better results. For more repeatable comparisons, see the seeding sketch after this list.

- `demo_drag.ipynb`: This demo walks through drag-based manipulation on either real images or generated images, where the user can also annotate the desired drags.
- `demo_spatial.ipynb`: This demo walks through spatial control with the pose head on pose inputs derived from MSCOCO images.
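If you want side-by-side comparisons to be more repeatable while experimenting, one option is to fix the random seeds yourself. This is a generic PyTorch seeding helper, not a flag that the demos or scripts expose, so treat it as an assumption about your own workflow:
```
# Generic seeding helper for more repeatable runs (not provided by this repo).
import random

import numpy as np
import torch

def seed_everything(seed: int = 0):
    random.seed(seed)
    np.random.seed(seed)
    torch.manual_seed(seed)
    torch.cuda.manual_seed_all(seed)

seed_everything(42)
```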
## Generation Scripts

You can also automatically generate many samples using the following scripts.
```
conda activate readout

# Run drag-based manipulation on samples in data/drag/real
python3 script_drag.py configs/drag_real.yaml

# Run spatial control on samples in data/spatial/pose
python3 script_spatial.py configs/spatial.yaml
```

## Training Code
To train your own readout heads, please check out `readout_training/README.md`.

## Citing
```
@inproceedings{luo2024readoutguidance,
  title={Readout Guidance: Learning Control from Diffusion Features},
  author={Grace Luo and Trevor Darrell and Oliver Wang and Dan B Goldman and Aleksander Holynski},
  booktitle={CVPR},
  year={2024}
}
```