
# In-Domain GAN Inversion for Real Image Editing

![Python 3.6](https://img.shields.io/badge/python-3.6-green.svg?style=plastic)
![TensorFlow 1.12.2](https://img.shields.io/badge/tensorflow-1.12.2-green.svg?style=plastic)
![Keras 2.2.4](https://img.shields.io/badge/keras-2.2.4-green.svg?style=plastic)

![image](./docs/assets/teaser.jpg)

**Figure:** *Real image editing using the proposed In-Domain GAN inversion with a fixed GAN generator.*

> **In-Domain GAN Inversion for Real Image Editing** <br>
> Jiapeng Zhu*, Yujun Shen*, Deli Zhao, Bolei Zhou <br>
> *European Conference on Computer Vision (ECCV) 2020*

In this repository, we propose an in-domain GAN inversion method that not only faithfully reconstructs the input image but also ensures that the inverted code is **semantically meaningful** for editing. In-domain GAN inversion consists of two steps (sketched conceptually below):

1. Training **domain-guided** encoder.
2. Performing **domain-regularized** optimization.
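
The snippet below is a minimal, self-contained TensorFlow 1.x sketch of how these two steps fit together. The tiny dense layers `G`, `E`, and `F` are only stand-ins for the pre-trained StyleGAN generator, the domain-guided encoder, and the VGG16 perceptual network, and the loss weights are illustrative; the actual implementation lives in `invert.py` and the training code.

```python
import numpy as np
import tensorflow as tf

# Stand-ins for the real networks (in practice these are loaded from the
# pre-trained .pkl files); shapes are shrunk for brevity.
def G(z):  # generator: latent code -> (flattened) image
    return tf.layers.dense(z, 64 * 64 * 3, name='G', reuse=tf.AUTO_REUSE)

def E(x):  # domain-guided encoder: image -> latent code
    return tf.layers.dense(x, 512, name='E', reuse=tf.AUTO_REUSE)

def F(x):  # perceptual (VGG16) feature extractor
    return tf.layers.dense(x, 128, name='F', reuse=tf.AUTO_REUSE)

x = tf.placeholder(tf.float32, [1, 64 * 64 * 3])  # target image
z = tf.get_variable('z', shape=[1, 512])          # latent code to optimize

# Step 1: the domain-guided encoder provides the initialization z = E(x).
init_z = tf.assign(z, E(x))

# Step 2: domain-regularized optimization. Besides pixel and perceptual
# reconstruction terms, the encoder keeps the optimized code in its domain.
x_rec = G(z)
loss = (tf.reduce_mean(tf.square(x_rec - x))                  # pixel term
        + 5e-5 * tf.reduce_mean(tf.square(F(x_rec) - F(x)))   # perceptual term
        + 2.0 * tf.reduce_mean(tf.square(E(x_rec) - z)))      # domain regularizer
step = tf.train.AdamOptimizer(0.01).minimize(loss, var_list=[z])

with tf.Session() as sess:
    sess.run(tf.global_variables_initializer())
    target = np.random.rand(1, 64 * 64 * 3).astype(np.float32)
    sess.run(init_z, {x: target})
    for _ in range(100):  # ~100 iterations are enough in practice (see Inversion below)
        sess.run(step, {x: target})
```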

**NEWS: Please also find [this repo](https://github.com/genforce/idinvert_pytorch), which is friendly to PyTorch users!**

[[Paper](https://arxiv.org/pdf/2004.00049.pdf)]
[[Project Page](https://genforce.github.io/idinvert/)]
[[Demo](https://www.youtube.com/watch?v=3v6NHrhuyFY)]
[[Colab](https://colab.research.google.com/github/genforce/idinvert_pytorch/blob/master/docs/Idinvert.ipynb)]

## Testing

### Pre-trained Models

Please download the pre-trained models from the following links. Each model contains the GAN generator and discriminator, as well as the proposed **domain-guided encoder**.

| Path | Description
| :--- | :----------
|[face_256x256](https://drive.google.com/file/d/1azAzSZg6VfNydjWr4qfl8Z4LfxktTPqM/view?usp=sharing) | In-domain GAN trained on the [FFHQ](https://github.com/NVlabs/ffhq-dataset) dataset.
|[tower_256x256](https://drive.google.com/file/d/1USfaSLor5d71IRoC8CWTbKJagS0-MJEv/view?usp=sharing) | In-domain GAN trained on the [LSUN Tower](https://github.com/fyu/lsun) dataset.
|[bedroom_256x256](https://drive.google.com/file/d/1nRa4WAE1qF_j1CtH32hxjREK0o-rpucD/view?usp=sharing) | In-domain GAN trained on the [LSUN Bedroom](https://github.com/fyu/lsun) dataset.

### Inversion

```bash
MODEL_PATH='styleganinv_face_256.pkl'
IMAGE_LIST='examples/test.list'
python invert.py $MODEL_PATH $IMAGE_LIST
```

NOTE: We find that 100 iterations are sufficient for inverting an image, which takes about 8s on a P40 GPU. Users can always run more iterations (much slower) for a more precise reconstruction.
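
The `.list` file format is not documented here; it is assumed to simply enumerate image paths, one per line. A hypothetical helper for building such a list:

```python
# Hypothetical helper, assuming the .list format is one image path per line.
import glob

with open('examples/test.list', 'w') as f:
    for path in sorted(glob.glob('examples/*.png')):
        f.write(path + '\n')
```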

### Semantic Diffusion

```bash
MODEL_PATH='styleganinv_face_256.pkl'
TARGET_LIST='examples/target.list'
CONTEXT_LIST='examples/context.list'
python diffuse.py $MODEL_PATH $TARGET_LIST $CONTEXT_LIST
```

NOTE: The diffusion process is very similar to image inversion. The main difference is that only the target patch contributes to the reconstruction loss, i.e., the optimization is **masked**.
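
A rough numpy illustration of such a masked reconstruction loss (names and shapes are assumptions, not the exact code in `diffuse.py`):

```python
import numpy as np

def masked_recon_loss(x_rec, context_with_target, mask):
    """Mean squared error computed only inside the target patch.

    x_rec:               reconstructed image from the generator, shape (H, W, 3)
    context_with_target: context image with the target patch pasted in
    mask:                binary array, 1 inside the target patch, 0 elsewhere
    """
    diff = (x_rec - context_with_target) ** 2 * mask[..., None]
    return diff.sum() / (mask.sum() * 3 + 1e-8)

# Toy example: a 256x256 image with a 64x64 target patch in the center.
h = w = 256
mask = np.zeros((h, w), dtype=np.float32)
mask[96:160, 96:160] = 1.0
x_rec = np.random.rand(h, w, 3).astype(np.float32)
target_in_context = np.random.rand(h, w, 3).astype(np.float32)
print(masked_recon_loss(x_rec, target_in_context, mask))
```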

### Interpolation

```bash
SRC_DIR='results/inversion/test'
DST_DIR='results/inversion/test'
python interpolate.py $MODEL_PATH $SRC_DIR $DST_DIR
```
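
Conceptually, interpolation is a linear blend between two inverted codes in latent space. A minimal sketch, assuming the inverted codes are layer-wise $w$ codes of shape `(14, 512)` for the 256x256 models:

```python
import numpy as np

def interpolate_codes(w_src, w_dst, num_steps=8):
    """Linearly interpolate between two inverted latent codes.

    w_src, w_dst: inverted codes of assumed shape (num_layers, 512).
    Returns an array of shape (num_steps, num_layers, 512); each row is fed
    to the generator to render one frame of the interpolation.
    """
    alphas = np.linspace(0.0, 1.0, num_steps).reshape(-1, 1, 1)
    return (1.0 - alphas) * w_src[None] + alphas * w_dst[None]

w_src = np.random.randn(14, 512).astype(np.float32)
w_dst = np.random.randn(14, 512).astype(np.float32)
frames = interpolate_codes(w_src, w_dst)
print(frames.shape)  # (8, 14, 512)
```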

### Manipulation

```bash
IMAGE_DIR='results/inversion/test'
BOUNDARY='boundaries/expression.npy'
python manipulate.py $MODEL_PATH $IMAGE_DIR $BOUNDARY
```

NOTE: Boundaries are obtained using [InterFaceGAN](https://github.com/genforce/interfacegan).
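
Conceptually, manipulation shifts an inverted code along a semantic boundary learned by InterFaceGAN. A hedged sketch, assuming the boundary `.npy` holds a direction of shape `(1, 512)` and the inverted code is layer-wise:

```python
import numpy as np

def manipulate(w, boundary, distances=(-3.0, -1.5, 0.0, 1.5, 3.0)):
    """Shift an inverted code along a semantic direction.

    w:        inverted code, assumed shape (num_layers, 512)
    boundary: semantic direction from InterFaceGAN, assumed shape (1, 512)
    Returns one edited code per distance; feed each to the generator.
    """
    boundary = boundary / np.linalg.norm(boundary)  # normalize to a unit direction
    return [w + d * boundary for d in distances]

# In practice the boundary comes from e.g. np.load('boundaries/expression.npy');
# random stand-ins are used here so the sketch runs on its own.
w = np.random.randn(14, 512).astype(np.float32)
boundary = np.random.randn(1, 512).astype(np.float32)
edited = manipulate(w, boundary)
print(len(edited), edited[0].shape)  # 5 (14, 512)
```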

### Style Mixing

```bash
STYLE_DIR='results/inversion/test'
CONTENT_DIR='results/inversion/test'
python mix_style.py $MODEL_PATH $STYLE_DIR $CONTENT_DIR
```
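
Style mixing swaps the per-layer codes between two inverted images: coarse layers keep the content code while finer layers take the style code. A sketch with an illustrative layer split (the actual split used by `mix_style.py` may differ):

```python
import numpy as np

def mix_styles(w_content, w_style, mix_layers=range(8, 14)):
    """Replace the codes of selected (fine) layers with those of the style image.

    w_content, w_style: inverted codes of assumed shape (num_layers, 512).
    mix_layers: indices of layers that take the style code; higher layers
                control finer details such as color and texture.
    """
    w_mixed = w_content.copy()
    w_mixed[list(mix_layers)] = w_style[list(mix_layers)]
    return w_mixed

w_content = np.random.randn(14, 512).astype(np.float32)
w_style = np.random.randn(14, 512).astype(np.float32)
print(mix_styles(w_content, w_style).shape)  # (14, 512)
```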

## Training

The GAN model used in this work is [StyleGAN](https://github.com/NVlabs/stylegan). Beyond the original repository, we made the following changes:

- Change the repeated $w$ shared by all layers to a different $w$ for each layer (Lines 428-435 in file `training/networks_stylegan.py`).
- Add the *domain-guided* encoder in file `training/networks_encoder.py`.
- Add losses for training the *domain-guided* encoder in file `training/loss_encoder.py` (a sketch of this objective follows the list).
- Add schedule for training the *domain-guided* encoder in file `training/training_loop_encoder.py`.
- Add a perceptual model (VGG16) for computing perceptual loss in file `perceptual_model.py`.
- Add training script for the *domain-guided* encoder in file `train_encoder.py`.
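
For orientation, the losses in `training/loss_encoder.py` train the domain-guided encoder with reconstruction, perceptual, and adversarial terms while the generator stays fixed. The TF 1.x sketch below uses stand-in dense layers, illustrative weights, and a simplified adversarial form; it is not the repository's exact code:

```python
import tensorflow as tf

# Stand-ins for the fixed generator G, the trainable encoder E, the VGG16
# feature extractor F, and the discriminator D used to keep reconstructions realistic.
def G(z): return tf.layers.dense(z, 64 * 64 * 3, name='G', reuse=tf.AUTO_REUSE)
def E(x): return tf.layers.dense(x, 512, name='E', reuse=tf.AUTO_REUSE)
def F(x): return tf.layers.dense(x, 128, name='F', reuse=tf.AUTO_REUSE)
def D(x): return tf.layers.dense(x, 1, name='D', reuse=tf.AUTO_REUSE)

x = tf.placeholder(tf.float32, [None, 64 * 64 * 3])  # real training images
x_rec = G(E(x))                                       # reconstruction through the fixed G

# Encoder objective: pixel + perceptual reconstruction plus an adversarial
# term (simplified here; weights are illustrative).
loss_e = (tf.reduce_mean(tf.square(x_rec - x))
          + 5e-5 * tf.reduce_mean(tf.square(F(x_rec) - F(x)))
          - 0.1 * tf.reduce_mean(D(x_rec)))

# Discriminator objective: distinguish real images from reconstructions.
loss_d = tf.reduce_mean(D(x_rec)) - tf.reduce_mean(D(x))

train_e = tf.train.AdamOptimizer(1e-4).minimize(loss_e, var_list=tf.trainable_variables('E'))
train_d = tf.train.AdamOptimizer(1e-4).minimize(loss_d, var_list=tf.trainable_variables('D'))
```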

### Step-1: Train your own generator

```bash
python train.py
```

### Step-2: Train your own encoder

```bash
TRAINING_DATA=PATH_TO_TRAINING_DATA
TESTING_DATA=PATH_TO_TESTING_DATA
DECODER_PKL=PATH_TO_GENERATOR
python train_encoder.py $TRAINING_DATA $TESTING_DATA $DECODER_PKL
```

Note that the file `dataset_tool.py`, borrowed from the [StyleGAN](https://github.com/NVlabs/stylegan) repo, is used to prepare a directory of `.tfrecords` data at all resolutions. The encoder training does not rely on the progressive growing strategy, so both the training data and the test data should be specified as the `.tfrecords` file with the highest resolution.

## BibTeX

```bibtex
@inproceedings{zhu2020indomain,
  title     = {In-domain GAN Inversion for Real Image Editing},
  author    = {Zhu, Jiapeng and Shen, Yujun and Zhao, Deli and Zhou, Bolei},
  booktitle = {Proceedings of European Conference on Computer Vision (ECCV)},
  year      = {2020}
}
```