# A Disentangling Invertible Interpretation Network for Explaining Latent Representations
PyTorch code accompanying the [CVPR 2020](http://cvpr2020.thecvf.com/) paper
[**A Disentangling Invertible Interpretation Network for Explaining Latent Representations**](https://compvis.github.io/iin/)
[Patrick Esser](https://github.com/pesser)\*,
[Robin Rombach](https://github.com/rromb)\*,
[Björn Ommer](https://hci.iwr.uni-heidelberg.de/Staff/bommer)
\* equal contribution

![teaser](assets/mnist.gif)

[arXiv](https://arxiv.org/abs/2004.13166) | [BibTeX](#bibtex) | [Project Page](https://compvis.github.io/iin/)

Table of Contents
=================

* [Requirements](#requirements)
* [Data](#data)
* [Training](#training)
* [Autoencoders](#autoencoders)
* [Classifiers](#classifiers)
* [Invertible Interpretation Networks](#invertible-interpretation-networks)
* [Unsupervised](#unsupervised)
* [Supervised](#supervised)
* [Evaluation](#evaluation)
* [Pretrained Models](#pretrained-models)
* [Results](#results)
* [BibTeX](#bibtex)

## Requirements
A suitable [conda](https://conda.io/) environment named `iin` can be created
and activated with:

```
conda env create -f environment.yaml
conda activate iin
```

Optionally, you can then also `conda install tensorflow-gpu=1.14` to speed up
FID evaluations.

## Data
`MNIST`, `FashionMNIST` and `CIFAR10` will be downloaded automatically the
first time they are used and `CelebA` will prompt you to download it. The
content of each dataset can be visualized with

```
edexplore --dataset iin.data.<dataset>
```

where `<dataset>` is one of `MNISTTrain`, `MNISTTest`, `FashionMNISTTrain`,
`FashionMNISTTest`, `CIFAR10Train`, `CIFAR10Test`, `CelebATrain`, `CelebATest`,
`FactorCelebATrain`, `FactorCelebATest`, `ColorfulMNISTTrain`,
`ColorfulMNISTTest`, `SingleColorfulMNISTTrain`, `SingleColorfulMNISTTest`.

## Training
### Autoencoders
To train autoencoders, run

```
edflow -b configs/<dataset>_ae.yaml -t
```

where `<dataset>` is one of `mnist`, `fashionmnist`, `cifar`, `celeba`,
`cmnist`. To enable logging to [wandb](https://wandb.ai), adjust
`configs/project.yaml` and add it to the above command:

```
edflow -b configs/<dataset>_ae.yaml configs/project.yaml -t
```

### Classifiers
To train a classifier on `ColorfulMNIST`, run

```
edflow -b configs/cmnist_clf.yaml -t
```

Once you have a checkpoint, you can estimate factor dimensionalities using
```
edflow -b configs/cmnist_clf.yaml configs/cmnist_dimeval.yaml -c <checkpoint>
```

For the pretrained classifier, this gives
```
[INFO] [dim_callback]: estimated factor dimensionalities: [22, 11, 31]
```

and to compare this to an autoencoder, run
```
edflow -b configs/cmnist_ae.yaml configs/cmnist_dimeval.yaml -c <checkpoint>
```

which gives
```
[INFO] [dim_callback]: estimated factor dimensionalities: [13, 17, 34]
```

### Invertible Interpretation Networks
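For background, invertible interpretation networks of this kind are typically composed of affine coupling blocks, which transform one half of the input conditioned on the other and can be inverted in closed form. A minimal NumPy sketch of the general technique follows; it is an illustration under assumed names (`coupling_forward`, `scale_net`, etc.), not this repo's exact architecture:

```python
import numpy as np

def coupling_forward(z, scale_net, shift_net):
    """Affine coupling: z1 passes through; z2 is scaled and shifted based on z1."""
    z1, z2 = np.split(z, 2)
    s, t = scale_net(z1), shift_net(z1)
    return np.concatenate([z1, z2 * np.exp(s) + t])

def coupling_inverse(y, scale_net, shift_net):
    """Exact closed-form inverse of the coupling above."""
    y1, y2 = np.split(y, 2)
    s, t = scale_net(y1), shift_net(y1)
    return np.concatenate([y1, (y2 - t) * np.exp(-s)])

# Toy conditioning networks (stand-ins for learned MLPs).
rng = np.random.default_rng(0)
W_s, W_t = rng.normal(size=(4, 4)), rng.normal(size=(4, 4))
scale_net = lambda x: np.tanh(W_s @ x)  # bounded scales for numerical stability
shift_net = lambda x: W_t @ x

z = rng.normal(size=8)
y = coupling_forward(z, scale_net, shift_net)
z_rec = coupling_inverse(y, scale_net, shift_net)
assert np.allclose(z, z_rec)  # invertible by construction
```

Because only element-wise scaling and shifting are applied, the log-determinant of the Jacobian is simply the sum of the scales, which is what makes such blocks practical for flow-based training.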
#### Unsupervised on AE
To train unsupervised invertible interpretation networks, run

```
edflow -b configs/<dataset>_iin.yaml [configs/project.yaml] -t
```

where `<dataset>` is one of `mnist`, `fashionmnist`, `cifar`, `celeba`. If,
instead of using one of the [pretrained models](#pretrained-models), you
trained an autoencoder yourself, adjust the `first_stage` config section
accordingly.

#### Supervised
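The supervised dIIN partitions the latent code into one subspace per semantic factor, using dimensionalities such as those estimated above; note that both estimates, `[22, 11, 31]` and `[13, 17, 34]`, sum to 64, which is presumably the models' latent dimensionality. A small sketch of such a split (variable names are my own; the repo configures this via `Transformer/factor_config`):

```python
import numpy as np

# Factor dimensionalities estimated by the pretrained classifier (see above).
factor_dims = [22, 11, 31]
assert sum(factor_dims) == 64  # partitions the full latent code

z = np.random.default_rng(0).normal(size=64)

# Split z into one slice per semantic factor.
bounds = np.cumsum(factor_dims)[:-1]  # -> [22, 33]
factors = np.split(z, bounds)
print([f.shape[0] for f in factors])  # -> [22, 11, 31]
```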
To train supervised, disentangling IINs, run

```
edflow -b configs/<dataset>_diin.yaml [configs/project.yaml] -t
```

where `<dataset>` is one of `cmnist` or `celeba`, or run

```
edflow -b configs/cmnist_clf_diin.yaml [configs/project.yaml] -t
```

to train a dIIN on top of a classifier, with factor dimensionalities as
estimated above (the dimensionalities of factors can be adjusted via the
`Transformer/factor_config` configuration entry).

## Evaluation
Evaluations run automatically after each epoch of training. To start an
evaluation manually, run

```
edflow -p logs/<log_folder>/configs/<config>.yaml
```

and, optionally, add `-c <checkpoint>` to evaluate a specific
checkpoint instead of the last one.

## Pretrained Models
Download [`logs.tar.gz`](https://heibox.uni-heidelberg.de/f/0c76b38bf4274448b8e9/)
(~2.2 GB) and extract the pretrained models:

```
tar xzf logs.tar.gz
```

## Results
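For reference, the FID scores below are Fréchet distances between Gaussian fits to Inception features of real and generated samples. A minimal sketch of the distance computation itself (feature extraction omitted; this is an illustration, not the repo's evaluation code):

```python
import numpy as np
from scipy import linalg

def frechet_distance(mu1, cov1, mu2, cov2):
    """Fréchet distance between Gaussians N(mu1, cov1) and N(mu2, cov2)."""
    covmean = linalg.sqrtm(cov1 @ cov2)
    if np.iscomplexobj(covmean):  # numerical noise can yield tiny imaginary parts
        covmean = covmean.real
    diff = mu1 - mu2
    return diff @ diff + np.trace(cov1 + cov2 - 2.0 * covmean)

# Identical distributions have distance ~0.
mu, cov = np.zeros(4), np.eye(4)
print(frechet_distance(mu, cov, mu, cov) < 1e-6)  # -> True
```

In practice the statistics (`mu`, `cov`) are computed over Inception activations of many samples, so lower scores mean the generated distribution is closer to the data distribution.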
Using spectral normalization for the discriminator, this code slightly improves
upon the values reported in Tab. 2 of the paper.

| Dataset      | Checkpoint | FID    |
|--------------|------------|--------|
| MNIST        | 105600     | 5.252  |
| FashionMNIST | 110400     | 9.663  |
| CelebA       | 84643      | 19.839 |
| CIFAR10      | 32000      | 38.697 |

Full training logs can be found on [Weights &
Biases](https://app.wandb.ai/trex/iin/reportlist).

## BibTeX
```
@inproceedings{esser2020invertible,
title={A Disentangling Invertible Interpretation Network for Explaining Latent Representations},
author={Esser, Patrick and Rombach, Robin and Ommer, Bj{\"o}rn},
booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
year={2020}
}
```