Lagrangian VAE
- Host: GitHub
- URL: https://github.com/ermongroup/lagvae
- Owner: ermongroup
- License: MIT
- Created: 2018-05-28T18:10:47.000Z (over 7 years ago)
- Default Branch: master
- Last Pushed: 2018-07-27T18:52:37.000Z (about 7 years ago)
- Last Synced: 2025-03-31T16:09:41.473Z (6 months ago)
- Topics: generative-adversarial-network, tensorflow, variational-autoencoder, variational-inference
- Language: Python
- Size: 58.6 KB
- Stars: 28
- Watchers: 5
- Forks: 5
- Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE.txt
README
# Lagrangian VAE
TensorFlow implementation for the paper [A Lagrangian Perspective of Latent Variable Generative Models](https://arxiv.org/abs/1806.06514), UAI 2018 Oral.
[Shengjia Zhao](http://szhao.me), [Jiaming Song](http://tsong.me) and [Stefano Ermon](http://cs.stanford.edu/~ermon), Stanford Artificial Intelligence Laboratory
## Overview
In this paper, we generalize the objective of latent variable generative models to two targets:
- Primal Problem: "mutual information objectives", such as maximizing / minimizing mutual information between observations and latent variables.
- Constraints: "consistency", which ensures that the model posterior is close to the amortized posterior.

**Lagrangian VAE** provides a practical way to find the best trade-off between "consistency constraints" and "mutual information objectives", as opposed to performing extensive hyperparameter tuning. We demonstrate this on **InfoVAE**, a latent variable generative model objective that otherwise requires tuning the strengths of the corresponding hyperparameters.
As demonstrated in the following figure, LagVAE finds a near Pareto-optimal curve for the trade-off between mutual information and consistency.

## Requirements
- click
- gputil
- tqdm

## Files

- `methods/infovae.py`: InfoVAE implementation (does not optimize Lagrange multipliers)
- `methods/lagvae.py`: LagVAE implementation (optimizes the Lagrange multipliers)

## Examples
Before running the examples, please set the environment variables `EXP_LOG_PATH` and `DATA_PREFIX`, which determine where experiment logs are written and where data is downloaded.
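For instance (the directories below are arbitrary placeholders, not defaults shipped with the repository), the variables can be set from a small Python launcher, or simply exported in your shell:

```python
import os
import subprocess

# Placeholder directories -- point these wherever you want logs and data.
os.environ["EXP_LOG_PATH"] = "/tmp/lagvae/log"
os.environ["DATA_PREFIX"] = "/tmp/lagvae/data"

# Equivalent to exporting both variables and running the LagVAE example below.
subprocess.run(
    ["python", "examples/lagvae.py", "--mi=1.0", "--e1=86.0", "--e2=5.0"],
    check=True,
)
```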
- InfoVAE: `python examples/infovae.py --mi=1.0 --e1=1.0 --e2=1.0`
- LagVAE: `python examples/lagvae.py --mi=1.0 --e1=86.0 --e2=5.0`

Note that the implementation scales MMD up by 10000, so `--e2=5.0` for LagVAE corresponds to the constraint MMD < 0.0005.
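Since one of the constraints is an MMD term, a generic biased RBF-kernel MMD² estimator is sketched below for intuition; the kernel choice, bandwidth, and sample sizes here are illustrative assumptions, not necessarily what `methods/lagvae.py` uses:

```python
import tensorflow as tf

def rbf_kernel(x, y, sigma=1.0):
    # Pairwise squared distances between rows of x and rows of y.
    sq_dist = tf.reduce_sum((x[:, None, :] - y[None, :, :]) ** 2, axis=-1)
    return tf.exp(-sq_dist / (2.0 * sigma ** 2))

def mmd(z_q, z_p, sigma=1.0):
    # Biased estimate of MMD^2 between encoder samples and prior samples.
    return (tf.reduce_mean(rbf_kernel(z_q, z_q, sigma))
            + tf.reduce_mean(rbf_kernel(z_p, z_p, sigma))
            - 2.0 * tf.reduce_mean(rbf_kernel(z_q, z_p, sigma)))

# Example: MMD between samples from a shifted encoder and a standard normal prior.
z_q = tf.random.normal([128, 2], mean=0.5)
z_p = tf.random.normal([128, 2])
print(float(mmd(z_q, z_p)))
```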
Feel free to play around with different `VariationalEncoder`, `VariationalDecoder`, optimizers, and datasets.
## References
If you find the idea or code useful for your research, please consider citing our paper:
```
@article{zhao2018the,
  title={The Information Autoencoding Family: A Lagrangian Perspective on Latent Variable Generative Models},
  author={Zhao, Shengjia and Song, Jiaming and Ermon, Stefano},
  journal={arXiv preprint arXiv:1806.06514},
  year={2018}
}
```

## Acknowledgements
`utils/logger.py` is based on an implementation in [OpenAI Baselines](https://github.com/openai/baselines).
## Contact
`tsong [at] cs [dot] stanford [dot] edu`