https://github.com/junyanz/von

[NeurIPS 2018] Visual Object Networks: Image Generation with Disentangled 3D Representation.
https://github.com/junyanz/von

3d cyclegan disentangled-representations gans generative-adversarial-network pix2pix pytorch shape-generation

Last synced: 6 months ago
JSON representation

[NeurIPS 2018] Visual Object Networks: Image Generation with Disentangled 3D Representation.

Host: GitHub
URL: https://github.com/junyanz/von
Owner: junyanz
License: other
Created: 2018-10-26T02:11:52.000Z (almost 7 years ago)
Default Branch: master
Last Pushed: 2020-08-05T01:03:13.000Z (about 5 years ago)
Last Synced: 2025-03-29T09:11:42.432Z (6 months ago)
Topics: 3d, cyclegan, disentangled-representations, gans, generative-adversarial-network, pix2pix, pytorch, shape-generation
Language: Python
Homepage: http://von.csail.mit.edu
Size: 1.81 MB
Stars: 532
Watchers: 30
Forks: 90
Open Issues: 12
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

# Visual Object Networks

[Project Page](http://von.csail.mit.edu) | [Paper](http://arxiv.org/abs/1812.02725)

We present Visual Object Networks (VON), an end-to-end adversarial learning framework that jointly models 3D shapes and 2D images. Our model can synthesize a 3D shape, its intermediate 2.5D depth representation, and a 2D image all at once. The VON not only generates realistic images but also enables several 3D operations.

Visual Object Networks: Image Generation with Disentangled 3D Representation.

[Jun-Yan Zhu](https://www.cs.cmu.edu/~junyanz/),
[Zhoutong Zhang](https://www.csail.mit.edu/person/zhoutong-zhang), [Chengkai Zhang](https://scholar.google.com/citations?user=rChGGwgAAAAJ&hl=en), [Jiajun Wu](https://jiajunwu.com/), [Antonio Torralba](http://web.mit.edu/torralba/www/), [ Joshua B. Tenenbaum](http://web.mit.edu/cocosci/josh.html), [William T. Freeman](http://billf.mit.edu/).

MIT CSAIL and Google Research.

In NeurIPS 2018.

## Example results
(a) Typical examples produced by a recent GAN model [Gulrajani et al., 2017].

(b) Our model produces three outputs: a 3D shape, its 2.5D projection given a viewpoint, and a final image with realistic texture.

(c) Our model allows several 3D applications including editing viewpoint, shape, or texture independently.

## More samples
Below we show more samples from DCGAN [Radford et al., 2016], LSGAN [Mao et al., 2017], WGAN-GP [Gulrajani et al., 2017], and our VON. For our method, we show both 3D shapes and 2D images. The learned 3D prior helps produce better samples.

## 3D object manipulations
Our VON allows several 3D applications such as (left) changing the viewpoint, texture, or shape independently, and (right) interpolating between two objects in shape space, texture space, or both.

## Transfer texture across objects and viewpoints
VON can transfer the texture of a real image to different shapes and viewpoints

## Prerequisites
- Linux (only tested on Ubuntu 16.04)
- Python3 (only tested with python 3.6)
- Anaconda3
- NVCC & GCC (only tested with gcc 6.3.0)
- PyTorch 0.4.1 (does not support 0.4.0)
- Currently not tested with Nvidia RTX GPU series
- Docker Engine and Nvidia-Docker2 if using Docker container.

## Getting Started ###
### Installation
- Clone this repo:
```bash
git clone -b master --single-branch https://github.com/junyanz/VON.git
cd VON
```
- Install PyTorch 0.4.1+ and torchvision from http://pytorch.org and other dependencies (e.g., [visdom](https://github.com/facebookresearch/visdom) and [dominate](https://github.com/Knio/dominate)). You can install all the dependencies by the following:
```bash
conda create --name von --file pkg_specs.txt
source activate von
```

- Compile our rendering kernel by running the following:
```bash
bash install.sh
```
We only test this step with `gcc 6.3.0`. If you need to recompile the kernel, please run `bash clean.sh` first before you recompile it.

- If you can not compile the custom kernels, we provide a Dockerfile for building a working container. To use the Dockerfile, you need to install [Docker Engine](https://www.docker.com) and [Nvidia-Docker2](https://github.com/NVIDIA/nvidia-docker) for using Nvidia GPUs inside the docker container. To build the docker image, run:
```bash
sudo docker build ./../von -t von
```
To access the container, run:
```bash
sudo docker run -it --runtime=nvidia --ipc=host von /bin/bash
```
Then, to compile the kernels, simply:
```bash
cd /app/von
source activate von
./install.sh
```

- (Optional) Install [blender](https://www.blender.org/) for visualizing generated 3D shapes. After installation, please add blender to the PATH environment variable.

### Generate 3D shapes, 2.5D sketches, and images
- Download our pretrained models:
```bash
bash ./scripts/download_model.sh
```

- Generate results with the model
```
bash ./scripts/figures.sh 0 car df
```

The test results will be saved to an HTML file here: `./results/*/*/index.html`.

### Model Training
- To train a model, download the training dataset (distance functions and images). For example, if we would like to train a car model with distance function representation on GPU 0.
```bash
bash ./scripts/download_dataset.sh
```
- To train a 3D generator:
```bash
bash ./scripts/train_shape.sh 0 car df
```
- To train a 2D texture network using ShapeNet real shapes:
```bash
bash ./scripts/train_texture_real.sh 0 car df 0
```

- To train a 2D texture network using pre-trained 3D generator:
```bash
bash ./scripts/train_texture.sh 0 car df 0
```

- Jointly finetune 3D and 2D generative models:
```bash
bash ./scripts/train_full.sh 0 car df 0
```

- To view training results and loss plots, go to http://localhost:8097 in a web browser. To see more intermediate results, check out `./checkpoints/*/web/index.html`

### Citation

If you find this useful for your research, please cite the following paper.
```
@inproceedings{VON,
title={Visual Object Networks: Image Generation with Disentangled 3{D} Representations},
author={Jun-Yan Zhu and Zhoutong Zhang and Chengkai Zhang and Jiajun Wu and Antonio Torralba and Joshua B. Tenenbaum and William T. Freeman},
booktitle={Advances in Neural Information Processing Systems (NeurIPS)},
year={2018}
}

```

### Acknowledgements
This work is supported by NSF #1231216, NSF #1524817, ONR MURI N00014-16-1-2007, Toyota Research Institute, Shell, and Facebook. We thank Xiuming Zhang, Richard Zhang, David Bau, and Zhuang Liu for valuable discussions. This code borrows from the [CycleGAN & pix2pix](https://github.com/junyanz/pytorch-CycleGAN-and-pix2pix) repo.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/junyanz/von

Awesome Lists containing this project

README