# [CVPR 2020] 3D Photography using Context-aware Layered Depth Inpainting
[![Open 3DPhotoInpainting in Colab](https://colab.research.google.com/assets/colab-badge.svg)](https://colab.research.google.com/drive/1706ToQrkIZshRSJSHvZ1RuCiM__YX3Bz)
### [[Paper](https://arxiv.org/abs/2004.04727)] [[Project Website](https://shihmengli.github.io/3D-Photo-Inpainting/)] [[Google Colab](https://colab.research.google.com/drive/1706ToQrkIZshRSJSHvZ1RuCiM__YX3Bz)]
We propose a method for converting a single RGB-D input image into a 3D photo, i.e., a multi-layer representation for novel view synthesis that contains hallucinated color and depth structures in regions occluded in the original view. We use a Layered Depth Image with explicit pixel connectivity as the underlying representation, and present a learning-based inpainting model that iteratively synthesizes new local color-and-depth content into the occluded region in a spatial context-aware manner. The resulting 3D photos can be efficiently rendered with motion parallax using standard graphics engines. We validate the effectiveness of our method on a wide range of challenging everyday scenes and show fewer artifacts compared with the state of the art.
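As a rough illustration of the representation described above, here is a minimal Python sketch of a Layered Depth Image with explicit pixel connectivity. This is *not* the repository's actual data structure; the class names, fields, and the depth threshold are ours, chosen only to make the idea concrete.

```python
# Illustrative sketch only -- not the repository's implementation.
# Each (x, y) location may hold several depth samples, and each sample keeps
# explicit links to its 4-connected neighbors so connectivity can be severed
# at depth discontinuities before inpainting the exposed regions.
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Tuple

@dataclass
class LDIPixel:
    color: Tuple[int, int, int]       # RGB color of this layer sample
    depth: float                      # depth of this layer sample
    # Explicit connectivity; None marks a severed edge (a discontinuity).
    up: Optional["LDIPixel"] = None
    down: Optional["LDIPixel"] = None
    left: Optional["LDIPixel"] = None
    right: Optional["LDIPixel"] = None

@dataclass
class LayeredDepthImage:
    width: int
    height: int
    # Front-to-back list of LDI pixels per (x, y) location.
    pixels: Dict[Tuple[int, int], List[LDIPixel]] = field(default_factory=dict)

    def add(self, x: int, y: int, p: LDIPixel) -> None:
        self.pixels.setdefault((x, y), []).append(p)

    def sever_if_discontinuous(self, a: LDIPixel, b: LDIPixel,
                               threshold: float = 0.1) -> bool:
        """Cut the link between adjacent pixels whose depth gap exceeds a
        threshold; inpainting then hallucinates color and depth behind the cut."""
        if abs(a.depth - b.depth) <= threshold:
            return False
        for fwd, bwd in (("right", "left"), ("down", "up"),
                         ("left", "right"), ("up", "down")):
            if getattr(a, fwd) is b:
                setattr(a, fwd, None)
                setattr(b, bwd, None)
        return True
```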
**3D Photography using Context-aware Layered Depth Inpainting**
[Meng-Li Shih](https://shihmengli.github.io/),
[Shih-Yang Su](https://lemonatsu.github.io/),
[Johannes Kopf](https://johanneskopf.de/), and
[Jia-Bin Huang](https://filebox.ece.vt.edu/~jbhuang/)
In IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020.

## Prerequisites
- Linux (tested on Ubuntu 18.04.4 LTS)
- Anaconda
- Python 3.7 (tested on 3.7.4)
- PyTorch 1.4.0 (tested on 1.4.0 for execution) and the Python dependencies listed in [requirements.txt](requirements.txt)
- To get started, please run the following commands:
```bash
conda create -n 3DP python=3.7 anaconda
conda activate 3DP
pip install -r requirements.txt
conda install pytorch==1.4.0 torchvision==0.5.0 cudatoolkit==10.1.243 -c pytorch
```
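After installation you may want to verify that the tested versions are actually in place. A quick optional sanity check (our suggestion, not part of the repo):

```python
# Optional sanity check that the environment matches the tested configuration.
import torch
import torchvision

print("PyTorch:", torch.__version__)            # tested with 1.4.0
print("torchvision:", torchvision.__version__)  # installed alongside: 0.5.0
print("CUDA available:", torch.cuda.is_available())
if torch.cuda.is_available():
    print("GPU:", torch.cuda.get_device_name(0))
```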
- Next, please download the model weight using the following command:
```bash
chmod +x download.sh
./download.sh
```

## Quick start
Please follow the instructions in this section.
This should allow you to reproduce our results.
For more detailed instructions, please refer to [`DOCUMENTATION.md`](DOCUMENTATION.md).

## Execute
1. Put ```.jpg``` files (e.g., test.jpg) into the ```image``` folder.
- E.g., `image/moon.jpg`
2. Run the following command:
```bash
python main.py --config argument.yml
```
- Note: The 3D photo generation process usually takes about 2-3 minutes depending on the available computing resources.
3. The results are stored in the following directories:
- Corresponding depth map estimated by [MiDaS](https://github.com/intel-isl/MiDaS.git)
- E.g. ```depth/moon.npy```, ```depth/moon.png```
     - You can edit ```depth/moon.png``` manually (see the sketch after step 4 below).
     - If you want to use the manually edited ```depth/moon.png``` as input for the 3D photo, remember to set the following two flags:
- `depth_format: '.png'`
- `require_midas: False`
   - Inpainted 3D mesh (optional: you need to switch on the `save_ply` flag)
- E.g. ```mesh/moon.ply```
- Rendered videos with zoom-in motion
- E.g. ```video/moon_zoom-in.mp4```
- Rendered videos with swing motion
- E.g. ```video/moon_swing.mp4```
- Rendered videos with circle motion
- E.g. ```video/moon_circle.mp4```
- Rendered videos with dolly zoom-in effect
- E.g. ```video/moon_dolly-zoom-in.mp4```
- Note: We assume that the object of focus is located at the center of the image.
4. (Optional) If you want to change the default configuration, please read [`DOCUMENTATION.md`](DOCUMENTATION.md) and modify ```argument.yml```.
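For reference, here is a hypothetical sketch of editing the estimated depth map programmatically instead of in an image editor, before re-running the pipeline with `depth_format: '.png'` and `require_midas: False` set in ```argument.yml```. The file paths follow the `moon.jpg` example above; the specific edit (clamping the farthest values) is only an illustration, and the value range of the PNG depends on the MiDaS output.

```python
# Hypothetical depth-map edit (not part of the repo). Clamps the top 10% of
# values to flatten the background before re-running main.py with
# depth_format: '.png' and require_midas: False in argument.yml.
import numpy as np
from PIL import Image

img = Image.open("depth/moon.png")
depth = np.array(img)
orig_dtype = depth.dtype

# NOTE: MiDaS predicts relative (inverse) depth, so whether large values are
# near or far depends on how the map was written; inspect it before editing.
clip_value = np.percentile(depth, 90)
edited = np.minimum(depth, clip_value)

Image.fromarray(edited.astype(orig_dtype)).save("depth/moon.png")
```

After saving the edited map, re-run `python main.py --config argument.yml` with the two flags set as described above.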
## License

This work is licensed under the MIT License. See [LICENSE](LICENSE) for details.

If you find our code/models useful, please consider citing our paper:
```
@inproceedings{Shih3DP20,
author = {Shih, Meng-Li and Su, Shih-Yang and Kopf, Johannes and Huang, Jia-Bin},
title = {3D Photography using Context-aware Layered Depth Inpainting},
booktitle = {IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
year = {2020}
}
```

## Acknowledgments
- We thank Pratul Srinivasan for clarifying the method of [Srinivasan et al. CVPR 2019](https://people.eecs.berkeley.edu/~pratul/publication/mpi_extrapolation/).
- We thank the authors of [Zhou et al. 2018](https://people.eecs.berkeley.edu/~tinghuiz/projects/mpi/), [Choi et al. 2019](https://github.com/NVlabs/extreme-view-synth/), [Mildenhall et al. 2019](https://github.com/Fyusion/LLFF), [Srinivasan et al. 2019](https://github.com/google-research/google-research/tree/ac9b04e1dbdac468fda53e798a326fe9124e49fe/mpi_extrapolation), [Wiles et al. 2020](http://www.robots.ox.ac.uk/~ow/synsin.html), and [Niklaus et al. 2019](https://github.com/sniklaus/3d-ken-burns) for providing their implementations online.
- Our code builds upon [EdgeConnect](https://github.com/knazeri/edge-connect), [MiDaS](https://github.com/intel-isl/MiDaS.git), and [pytorch-inpainting-with-partial-conv](https://github.com/naoto0804/pytorch-inpainting-with-partial-conv).