https://github.com/vLAR-group/DM-NeRF
🔥 DM-NeRF in PyTorch (ICLR 2023)
- Host: GitHub
- URL: https://github.com/vLAR-group/DM-NeRF
- Owner: vLAR-group
- License: other
- Created: 2022-08-12T12:28:44.000Z (over 2 years ago)
- Default Branch: main
- Last Pushed: 2022-10-11T05:41:09.000Z (over 2 years ago)
- Last Synced: 2024-10-28T06:00:14.462Z (6 months ago)
- Topics: 3d, 3d-computer-vision, 3d-editor, 3d-reconstruction, 3d-segmentation, deep-learning, iclr2023, nerfs
- Language: Python
- Homepage:
- Size: 521 MB
- Stars: 249
- Watchers: 6
- Forks: 17
- Open Issues: 7
Metadata Files:
- Readme: README.md
- License: LICENSE
README
[arXiv 2208.07227](https://arxiv.org/abs/2208.07227)
[CC BY-NC-SA 4.0 License](https://creativecommons.org/licenses/by-nc-sa/4.0/legalcode)
[Twitter: @vLAR_Group](https://twitter.com/vLAR_Group)

## DM-NeRF: 3D Scene Geometry Decomposition and Manipulation from 2D Images
[Bing Wang](https://www.cs.ox.ac.uk/people/bing.wang/), [Lu Chen](https://chenlu-china.github.io/), [Bo Yang*](https://yang7879.github.io/)
[**Paper**](https://arxiv.org/abs/2208.07227) | [**Video**](https://www.youtube.com/watch?v=iE0RwmdLIzk) | [**DM-SR**](https://www.dropbox.com/s/1k75m38vahizbp9/dmsr.zip?dl=0)
The architecture of our proposed DM-NeRF. Given a 3D point $\boldsymbol{p}$, we learn an object code through a series of loss functions using both 2D and 3D supervision signals.
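The object code can be thought of as a soft assignment of each 3D point to one of a fixed number of object slots. The following is a minimal numpy sketch of that idea only; the layer sizes, names, and the toy point are illustrative assumptions, not the paper's actual PyTorch architecture.

```python
import numpy as np

# Illustrative sizes, not the real network's: a tiny MLP mapping a 3D point
# to a softmax distribution over a fixed number of object slots.
HIDDEN, NUM_SLOTS = 32, 8

rng = np.random.default_rng(0)
W1 = rng.normal(0, 0.1, (3, HIDDEN))
b1 = np.zeros(HIDDEN)
W2 = rng.normal(0, 0.1, (HIDDEN, NUM_SLOTS))
b2 = np.zeros(NUM_SLOTS)

def object_code(p):
    """Map 3D points (N, 3) to soft assignments over object slots (N, NUM_SLOTS)."""
    h = np.maximum(p @ W1 + b1, 0.0)              # ReLU hidden layer
    logits = h @ W2 + b2
    e = np.exp(logits - logits.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)      # softmax -> object code

codes = object_code(np.array([[0.1, -0.4, 2.0]]))  # one toy query point
```

In training, such per-point codes would be supervised by the 2D object masks (after volume rendering) and 3D consistency losses, as described in the paper.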
## 1. Decomposition and Reconstruction:
## 2. Decomposition and Rendering:
## 3. Manipulation:
## 4. Installation
DM-NeRF uses a Conda environment that makes it easy to install all dependencies.
1. Create the `DM-NeRF` Conda environment (Python 3.7) with [miniconda](https://docs.conda.io/en/latest/miniconda.html).
```bash
conda create --name DM-NeRF python=3.7
conda activate DM-NeRF
```

2. Install all dependencies by running:
```bash
pip install -r requirements.txt
```

### 4.1 Datasets
In this paper, we consider the following three different datasets:
#### (1) [DM-SR](https://www.dropbox.com/s/1k75m38vahizbp9/dmsr.zip?dl=0)
To the best of our knowledge, no existing 3D scene dataset is suitable for quantitative evaluation of geometry manipulation. We therefore create a synthetic dataset of 8 different types of complex indoor rooms, called DM-SR. The room types and designs follow the [Hypersim Dataset](https://github.com/apple/ml-hypersim). We first render the static scenes, then manipulate each scene and render it a second time. Each scene has a physical size of about 12x12x3 meters and contains around 8 objects. We will keep updating [DM-SR](https://www.dropbox.com/s/1k75m38vahizbp9/dmsr.zip?dl=0) for future research in the community.
#### (2) [Replica](https://www.dropbox.com/s/t1bref0zrmbq1gl/replica.zip?dl=0)
In this paper, we use 7 scenes `office0, office2, office3, office4, room0, room1, room2` from the [Replica Dataset](https://github.com/facebookresearch/Replica-Dataset). We asked the authors of [Semantic-NeRF](https://github.com/Harry-Zhi/semantic_nerf) to generate color images and 2D object masks, with camera poses, at 640x480 pixels for each of the 7 scenes. Each scene has 59~93 objects with very diverse sizes. Details of camera settings and trajectories can be found [here](https://www.dropbox.com/s/t1bref0zrmbq1gl/replica.zip?dl=0).
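As a rough illustration of how such 2D object masks can be consumed, the sketch below builds a toy integer instance mask and tallies per-object pixel counts; the mask values and encoding here are assumptions for illustration, not Replica's actual file format.

```python
import numpy as np

# A toy 4x6 instance mask standing in for one of the rendered 640x480 masks;
# each pixel is assumed to store the integer ID of the object it belongs to.
mask = np.array([
    [0, 0, 1, 1, 2, 2],
    [0, 0, 1, 1, 2, 2],
    [0, 3, 3, 3, 2, 2],
    [0, 3, 3, 3, 0, 0],
])

# List the object IDs present in the view and their pixel counts.
ids, counts = np.unique(mask, return_counts=True)
per_object = dict(zip(ids.tolist(), counts.tolist()))
```

Per-object pixel counts like these are the kind of 2D supervision signal that constrains the learned object codes.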
#### (3) [ScanNet](http://www.scan-net.org/)
In this paper, we use 8 scenes `scene0010_00, scene0012_00, scene0024_00, scene0033_00, scene0038_00, scene0088_00, scene0113_00, scene0192_00` from the ScanNet Dataset.
### 4.2 Training
To train the standard DM-NeRF, simply run the following command with a chosen config file specifying the data directory and hyper-parameters.
```bash
CUDA_VISIBLE_DEVICES=0 python -u train_dmsr.py --config configs/dmsr/train/study.txt
```
Other working modes and set-ups can also be run via the above command by choosing different config files.

### 4.3 Evaluation
In this paper, we use PSNR, SSIM, and LPIPS for rendering evaluation, and mAP for both decomposition and manipulation evaluation.
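As a reference point, PSNR (one of the rendering metrics above) can be computed as follows; this is a generic numpy sketch, not the repo's evaluation code.

```python
import numpy as np

def psnr(pred, gt, max_val=1.0):
    """Peak signal-to-noise ratio (dB) between two images scaled to [0, max_val]."""
    mse = np.mean((pred - gt) ** 2)
    return 10.0 * np.log10(max_val ** 2 / mse)

gt = np.zeros((4, 4, 3))
pred = gt + 0.1   # uniform error of 0.1 -> MSE = 0.01 -> PSNR = 20 dB
```

SSIM and LPIPS need dedicated implementations (e.g. structural windows, a pretrained network) and are typically taken from existing packages rather than written by hand.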
#### (1) Decomposition
##### Quantitative Evaluation
For decomposition evaluation, you need to choose a specific config file and then run:
```bash
CUDA_VISIBLE_DEVICES=0 python -u test_dmsr.py --config configs/dmsr/test/study.txt
```
##### Mesh Generation
For mesh generation, you can change the config file and then run:
```bash
CUDA_VISIBLE_DEVICES=0 python -u test_dmsr.py --config configs/dmsr/test/meshing.txt
```
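Mesh generation for a NeRF-style model typically evaluates the learned density on a dense 3D grid in chunks (to bound memory) and then runs marching cubes on the resulting volume. The sketch below shows only the chunked grid query, with a hypothetical analytic `density` standing in for the trained network; grid resolution and bounds are illustrative.

```python
import numpy as np

def density(pts):
    """Stand-in for a trained density network: positive inside a unit sphere."""
    return 1.0 - np.linalg.norm(pts, axis=-1)

def query_grid(fn, res=32, bound=1.5, chunk=4096):
    """Evaluate fn on a res^3 grid of points in chunks to bound peak memory."""
    xs = np.linspace(-bound, bound, res)
    pts = np.stack(np.meshgrid(xs, xs, xs, indexing="ij"), -1).reshape(-1, 3)
    out = np.concatenate([fn(pts[i:i + chunk]) for i in range(0, len(pts), chunk)])
    return out.reshape(res, res, res)

grid = query_grid(density)
# The level-0 isosurface of this grid is what marching cubes would turn into a mesh.
```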
#### (2) Manipulation
##### Quantitative Evaluation
We provide the DM-SR dataset for the quantitative evaluation of geometry manipulation.
Set the target object and the desired manipulation settings in a specific config file, then run:
```bash
CUDA_VISIBLE_DEVICES=0 python -u test_dmsr.py --config configs/dmsr/mani/study.txt --mani_mode translation
```
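Conceptually, a translation edit can be served by inverse-mapping query points that land on the moved object back into the coordinate frame the network was trained in. The sketch below illustrates just that inverse translation on a toy point; the offset value and function name are illustrative assumptions, not the repo's API (the real target object and translation are set in the config file).

```python
import numpy as np

# Hypothetical translation applied to the target object in the edited scene.
offset = np.array([0.5, 0.0, 0.2])

def to_original_space(query_pts):
    """Map query points on the translated object back into the trained scene,
    so the frozen network can be evaluated at the corresponding locations."""
    return query_pts - offset

moved = to_original_space(np.array([[1.0, 2.0, 3.0]]))
```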
##### Qualitative Evaluation
For other qualitative evaluations, you can change the config file and then run:
```bash
CUDA_VISIBLE_DEVICES=0 python -u test_dmsr.py --config configs/dmsr/mani/demo_deform.txt
```
## 5. [Video (Youtube)](https://www.youtube.com/watch?v=yQtpPfM5dTA)
### Citation
If you find our work useful in your research, please consider citing:

```
@article{wang2022dmnerf,
  title={DM-NeRF: 3D Scene Geometry Decomposition and Manipulation from 2D Images},
  author={Wang, Bing and Chen, Lu and Yang, Bo},
  journal={arXiv preprint arXiv:2208.07227},
  year={2022}
}
```

### License
Licensed under the CC BY-NC-SA 4.0 license; see [LICENSE](./LICENSE).

### Updates
* 31/8/2022: Data release!
* 25/8/2022: Code release!
* 15/8/2022: Initial release!

## Related Repos
1. [RangeUDF: Semantic Surface Reconstruction from 3D Point Clouds](https://github.com/vLAR-group/RangeUDF) 
2. [GRF: Learning a General Radiance Field for 3D Representation and Rendering](https://github.com/alextrevithick/GRF) 
3. [3D-BoNet: Learning Object Bounding Boxes for 3D Instance Segmentation on Point Clouds](https://github.com/Yang7879/3D-BoNet) 