https://github.com/THU-DA-6D-Pose-Group/GDR-Net

GDR-Net: Geometry-Guided Direct Regression Network for Monocular 6D Object Pose Estimation. (CVPR 2021)
https://github.com/THU-DA-6D-Pose-Group/GDR-Net

6d-pose-estimation detectron2 mmcv

Last synced: about 1 month ago
JSON representation

GDR-Net: Geometry-Guided Direct Regression Network for Monocular 6D Object Pose Estimation. (CVPR 2021)

Host: GitHub
URL: https://github.com/THU-DA-6D-Pose-Group/GDR-Net
Owner: THU-DA-6D-Pose-Group
License: apache-2.0
Created: 2021-02-21T14:31:57.000Z (over 5 years ago)
Default Branch: main
Last Pushed: 2023-11-28T05:38:02.000Z (over 2 years ago)
Last Synced: 2023-11-28T06:30:34.743Z (over 2 years ago)
Topics: 6d-pose-estimation, detectron2, mmcv
Language: Python
Homepage: https://github.com/THU-DA-6D-Pose-Group/GDR-Net
Size: 985 KB
Stars: 214
Watchers: 11
Forks: 39
Open Issues: 4
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

awesome-robot-vision - GDR-Net - guided direct regression network for monocular 6D object pose estimation. | [GitHub](https://github.com/THU-DA-6D-Pose-Group/GDR-Net) · [arXiv](https://arxiv.org/abs/2102.12145) | (6D Object Pose Estimation / Methods)

README

          # GDR-Net

This repo provides the PyTorch implementation of the work:

**Gu Wang, Fabian Manhardt, Federico Tombari, Xiangyang Ji. GDR-Net: Geometry-Guided Direct Regression Network for Monocular 6D Object Pose Estimation. In CVPR 2021.**

[[Paper](https://openaccess.thecvf.com/content/CVPR2021/html/Wang_GDR-Net_Geometry-Guided_Direct_Regression_Network_for_Monocular_6D_Object_Pose_CVPR_2021_paper.html)][[ArXiv](http://arxiv.org/abs/2102.12145)][[Video](https://www.bilibili.com/video/BV1dU4y1G7Ku?share_source=copy_web)][[bibtex](#Citation)]

## News

* [2023/10] An enhanced version of this work, [GPose2023](https://zhangcyg.github.io/figs/GPose_ICCV23.pdf) by Zhang et al., won BOP Challenge @ ICCV 2023. Congratulations!

* [2022/10] An enhanced version of this work, [GDRNPP](https://github.com/shanice-l/gdrnpp_bop2022.git) by Liu et al., won most of the [awards](http://cmp.felk.cvut.cz/sixd/workshop_2022/slides/bop_challenge_2022_results.pdf) on [BOP Challenge @ ECCV 2022](https://bop.felk.cvut.cz/challenges/bop-challenge-2022/). Congratulations!

* [2021/08] An extension of this work, [SO-Pose](https://arxiv.org/abs/2108.08367) by Di et al. (ICCV 2021), has been released ([SO-Pose code](https://github.com/shangbuhuan13/SO-Pose), [mirror](https://github.com/THU-DA-6D-Pose-Group/SO-Pose)).

## Overview







## Requirements

* Ubuntu 16.04/18.04, CUDA 10.1/10.2, python >= 3.6, PyTorch >= 1.6, torchvision

* Install `detectron2` from [source](https://github.com/facebookresearch/detectron2)

* `sh scripts/install_deps.sh`

* Compile the cpp extension for `farthest points sampling (fps)`:

    ```

    sh core/csrc/compile.sh

    ```

## Datasets

Download the 6D pose datasets (LM, LM-O, YCB-V) from the

[BOP website](https://bop.felk.cvut.cz/datasets/) and

[VOC 2012](https://pjreddie.com/projects/pascal-voc-dataset-mirror/)

for background images.

Please also download the `image_sets` and `test_bboxes` from

here ([BaiduNetDisk](https://pan.baidu.com/s/1gGoZGkuMYxhU9LBKxuSz0g), password: qjfk | [Cloud.THU](https://cloud.tsinghua.edu.cn/d/b2311297acf54f26b429/), password: fMNOASFHW0E8R72357T6mn9).

The structure of `datasets` folder should look like below:

```

# recommend using soft links (ln -sf)

datasets/

├── BOP_DATASETS

    ├──lm

    ├──lmo

    ├──ycbv

├── lm_imgn  # the OpenGL rendered images for LM, 1k/obj

├── lm_renders_blender  # the Blender rendered images for LM, 10k/obj (pvnet-rendering)

├── VOCdevkit

```

* `lm_imgn` comes from [DeepIM](https://github.com/liyi14/mx-DeepIM), which can be downloaded here ([BaiduNetDisk](https://pan.baidu.com/s/1e9SJoqb0EmyqVLEVlbNQIA), password: vr0i | [Cloud.THU](https://cloud.tsinghua.edu.cn/f/22c4fba9c06f47d3aba4/), password: A097fN8a07ufn0u70).

* `lm_renders_blender` comes from [pvnet-rendering](https://github.com/zju3dv/pvnet-rendering), note that we do not need the fused data.

## Training GDR-Net

`./core/gdrn_modeling/train_gdrn.sh   (other args)`

Example:

```

./core/gdrn_modeling/train_gdrn.sh configs/gdrn/lm/a6_cPnP_lm13.py 0  # multiple gpus: 0,1,2,3

# add --resume if you want to resume from an interrupted experiment.

```

Our trained GDR-Net models can be found here ([BaiduNetDisk](https://pan.baidu.com/s/1_MEZJBd67hdxcE8JzmnOtA), password: kedv | [Cloud.THU](https://cloud.tsinghua.edu.cn/d/b17100d09a9f4a208549/), password: 0M,oadfu9uIU). 


_{^{(Note that the models for BOP setup in the supplement were trained using a refactored version of this repo (not compatible), they are slightly better than the models provided here.)}}

## Evaluation

`./core/gdrn_modeling/test_gdrn.sh    (other args)`

Example:

```

./core/gdrn_modeling/test_gdrn.sh configs/gdrn/lmo/a6_cPnP_AugAAETrunc_BG0.5_lmo_real_pbr0.1_40e.py 0 output/gdrn/lmo/a6_cPnP_AugAAETrunc_BG0.5_lmo_real_pbr0.1_40e/gdrn_lmo_real_pbr.pth

```

## Citation

If you find this useful in your research, please consider citing:

```

@InProceedings{Wang_2021_GDRN,

    title     = {{GDR-Net}: Geometry-Guided Direct Regression Network for Monocular 6D Object Pose Estimation},

    author    = {Wang, Gu and Manhardt, Fabian and Tombari, Federico and Ji, Xiangyang},

    booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},

    month     = {June},

    year      = {2021},

    pages     = {16611-16621}

}

```

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/THU-DA-6D-Pose-Group/GDR-Net

Awesome Lists containing this project

README