[ICML2021] What Makes for End-to-End Object Detection
https://github.com/PeizeSun/OneNet
- Host: GitHub
- URL: https://github.com/PeizeSun/OneNet
- Owner: PeizeSun
- License: MIT
- Created: 2020-12-04T13:47:25.000Z (over 4 years ago)
- Default Branch: main
- Last Pushed: 2023-04-30T19:11:20.000Z (almost 2 years ago)
- Last Synced: 2024-07-31T21:53:44.991Z (9 months ago)
- Language: Python
- Size: 1.49 MB
- Stars: 649
- Watchers: 20
- Forks: 75
- Open Issues: 4
Metadata Files:
- Readme: README.md
- Contributing: .github/CONTRIBUTING.md
- License: LICENSE
- Code of conduct: .github/CODE_OF_CONDUCT.md
Awesome Lists containing this project:
- awesome-anchor-free-object-detection - OneNet, "What Makes for End-to-End Object Detection?" (**[ICML 2021](https://proceedings.mlr.press/v139/sun21b.html)**) (Frameworks)
README
## OneNet: What Makes for End-to-End Object Detection?
[License: MIT](https://opensource.org/licenses/MIT)

Comparisons of different label assignment methods. H and W are the height and width of the feature map, respectively, and K is the number of object categories. Previous works on one-stage object detection assign labels by position cost only, such as (a) box IoU or (b) point distance between a sample and the ground truth. In our method, (c) a classification cost is additionally introduced. We discover that **classification cost is the key to the success of end-to-end detection**. Without classification cost, location cost alone leads to redundant boxes with high confidence scores at inference, making NMS post-processing a necessary component.

## Introduction
arXiv: [OneNet: Towards End-to-End One-Stage Object Detection](https://arxiv.org/abs/2012.05780v1)

Paper: [What Makes for End-to-End Object Detection?](https://arxiv.org/abs/2012.05780)
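
To make the label-assignment comparison above concrete, here is a minimal sketch of a matching cost that combines a classification term with a location term. The function name, cost weights, and the plain L1 location term are illustrative assumptions, not the repository's implementation (which may also use focal-loss and GIoU terms).

```python
import torch

def matching_cost(pred_logits, pred_boxes, gt_labels, gt_boxes,
                  w_cls=2.0, w_l1=5.0):
    """Illustrative cost between every candidate sample and every ground truth.

    pred_logits: (N, K) classification logits for N candidate samples
    pred_boxes:  (N, 4) predicted boxes (x1, y1, x2, y2)
    gt_labels:   (M,)   ground-truth class indices
    gt_boxes:    (M, 4) ground-truth boxes
    Returns an (N, M) cost matrix; each ground truth is assigned to the single
    sample with minimum cost, so NMS is not needed at inference.
    """
    # Classification cost: negative predicted probability of the GT class.
    prob = pred_logits.sigmoid()                      # (N, K)
    cost_cls = -prob[:, gt_labels]                    # (N, M)

    # Location cost: L1 distance between boxes (a GIoU term is often added).
    cost_l1 = torch.cdist(pred_boxes, gt_boxes, p=1)  # (N, M)

    return w_cls * cost_cls + w_l1 * cost_l1

# Each ground truth takes its single best sample:
# assignment = matching_cost(logits, boxes, labels, gts).argmin(dim=0)
```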
## Updates
- (28/06/2021) OneNet.RetinaNet and OneNet.FCOS on CrowdHuman are available.
- (27/06/2021) OneNet.RetinaNet and OneNet.FCOS are available.
- (11/12/2020) Higher performance for OneNet is reported by disabling gradient clipping.

## Coming Soon
- [x] Provide models and logs
- [ ] Support Caffe, ONNX, and TensorRT
- [ ] Support MobileNet
## Models on COCO
We provide two types of models:
- dcn, for high accuracy
- nodcn, for easy deployment

Method | inf_time | train_time | box AP | download
--- |:---:|:---:|:---:|:---:
[R18_dcn](projects/OneNet/configs/onenet.res18.dcn.yaml) | 109 FPS | 20h | 29.9 | [model](https://drive.google.com/drive/folders/1LnHMj7pkJhODeZTNHW-UcUZxybKbQmTB) \| [log](https://drive.google.com/drive/folders/1LnHMj7pkJhODeZTNHW-UcUZxybKbQmTB)
[R18_nodcn](projects/OneNet/configs/onenet.res18.nodcn.yaml) | 138 FPS | 13h | 27.7 | [model](https://drive.google.com/drive/folders/1LnHMj7pkJhODeZTNHW-UcUZxybKbQmTB) \| [log](https://drive.google.com/drive/folders/1LnHMj7pkJhODeZTNHW-UcUZxybKbQmTB)
[R50_dcn](projects/OneNet/configs/onenet.res50.dcn.yaml) | 67 FPS | 36h | 35.7 | [model](https://drive.google.com/drive/folders/1LnHMj7pkJhODeZTNHW-UcUZxybKbQmTB) \| [log](https://drive.google.com/drive/folders/1LnHMj7pkJhODeZTNHW-UcUZxybKbQmTB)
[R50_nodcn](projects/OneNet/configs/onenet.res50.nodcn.yaml) | 73 FPS | 29h | 32.7 | [model](https://drive.google.com/drive/folders/1LnHMj7pkJhODeZTNHW-UcUZxybKbQmTB) \| [log](https://drive.google.com/drive/folders/1LnHMj7pkJhODeZTNHW-UcUZxybKbQmTB)
[R50_RetinaNet](projects/OneNet/configs/onenet.retinanet.res50.yaml) | 26 FPS | 31h | 37.5 | [model](https://drive.google.com/drive/folders/1LnHMj7pkJhODeZTNHW-UcUZxybKbQmTB) \| [log](https://drive.google.com/drive/folders/1LnHMj7pkJhODeZTNHW-UcUZxybKbQmTB)
[R50_FCOS](projects/OneNet/configs/onenet.fcos.res50.yaml) | 27 FPS | 21h | 38.9 | [model](https://drive.google.com/drive/folders/1LnHMj7pkJhODeZTNHW-UcUZxybKbQmTB) \| [log](https://drive.google.com/drive/folders/1LnHMj7pkJhODeZTNHW-UcUZxybKbQmTB)

If the download links are invalid, models and logs are also available in the [GitHub Release](https://github.com/PeizeSun/OneNet/releases/tag/v0.1) and on [Baidu Drive](https://pan.baidu.com/s/1f0lQ63UEBD-qbHTrsD97hA) with extraction code nhr8.
#### Notes
- We observe about 0.3 AP noise.
- The training time and inference time are measured on 8 NVIDIA V100 GPUs. We observe that the same type of GPU in different clusters may take different amounts of time.
- We use models pre-trained on ImageNet with torchvision, and we provide the converted [torchvision ResNet-18.pkl](https://drive.google.com/drive/folders/1LnHMj7pkJhODeZTNHW-UcUZxybKbQmTB?usp=sharing) model. More details can be found in [the conversion script](tools/convert-torchvision-to-d2.py).
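
For reference, a minimal sketch of producing a torchvision checkpoint that the conversion script can consume. This assumes the script accepts a plain `torch.save`'d state_dict as input; the file names and the invocation shown in the comment are illustrative.

```python
# Minimal sketch: export torchvision's ImageNet-pretrained ResNet-18 weights
# so they can be fed to tools/convert-torchvision-to-d2.py.
# Assumption: the script takes a torch.save'd state_dict as input.
import torch
import torchvision

model = torchvision.models.resnet18(pretrained=True)
torch.save(model.state_dict(), "resnet18.pth")
# Hypothetical invocation of the conversion script:
#   python tools/convert-torchvision-to-d2.py resnet18.pth resnet18.pkl
```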
## Models on CrowdHuman

Method | inf_time | train_time | AP50 | mMR | recall | download
--- |:---:|:---:|:---:|:---:|:---:|:---:
[R50_RetinaNet](projects/OneNet/configs/onenet.retinanet.res50.crowdhuman.yaml) | 26 FPS | 11.5h | 90.9 | 48.8 | 98.0 |[model](https://drive.google.com/drive/folders/1LnHMj7pkJhODeZTNHW-UcUZxybKbQmTB) \| [log](https://drive.google.com/drive/folders/1LnHMj7pkJhODeZTNHW-UcUZxybKbQmTB)
[R50_FCOS](projects/OneNet/configs/onenet.fcos.res50.crowdhuman.yaml) | 27 FPS | 4.5h | 90.6 | 48.6 | 97.7 | [model](https://drive.google.com/drive/folders/1LnHMj7pkJhODeZTNHW-UcUZxybKbQmTB) \| [log](https://drive.google.com/drive/folders/1LnHMj7pkJhODeZTNHW-UcUZxybKbQmTB)

If the download links are invalid, models and logs are also available in the [GitHub Release](https://github.com/PeizeSun/OneNet/releases/tag/v0.1) and on [Baidu Drive](https://pan.baidu.com/s/1f0lQ63UEBD-qbHTrsD97hA) with extraction code nhr8.
#### Notes
- The evaluation code is built on top of [cvpods](https://github.com/Megvii-BaseDetection/cvpods).
- The default evaluation code used during training should be ignored, since it only considers at most 100 objects per image, while a CrowdHuman image can contain more than 100 objects (see the sketch after this list).
- The training time and inference time are measured on 8 NVIDIA V100 GPUs. We observe that the same type of GPU in different clusters may take different amounts of time.
- Further training steps are described in [crowdhumantools](https://github.com/PeizeSun/OneNet/tree/main/projects/OneNet/crowdhumantools).
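
One way to evaluate with more than 100 detections per image is to raise detectron2's per-image detection cap. A minimal sketch, assuming the standard detectron2 config key `TEST.DETECTIONS_PER_IMAGE`; the value 500 is illustrative and not taken from this repository.

```python
# Minimal sketch: raise the per-image detection cap for crowded scenes.
# TEST.DETECTIONS_PER_IMAGE is a standard detectron2 key (default 100).
from detectron2.config import get_cfg

cfg = get_cfg()
cfg.TEST.DETECTIONS_PER_IMAGE = 500  # CrowdHuman images can contain >100 people
```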
## Installation

The codebase is built on top of [Detectron2](https://github.com/facebookresearch/detectron2) and [DETR](https://github.com/facebookresearch/detr).

#### Requirements
- Linux or macOS with Python ≥ 3.6
- PyTorch ≥ 1.5 and [torchvision](https://github.com/pytorch/vision/) that matches the PyTorch installation.
You can install them together following the instructions at [pytorch.org](https://pytorch.org) to make sure of this (a quick check is sketched after this list).
- OpenCV is optional; it is needed only for the demo and visualization.
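
A quick way to sanity-check the environment before building; this is purely illustrative and simply prints whatever is installed locally.

```python
# Minimal environment check: confirm PyTorch >= 1.5, a matching torchvision,
# and CUDA availability before building the project.
import torch
import torchvision

print("torch:", torch.__version__)
print("torchvision:", torchvision.__version__)
print("CUDA available:", torch.cuda.is_available())
```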
#### Steps

1. Install and build libs
```
git clone https://github.com/PeizeSun/OneNet.git
cd OneNet
python setup.py build develop
```

2. Link the COCO dataset path to OneNet/datasets/coco
```
mkdir -p datasets/coco
ln -s /path_to_coco_dataset/annotations datasets/coco/annotations
ln -s /path_to_coco_dataset/train2017 datasets/coco/train2017
ln -s /path_to_coco_dataset/val2017 datasets/coco/val2017
```

3. Train OneNet
```
python projects/OneNet/train_net.py --num-gpus 8 \
--config-file projects/OneNet/configs/onenet.res50.dcn.yaml
```

4. Evaluate OneNet
```
python projects/OneNet/train_net.py --num-gpus 8 \
--config-file projects/OneNet/configs/onenet.res50.dcn.yaml \
--eval-only MODEL.WEIGHTS path/to/model.pth
```

5. Visualize OneNet
```
python demo/demo.py \
--config-file projects/OneNet/configs/onenet.res50.dcn.yaml \
--input path/to/images --output path/to/save_images --confidence-threshold 0.4 \
--opts MODEL.WEIGHTS path/to/model.pth
```

## License
OneNet is released under the MIT License.
## Citing
If you use OneNet in your research or wish to refer to the baseline results published here, please use the following BibTeX entry:
```BibTeX
@InProceedings{peize2020onenet,
  title     = {What Makes for End-to-End Object Detection?},
  author    = {Sun, Peize and Jiang, Yi and Xie, Enze and Shao, Wenqi and Yuan, Zehuan and Wang, Changhu and Luo, Ping},
  booktitle = {Proceedings of the 38th International Conference on Machine Learning},
  pages     = {9934--9944},
  year      = {2021},
  volume    = {139},
  series    = {Proceedings of Machine Learning Research},
  publisher = {PMLR},
}
```