https://github.com/jianzhnie/rfbnet_pytorch

RFBNet in Pytorch
https://github.com/jianzhnie/rfbnet_pytorch
mobilenet object-detection rfbnet ssd
Last synced: about 1 month ago
JSON representation
RFBNet in Pytorch
Host: GitHub
URL: https://github.com/jianzhnie/rfbnet_pytorch
Owner: jianzhnie
Created: 2019-11-28T14:15:39.000Z (almost 6 years ago)
Default Branch: master
Last Pushed: 2019-11-29T06:17:51.000Z (almost 6 years ago)
Last Synced: 2024-12-29T13:44:38.503Z (9 months ago)
Topics: mobilenet, object-detection, rfbnet, ssd
Language: Python
Size: 280 KB
Stars: 14
Watchers: 5
Forks: 8
Open Issues: 1
Metadata Files:
- Readme: README.md
Awesome Lists containing this project

README

          # Receptive Field Block Net for Accurate and Fast Object Detection

By Jianzhnie

## What's New ?

- 对原来的[代码](https://github.com/ruinmessi/RFBNet)进行了修改和优化

- 在看RFBNet的源代码时， 发现代码中的网络结构和论文中的结构不一致， 这一点在 [issuse: The difference of RFBNet in RFB_Net_vgg.py and Fig.5 ?](https://github.com/ruinmessi/RFBNet/issues/27) 中也有提到

- 修改了网络结构和论文一致

## 目录结构

```

RFBNet/

├── data

│   ├── coco.py

│   ├── config.py

│   ├── data_augment.py

│   ├── voc0712.py

│   └── voc_eval.py

├── layers

│   ├── functions

│   └── modules

├── models

│   ├── RFB_Net_E_vgg.py

│   ├── RFB_Net_mobile.py

│   └── RFB_Net_vgg.py

├── utils

│   ├── box_utils.py

│   └── timer.py

└── weights

├── test_RFB.py

├── train_RFB.py

├── main.py

```

### Introduction

Inspired by the structure of Receptive Fields (RFs) in human visual systems, we propose a novel RF Block (RFB) module, which takes the relationship between the size and eccentricity of RFs into account, to enhance the discriminability and robustness of features. We further  assemble the RFB module to the top of SSD with a lightweight CNN model, constructing the RFB Net detector. You can use the code to train/evaluate the RFB Net for object detection. For more details, please refer to our [ECCV paper](https://eccv2018.org/openaccess/content_ECCV_2018/papers/Songtao_Liu_Receptive_Field_Block_ECCV_2018_paper.pdf). 



 

 

### VOC2007 Test

| System |  *mAP* | **FPS** (Titan X Maxwell) |

|:-------|:-----:|:-------:|

| [Faster R-CNN (VGG16)](https://github.com/ShaoqingRen/faster_rcnn) | 73.2 | 7 | 

| [YOLOv2 (Darknet-19)](http://pjreddie.com/darknet/yolo/) | 78.6 | 40 | 

| [R-FCN (ResNet-101)](https://github.com/daijifeng001/R-FCN)| 80.5| 9 |

| [SSD300* (VGG16)](https://github.com/weiliu89/caffe/tree/ssd) | 77.2 | 46 |

| [SSD512* (VGG16)](https://github.com/weiliu89/caffe/tree/ssd) | 79.8 | 19 |

| RFBNet300 (VGG16) | **80.7** |**83** | 

| RFBNet512 (VGG16) | **82.2** | **38** | 

### COCO 

| System |  *test-dev mAP* | **Time** (Titan X Maxwell) |

|:-------|:-----:|:-------:|

| [Faster R-CNN++ (ResNet-101)](https://github.com/KaimingHe/deep-residual-networks) | 34.9 | 3.36s | 

| [YOLOv2 (Darknet-19)](http://pjreddie.com/darknet/yolo/) | 21.6 | 25ms| 

| [SSD300* (VGG16)](https://github.com/weiliu89/caffe/tree/ssd) | 25.1 | 22ms |

| [SSD512* (VGG16)](https://github.com/weiliu89/caffe/tree/ssd) | 28.8 | 53ms |

| [RetinaNet500 (ResNet-101-FPN)](https://arxiv.org/pdf/1708.02002.pdf) | 34.4| 90ms|

| RFBNet300 (VGG16) | **30.3** |**15ms** | 

| RFBNet512 (VGG16) | **33.8** | **30ms** |

| RFBNet512-E (VGG16) | **34.4** | **33ms** |  

### MobileNet

|System |COCO *minival mAP*| **\#parameters**|

|:-------|:-----:|:-------:|

|[SSD MobileNet](https://arxiv.org/abs/1704.04861)| 19.3| 6.8M|

|RFB MobileNet| 20.7 | 7.4M|

### Contents

1. [Installation](#installation)

2. [Datasets](#datasets)

3. [Training](#training)

4. [Evaluation](#evaluation)

5. [Models](#models)

## Installation

- Install [PyTorch-1.3.0](http://pytorch.org/) by selecting your environment on the website and running the appropriate command.

- Then download the dataset by following the [instructions](#download-voc2007-trainval--test) below and install opencv. 

```Shell

conda install opencv

```

Note: For training, we currently  support [VOC](http://host.robots.ox.ac.uk/pascal/VOC/) and [COCO](http://mscoco.org/). 

## Datasets

To make things easy, we provide simple VOC and COCO dataset loader that inherits `torch.utils.data.Dataset` making it fully compatible with the `torchvision.datasets` [API](http://pytorch.org/docs/torchvision/datasets.html).

### VOC Dataset

##### Download VOC2007 trainval & test

```Shell

# specify a directory for dataset to be downloaded into, else default is ~/data/

sh data/scripts/VOC2007.sh # 

```

##### Download VOC2012 trainval

```Shell

# specify a directory for dataset to be downloaded into, else default is ~/data/

sh data/scripts/VOC2012.sh # 

```

### COCO Dataset

Install the MS COCO dataset at /path/to/coco from [official website](http://mscoco.org/), default is ~/data/COCO. Following the [instructions](https://github.com/rbgirshick/py-faster-rcnn/blob/77b773655505599b94fd8f3f9928dbf1a9a776c7/data/README.md) to prepare *minival2014* and *valminusminival2014* annotations. All label files (.json) should be under the COCO/annotations/ folder. It should have this basic structure

```Shell

$COCO/

$COCO/cache/

$COCO/annotations/

$COCO/images/

$COCO/images/test2015/

$COCO/images/train2014/

$COCO/images/val2014/

```

*UPDATE*: The current COCO dataset has released new *train2017* and *val2017* sets which are just new splits of the same image sets. 

## Training

- First download the fc-reduced [VGG-16](https://arxiv.org/abs/1409.1556) PyTorch base network weights at:    https://s3.amazonaws.com/amdegroot-models/vgg16_reducedfc.pth

or from our [BaiduYun Driver](https://pan.baidu.com/s/1jIP86jW) 

- MobileNet pre-trained basenet is ported from [MobileNet-Caffe](https://github.com/shicai/MobileNet-Caffe), which achieves slightly better accuracy rates than the original one reported in the [paper](https://arxiv.org/abs/1704.04861), weight file is available at: https://drive.google.com/open?id=13aZSApybBDjzfGIdqN1INBlPsddxCK14 or [BaiduYun Driver](https://pan.baidu.com/s/1dFKZhdv).

- By default, we assume you have downloaded the file in the `RFBNet/weights` dir:

```Shell

mkdir weights

cd weights

wget https://s3.amazonaws.com/amdegroot-models/vgg16_reducedfc.pth

```

- To train RFBNet using the train script simply specify the parameters listed in `train_RFB.py` as a flag or manually change them.

```Shell

python train_RFB.py -d VOC -v RFB_vgg -s 300 

```

- Note:

  * -d: choose datasets, VOC or COCO.

  * -v: choose backbone version, RFB_VGG, RFB_E_VGG or RFB_mobile.

  * -s: image size, 300 or 512.

  * You can pick-up training from a checkpoint by specifying the path as one of the training parameters (again, see `train_RFB.py` for options)

  * If you want to reproduce the results in the paper, the VOC model should be trained about 240 epoches while the COCO version need 130 epoches.

  

## Evaluation

To evaluate a trained network:

```Shell

python test_RFB.py -d VOC -v RFB_vgg -s 300 --trained_model /path/to/model/weights

```

By default, it will directly output the mAP results on VOC2007 *test* or COCO *minival2014*. For VOC2012 *test* and COCO *test-dev* results, you can manually change the datasets in the `test_RFB.py` file, then save the detection results and submitted to the server. 

## Models

* 07+12 [RFB_Net300](https://drive.google.com/open?id=1apPyT3IkNwKhwuYyp432IJrTd0QHGbIN), [BaiduYun Driver](https://pan.baidu.com/s/1xOp3_FDk49YlJ-6C-xQfHw)

* COCO [RFB_Net300](https://pan.baidu.com/s/1vL_oNwhj0ksK593nApqDLw)

* COCO [RFB_Net512_E](https://drive.google.com/open?id=1pHDc6Xg9im3affOr7xaimXaRNOHtbaPM), [BaiduYun Driver](https://pan.baidu.com/s/1o8dxrom)

* COCO [RFB_Mobile Net300](https://drive.google.com/open?id=1vmbTWWgeMN_qKVWOeDfl1EN9c7yHPmOe), [BaiduYun Driver](https://pan.baidu.com/s/1bp4ik1L)
ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/jianzhnie/rfbnet_pytorch

Awesome Lists containing this project

README