Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/jessemelpolio/Faster_RCNN_for_DOTA
Code used for training Faster R-CNN on DOTA
https://github.com/jessemelpolio/Faster_RCNN_for_DOTA
aerial detection dota faster-rcnn
Last synced: 2 months ago
JSON representation
Code used for training Faster R-CNN on DOTA
- Host: GitHub
- URL: https://github.com/jessemelpolio/Faster_RCNN_for_DOTA
- Owner: jessemelpolio
- License: apache-2.0
- Created: 2018-04-30T01:16:23.000Z (over 6 years ago)
- Default Branch: master
- Last Pushed: 2018-07-07T07:38:24.000Z (over 6 years ago)
- Last Synced: 2024-08-01T22:40:24.944Z (5 months ago)
- Topics: aerial, detection, dota, faster-rcnn
- Language: Python
- Homepage: https://arxiv.org/abs/1711.10398
- Size: 262 KB
- Stars: 334
- Watchers: 13
- Forks: 102
- Open Issues: 22
-
Metadata Files:
- Readme: README.md
- License: LICENSE
Awesome Lists containing this project
- Awesome-MXNet - Faster_RCNN_for_DOTA
README
## Disclaimer
This is the official repo of paper [_DOTA: A Large-scale Dataset for Object Detection in Aerial Images_](https://arxiv.org/abs/1711.10398). This repo contains code for training Faster R-CNN on oriented bounding boxes and horizontal bounding boxes as reported in our paper.This code is mostly modified by [Zhen Zhu](https://github.com/jessemelpolio) and [Jian Ding](https://github.com/dingjiansw101).
If you use these code in your project, please contain this repo in your paper or license. Please also cite our paper:
DOTA: A Large-scale Dataset for Object Detection in Aerial Images
Gui-Song Xia\*, Xiang Bai\*, Jian Ding, Zhen Zhu, Serge Belongie, Jiebo Luo, Mihai Datcu, Marcello Pelillo, Liangpei Zhang
In CVPR 2018. (* equal contributions)The code is built upon a fork of [Deformble Convolutional Networks](https://github.com/msracver/Deformable-ConvNets).
We use the Faster-RCNN part of it and make some modifications based on Faster-RCNN to regress a quadrangle. More details can be seen in our [paper](https://arxiv.org/abs/1711.10398).
Note that in order to achieve better results, we adjusted the order of the quadrilateral points in the annotations during training.
. You can refer to in DOTA\_devkit [get_best_begin_point](https://github.com/CAPTAIN-WHU/DOTA_devkit/blob/master/dota_utils.py).## Requirements: Software
1. MXNet from [the offical repository](https://github.com/dmlc/mxnet). We tested our code on [MXNet@(commit 62ecb60)](https://github.com/dmlc/mxnet/tree/62ecb60). Due to the rapid development of MXNet, it is recommended to checkout this version if you encounter any issues.
2. Python 2.7. We recommend using Anaconda2 to manage the environments and packages.
3. Some python packages: cython, opencv-python >= 3.2.0, easydict. If `pip` is set up on your system, those packages should be able to be fetched and installed by running:
```
pip install Cython
pip install opencv-python==3.2.0.6
pip install easydict==1.6
```
4. For Windows users, Visual Studio 2015 is needed to compile cython module.## Requirements: Hardware
Any NVIDIA GPUs with at least 4GB memory should be sufficient.
## Installation
1. Clone the repository
~~~
git clone https://github.com/jessemelpolio/Faster_RCNN_for_DOTA.git
~~~
2. For Windows users, run ``cmd .\init.bat``. For Linux user, run `sh ./init.sh`. The scripts will build cython module automatically and create some folders.## Demo & Deformable Model
We provide trained convnet models, including Faster R-CNN models trained on DOTA.
1. To use the demo with our pre-trained faster-rcnn models for DOTA, please download manually from [Google Drive](https://drive.google.com/open?id=1b6P-UMaBBpMPlcgvc38dMToPAa_Gyu6F), or [BaiduYun](https://pan.baidu.com/s/1YuB5ib7O-Ori1ZpiGf8Egw) and put it under the following folder.
Make sure it look like this:
```
./output/rcnn/DOTA_quadrangle/DOTA_quadrangle/train/rcnn_DOTA_quadrangle-0059.params
./output/rcnn/DOTA/DOTA/train/rcnn_DOTA_aligned-0032.params
```(Note) We also released the .state files recently. You can download them from [Google Drive](https://drive.google.com/open?id=1b6P-UMaBBpMPlcgvc38dMToPAa_Gyu6F), or [BaiduYun](https://pan.baidu.com/s/1YuB5ib7O-Ori1ZpiGf8Egw) and keep on fine-tuning our well-trained models on DOTA.
## Preparation for Training & Testing
1. Please download [DOTA](https://captain-whu.github.io/DOTA/dataset.html) dataset, use the [DOTA_devkit](https://github.com/CAPTAIN-WHU/DOTA_devkit) to split the data into patches. And make sure the split images look like this:
```
./path-to-dota-split/images
./path-to-dota-split/labelTxt
./path-to-dota-split/test.txt
./path-to-dota-split/train.txt
```
The test.txt and train.txt are name of the subimages(without suffix) for train and test respectively.2. Please download ImageNet-pretrained ResNet-v1-101 model manually from [OneDrive](https://1drv.ms/u/s!Am-5JzdW2XHzhqMEtxf1Ciym8uZ8sg), or [BaiduYun](https://pan.baidu.com/s/1YuB5ib7O-Ori1ZpiGf8Egw#list/path=%2F), or [Google drive](https://drive.google.com/open?id=1b6P-UMaBBpMPlcgvc38dMToPAa_Gyu6F), and put it under folder `./model`. Make sure it look like this:
```
./model/pretrained_model/resnet_v1_101-0000.params
```## Usage
1. All of our experiment settings (GPU #, dataset, etc.) are kept in yaml config files at folder `./experiments/faster_rcnn/cfgs`.
2. Set the "dataset_path" and "root_path" in DOTA.yaml and DOTA_quadrangle.yaml. The "dataset_path" should be the father folder of "images" and "labelTxt". The "root_path" is the path you want to save the cache data.
3. Set the scales and aspect ratios as your wish in DOTA.yaml and DOTA_quadrangle.yaml.
3. To conduct experiments, run the python scripts with the corresponding config file as input. For example, train and test on quadrangle in an end-to-end manner, run
```
python experiments/faster_rcnn/rcnn_dota_quadrangle_e2e.py --cfg experiments/faster_rcnn/cfgs/DOTA_quadrangle.yaml
```
4. Please find more details in config files and in our code.## Misc.
Code has been tested under:
- Ubuntu 14.04 with 4 Pascal Titan X GPUs and 32 Intel(R) Xeon(R) CPU E5-2620 v4 @ 2.10GHz
## Cite
If you use our project, please cite:
```
@InProceedings{Xia_2018_CVPR,
author = {Xia, Gui-Song and Bai, Xiang and Ding, Jian and Zhu, Zhen and Belongie, Serge and Luo, Jiebo and Datcu, Mihai and Pelillo, Marcello and Zhang, Liangpei},
title = {DOTA: A Large-Scale Dataset for Object Detection in Aerial Images},
booktitle = {The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2018}
}
```