https://github.com/YeRen123455/Infrared-Small-Target-Detection

Last synced: about 1 month ago
JSON representation

Host: GitHub
URL: https://github.com/YeRen123455/Infrared-Small-Target-Detection
Owner: YeRen123455
License: mit
Created: 2021-05-18T09:12:37.000Z (over 4 years ago)
Default Branch: master
Last Pushed: 2023-05-13T04:52:42.000Z (over 2 years ago)
Last Synced: 2024-05-14T00:54:19.587Z (over 1 year ago)
Language: Python
Size: 34.1 MB
Stars: 285
Watchers: 7
Forks: 42
Open Issues: 19
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

awesome-object-detection-datasets - NUDT-SIRST - Small-Target-Detection?style=social"/> : "Dense Nested Attention Network for Infrared Small Target Detection". (**[arXiv 2021](https://arxiv.org/abs/2106.00487)**) (Infrared Image Datasets)
awesome-yolo-object-detection - NUDT-SIRST - Small-Target-Detection?style=social"/> : "Dense Nested Attention Network for Infrared Small Target Detection". (**[arXiv 2021](https://arxiv.org/abs/2106.00487)**) (Anti-UAV Datasets)

README

          # Dense Nested Attention Network for Infrared Small Target Detection

## Good News! Our paper has been accepted by `IEEE Transaction on Image Processing`. Our team will release more interesting works and applications on SIRST soon. Please keep following our repository.

![outline](overall_structure.png)

## Algorithm Introduction

Dense Nested Attention Network for Infrared Small Target Detection, Boyang Li, Chao Xiao, Longguang Wang, and Yingqian Wang, arxiv 2021 [[Paper]](https://arxiv.org/pdf/2106.00487.pdf)

We propose a dense nested attention network (DNANet) to achieve accurate single-frame infrared small target detection and develop an open-sourced infrared small target dataset (namely, NUDT-SIRST) in this paper. Experiments on both public (e.g., NUAA-SIRST, NUST-SIRST) and our self-developed datasets demonstrate the effectiveness of our method. The contribution of this paper are as follows:

1. We propose a dense nested attention network (namely, DNANet) to maintain small targets in deep layers.

2. An open-sourced dataset (i.e., NUDT-SIRST) with rich targets.

3. Performing well on all existing SIRST datasets.

## Dataset Introduction

NUDT-SIRST is a synthesized dataset, which contains 1327 images with resolution of 256x256. The advantage of synthesized dataset compared to real dataset lies in three aspets:

1. Accurate annotations.

2. Massive generation with low cost (i.e., time and money).

3. Numerous categories of target, rich target sizes, diverse clutter backgrounds.

## Citation

If you find the code useful, please consider citing our paper using the following BibTeX entry.

```

@article{DNANet,

  title={Dense nested attention network for infrared small target detection},

  author={Li, Boyang and Xiao, Chao and Wang, Longguang and Wang, Yingqian and Lin, Zaiping and Li, Miao and An, Wei and Guo, Yulan},

  journal={IEEE Transactions on Image Processing},

  year={2023},

  volume={32},

  pages={1745-1758},

  publisher={IEEE}

}

```

## Prerequisite

* Tested on Ubuntu 16.04, with Python 3.7, PyTorch 1.7, Torchvision 0.8.1, CUDA 11.1, and 1x NVIDIA 3090 and also 

* Tested on Windows 10  , with Python 3.6, PyTorch 1.1, Torchvision 0.3.0, CUDA 10.0, and 1x NVIDIA 1080Ti.

* [The NUDT-SIRST download dir](https://pan.baidu.com/s/1WdA_yOHDnIiyj4C9SbW_Kg?pwd=nudt) (Extraction Code: nudt)

* [The NUAA-SIRST download dir](https://github.com/YimianDai/sirst) [[ACM]](https://arxiv.org/pdf/2009.14530.pdf)

* [The NUST-SIRST download dir](https://github.com/wanghuanphd/MDvsFA_cGAN) [[MDvsFA]](https://openaccess.thecvf.com/content_ICCV_2019/papers/Wang_Miss_Detection_vs._False_Alarm_Adversarial_Learning_for_Small_Object_ICCV_2019_paper.pdf)

## Usage

#### On windows:

```

Click on train.py and run it. 

```

#### On Ubuntu:

#### 1. Train.

```bash

python train.py --base_size 256 --crop_size 256 --epochs 1500 --dataset [dataset-name] --split_method 50_50 --model [model name] --backbone resnet_18  --deep_supervision True --train_batch_size 16 --test_batch_size 16 --mode TXT

```

#### 2. Test.

```bash

python test.py --base_size 256 --crop_size 256 --st_model [trained model path] --model_dir [model_dir] --dataset [dataset-name] --split_method 50_50 --model [model name] --backbone resnet_18  --deep_supervision True --test_batch_size 1 --mode TXT 

```

#### (Optional 1) Visulize your predicts.

```bash

python visulization.py --base_size 256 --crop_size 256 --st_model [trained model path] --model_dir [model_dir] --dataset [dataset-name] --split_method 50_50 --model [model name] --backbone resnet_18  --deep_supervision True --test_batch_size 1 --mode TXT 

```

#### (Optional 2) Test and visulization.

```bash

python test_and_visulization.py --base_size 256 --crop_size 256 --st_model [trained model path] --model_dir [model_dir] --dataset [dataset-name] --split_method 50_50 --model [model name] --backbone resnet_18  --deep_supervision True --test_batch_size 1 --mode TXT 

```

#### (Optional 3) Demo (with your own IR image).

```bash

python demo.py --base_size 256 --crop_size 256 --img_demo_dir [img_demo_dir] --img_demo_index [image_name]  --model [model name] --backbone resnet_18  --deep_supervision True --test_batch_size 1 --mode TXT  --suffix [img_suffix]

```

## Results and Trained Models

#### Qualitative Results

![outline](Qualitative_result.png)

#### Quantative Results 

on NUDT-SIRST

| Model         | mIoU (x10(-2)) | Pd (x10(-2))|  Fa (x10(-6)) ||

| ------------- |:-------------:|:-----:|:-----:|:-----:|

| DNANet-VGG-10 | 85.23 | 96.95 | 6.782|

| DNANet-ResNet-10| 86.36 | 97.39 | 6.897 |

| DNANet-ResNet-18| 87.09 | 98.73 | 4.223 |

| DNANet-ResNet-18| 88.61 | 98.42 | 4.30 | [[Weights]](https://drive.google.com/file/d/1NDvjOiWecfWNPaO12KeIgiJMTKSFS6wj/view?usp=sharing) |

| DNANet-ResNet-34| 86.87 | 97.98 | 3.710 |

on NUAA-SIRST

| Model         | mIoU (x10(-2)) | Pd (x10(-2))|  Fa (x10(-6)) ||

| ------------- |:-------------:|:-----:|:-----:|:-----:|

| DNANet-VGG-10 | 74.96 | 97.34 | 26.73 |

| DNANet-ResNet-10| 76.24 | 97.71 | 12.80 |

| DNANet-ResNet-18| 77.47 | 98.48 | 2.353 |

| DNANet-ResNet-18| 79.26 | 98.48 | 2.30 | [[Weights]](https://drive.google.com/file/d/1W0jFN9ZlaIdGFemYKi34tmJfGxjUGCRc/view?usp=sharing) |

| DNANet-ResNet-34| 77.54 | 98.10 | 2.510 |

on NUST-SIRST

| Model         | mIoU (x10(-2)) | Pd (x10(-2))|  Fa (x10(-6)) ||

| ------------- |:-------------:|:-----:|:-----:|:-----:|

| DNANet-ResNet-18| 46.73 | 81.29 | 33.87 | [[Weights]](https://drive.google.com/file/d/1TF0bZRMsGuKzMhlHKH1LygScBveMcCS2/view?usp=sharing) |

*This code is highly borrowed from [ACM](https://github.com/YimianDai/open-acm). Thanks to Yimian Dai.

*The overall repository style is highly borrowed from [PSA](https://github.com/jiwoon-ahn/psa). Thanks to jiwoon-ahn.

## Referrences

1. Dai Y, Wu Y, Zhou F, et al. Asymmetric contextual modulation for infrared small target detection[C]//Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision. 2021: 950-959. [[code]](https://github.com/YimianDai/open-acm) 

2. Zhou Z, Siddiquee M M R, Tajbakhsh N, et al. Unet++: Redesigning skip connections to exploit multiscale features in image segmentation[J]. IEEE transactions on medical imaging, 2019, 39(6): 1856-1867. [[code]](https://github.com/MrGiovanni/UNetPlusPlus)

3. He K, Zhang X, Ren S, et al. Deep residual learning for image recognition[C]//Proceedings of the IEEE conference on computer vision and pattern recognition. 2016: 770-778. [[code]](https://github.com/rwightman/pytorch-image-models)

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/YeRen123455/Infrared-Small-Target-Detection

Awesome Lists containing this project

README