Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/huanglianghua/video-detection-paper-list
Video object detection benchmark.
https://github.com/huanglianghua/video-detection-paper-list
detection video video-detector-benchmark
Last synced: 2 months ago
JSON representation
Video object detection benchmark.
- Host: GitHub
- URL: https://github.com/huanglianghua/video-detection-paper-list
- Owner: huanglianghua
- Created: 2019-01-10T06:43:13.000Z (about 6 years ago)
- Default Branch: master
- Last Pushed: 2019-01-24T14:00:24.000Z (almost 6 years ago)
- Last Synced: 2024-07-31T04:01:58.159Z (5 months ago)
- Topics: detection, video, video-detector-benchmark
- Homepage:
- Size: 9.77 KB
- Stars: 19
- Watchers: 3
- Forks: 2
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
- awesomeai - Video Object Detection
- awesome-ai-awesomeness - Video Object Detection
- awesome-ai-awesomeness - Video Object Detection
README
# Video Object Detection on ILSVRC VID Dataset
## Video Detector Benchmark
Evaluation results on ILSVRC VID validation set. All entries are ordered by the mAP scores.
| Methods | mAP(%) | Runtime(ms/fps) | Venue |
|:---------------:|:----------:|:-----------------:|:---------:|
| STMN (ResNet-101) | __80.5__ | 75 ms | ECCV'18 |
| MANet + SeqNMS | 80.3 | - | ECCV'18 |
| ST-Lattice (denser) | 79.6 | 20 fps | CVPR'18 |
| ST-Lattice (sparser) | 79.0 | __62 fps__ | CVPR'18 |
| D & T (τ=1) | 79.8 | 5.3 fps | ICCV'17 |
| D & T (τ=10) | 78.6 | 53 fps | ICCV'17 |
| THPVOD | 78.6 | 13 fps | CVPR'18 |
| MANet | 78.1 | 202 ms | ECCV'18 |
| FGFA | 76.5 | 733 ms | ICCV'17 |
| D (& T loss) | 75.8 | - | ICCV'17 |
| D (R-FCN) | 74.2 | - | ICCV'17 |
| TCNN | 73.8 | - | TCSVR'17 |
| R-FCN | 73.6 | - | NIPS'16 |
| DFF | 72.8 | - | CVPR'17 |
| TPN-LSTM | 68.4 | 2.1 fps | CVPR'17 |
| STMN (VGG-16) | 61.7 | - | ECCV'18 |## Video Object Detection
### ECCV 2018
* __MANet__: Shiyao Wang, Yucong Zhou, Junjie Yan, Zhidong Deng. "Fully Motion-Aware Network for Video Object Detection." ECCV 2018.
[[pdf](http://openaccess.thecvf.com/content_ECCV_2018/papers/Shiyao_Wang_Fully_Motion-Aware_Network_ECCV_2018_paper.pdf)]
[[code](https://github.com/wangshy31/MANet_for_Video_Object_Detection)]* __STMN__: Fanyi Xiao, Yong Jae Lee. "Video Object Detection with an Aligned Spatial-Temporal Memory." ECCV 2018.
[[pdf](http://openaccess.thecvf.com/content_ECCV_2018/papers/Fanyi_Xiao_Object_Detection_with_ECCV_2018_paper.pdf)]
[[project](http://fanyix.cs.ucdavis.edu/project/stmn/project.html)]
[[code](https://github.com/fanyix/STMN)]### CVPR 2018
* __TAFM__: Mason Liu, Menglong Zhu. "Mobile Video Object Detection With Temporally-Aware Feature Maps." CVPR 2018.
[[pdf](http://openaccess.thecvf.com/content_cvpr_2018/papers/Liu_Mobile_Video_Object_CVPR_2018_paper.pdf)]* __THPVOD__: Xizhou Zhu, Jifeng Dai, Lu Yuan, Yichen Wei. "Towards High Performance Video Object Detection." CVPR 2018.
[[pdf](http://openaccess.thecvf.com/content_cvpr_2018/papers/Zhu_Towards_High_Performance_CVPR_2018_paper.pdf)]* __ST-Lattice__: Kai Chen, Jiaqi Wang, Shuo Yang, Xingcheng Zhang, Yuanjun Xiong, Chen Change Loy, Dahua Lin. "Optimizing Video Object Detection via a Scale-Time Lattice." CVPR 2018.
[[pdf](http://openaccess.thecvf.com/content_cvpr_2018/papers/Chen_Optimizing_Video_Object_CVPR_2018_paper.pdf)]
[[project](http://mmlab.ie.cuhk.edu.hk/projects/ST-Lattice/)]
[[code](https://github.com/hellock/scale-time-lattice)]### ICCV 2017
* __FGFA__: Xizhou Zhu, Yujie Wang, Jifeng Dai, Lu Yuan, Yichen Wei. "Flow-Guided Feature Aggregation for Video Object Detection." ICCV 2017.
[[pdf](http://openaccess.thecvf.com/content_ICCV_2017/papers/Zhu_Flow-Guided_Feature_Aggregation_ICCV_2017_paper.pdf)]
[[video](https://www.youtube.com/watch?v=R2h3DbTPvVg&feature=youtu.be)]
[[code](https://github.com/msracver/Flow-Guided-Feature-Aggregation)]* __TD-Graph LSTM__: Yuan Yuan, Xiaodan Liang, Xiaolong Wang, Dit-Yan Yeung, Abhinav Gupta. "Temporal Dynamic Graph LSTM for Action-Driven Video Object Detection." ICCV 2017.
[[pdf](http://openaccess.thecvf.com/content_ICCV_2017/papers/Yuan_Temporal_Dynamic_Graph_ICCV_2017_paper.pdf)]
[[supp](http://openaccess.thecvf.com/content_ICCV_2017/supplemental/Yuan_Temporal_Dynamic_Graph_ICCV_2017_supplemental.pdf)]* __a\_LSTM__: Yongyi Lu, Cewu Lu, Chi-Keung Tang. "Online Video Object Detection Using Association LSTM." ICCV 2017.
[[pdf](http://openaccess.thecvf.com/content_ICCV_2017/papers/Lu__Online_Video_ICCV_2017_paper.pdf)]* __ULFV__: Ioana Croitoru, Simion-Vlad Bogolin, Marius Leordeanu. "Unsupervised Learning From Video to Detect Foreground Objects in Single Images." ICCV 2017.
[[pdf](http://openaccess.thecvf.com/content_ICCV_2017/papers/Croitoru_Unsupervised_Learning_From_ICCV_2017_paper.pdf)]* __D & T__: Christoph Feichtenhofer, Axel Pinz and Andrew Zisserman. "Detect to Track and Track to Detect." ICCV 2017.
[[pdf](https://www.robots.ox.ac.uk/~vgg/publications/2017/Feichtenhofer17/feichtenhofer17.pdf)]
[[code](https://github.com/feichtenhofer/detect-track)]### CVPR 2017
* __VPNs__: Varun Jampani, Raghudeep Gadde, Peter V. Gehler. "Video Propagation Networks." CVPR 2017.
[[pdf](http://openaccess.thecvf.com/content_cvpr_2017/papers/Jampani_Video_Propagation_Networks_CVPR_2017_paper.pdf)]
[[supp](http://openaccess.thecvf.com/content_cvpr_2017/supplemental/Jampani_Video_Propagation_Networks_2017_CVPR_supplemental.pdf)]* __FCN-LSTM__: Huazhe Xu, Yang Gao, Fisher Yu, Trevor Darrell. "End-To-End Learning of Driving Models From Large-Scale Video Datasets." CVPR 2017.
[[pdf](http://openaccess.thecvf.com/content_cvpr_2017/papers/Xu_End-To-End_Learning_of_CVPR_2017_paper.pdf)]* __TPN-LSTM__: Kang Kai, Li Hongsheng, Xiao Tong, et al. "Object Detection in Videos with Tubelet Proposal Networks." CVPR 2017.
[[pdf](https://arxiv.org/pdf/1702.06355)]### ECCV 2016
* Hanul Kim, Chang-Su Kim. "CDT: Cooperative Detection and Tracking for Tracing Multiple Objects in Video Sequences."
[[pdf](http://www.eccv2016.org/files/posters/P-4A-31.pdf)]* Koteswar Jerripothula, Jianfei Cai, Junsong Yuan. "CATS: Co-saliency Activated Tracklet Selection for Video Co-localization."
[[pdf](http://www.eccv2016.org/files/posters/P-4A-44.pdf)]### CVPR 2016
* __TCN__: Kai Kang, Wanli Ouyang, Hongsheng Li, Xiaogang Wang. "Object Detection From Video Tubelets With Convolutional Neural Networks."
[[project](https://github.com/myfavouritekk/vdetlib)]
[[pdf](https://www.cv-foundation.org/openaccess/content_cvpr_2016/papers/Kang_Object_Detection_From_CVPR_2016_paper.pdf)]### ArXiv 2016
* __SeqNMS__: Wei Han, Pooya Khorrami, Tom Le Paine, et al. "Seq-NMS for Video Object Detection." ArXiv 2016.
[[pdf](https://arxiv.org/abs/1602.08465)]### ICCV 2015
* __UnsupVideo__: Suha Kwak, Minsu Cho, Ivan Laptev, Jean Ponce, Cordelia Schmid. "Unsupervised Object Discovery and Tracking in Video Collections."
[[pdf](http://openaccess.thecvf.com/content_iccv_2015/papers/Kwak_Unsupervised_Object_Discovery_ICCV_2015_paper.pdf)]* __UnsupParse__: Ozan Sener, Amir R. Zamir, Silvio Savarese, Ashutosh Saxena. "Unsupervised Semantic Parsing of Video Collections."
[[pdf](http://openaccess.thecvf.com/content_iccv_2015/papers/Sener_Unsupervised_Semantic_Parsing_ICCV_2015_paper.pdf)]### ArXiv
* Tuan-Hung Vu, Anton Osokin, Ivan Laptev. "Tube-CNN: Modeling temporal evolution of appearance for object detection in video."
[[pdf](https://arxiv.org/pdf/1812.02619)]* Shiyao Wang, Hongchao Lu, Pavel Dmitriev, Zhidong Deng. "Fast Object Detection in Compressed Video."
[[pdf](https://arxiv.org/pdf/1811.11057)]* Hao Luo, Wenxuan Xie, Xinggang Wang, Wenjun Zeng. "Detect or Track: Towards Cost-Effective Video Object Detection/Tracking."
[[pdf](https://arxiv.org/pdf/1811.05340)]* Vít Růžička, Franz Franchetti. "Fast and accurate object detection in high resolution 4K and 8K video using GPUs."
[[pdf](https://arxiv.org/pdf/1810.10551)]