{"id":13408867,"url":"https://github.com/amusi/awesome-object-detection","last_synced_at":"2026-02-18T22:37:08.039Z","repository":{"id":37426930,"uuid":"128416044","full_name":"amusi/awesome-object-detection","owner":"amusi","description":"Awesome Object Detection based on handong1587 github: https://handong1587.github.io/deep_learning/2015/10/09/object-detection.html","archived":false,"fork":false,"pushed_at":"2022-12-17T04:28:58.000Z","size":79,"stargazers_count":7499,"open_issues_count":7,"forks_count":1938,"subscribers_count":431,"default_branch":"master","last_synced_at":"2025-11-04T03:02:03.171Z","etag":null,"topics":["computer-vision","deep-learning","detection","object-detection","object-localisation"],"latest_commit_sha":null,"homepage":"","language":null,"has_issues":true,"has_wiki":null,"has_pages":null,"mirror_url":null,"source_name":null,"license":null,"status":null,"scm":"git","pull_requests_enabled":true,"icon_url":"https://github.com/amusi.png","metadata":{"files":{"readme":"README.md","changelog":null,"contributing":null,"funding":null,"license":null,"code_of_conduct":null,"threat_model":null,"audit":null,"citation":null,"codeowners":null,"security":null,"support":null}},"created_at":"2018-04-06T15:58:50.000Z","updated_at":"2025-11-03T09:23:16.000Z","dependencies_parsed_at":"2023-01-29T16:45:32.552Z","dependency_job_id":null,"html_url":"https://github.com/amusi/awesome-object-detection","commit_stats":null,"previous_names":[],"tags_count":0,"template":false,"template_full_name":null,"purl":"pkg:github/amusi/awesome-object-detection","repository_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/amusi%2Fawesome-object-detection","tags_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/amusi%2Fawesome-object-detection/tags","releases_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/amusi%2Fawesome-object-detection/releases","manifests_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/amusi%2Fawesome-object-detection/manifests","owner_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners/amusi","download_url":"https://codeload.github.com/amusi/awesome-object-detection/tar.gz/refs/heads/master","sbom_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories/amusi%2Fawesome-object-detection/sbom","scorecard":null,"host":{"name":"GitHub","url":"https://github.com","kind":"github","repositories_count":286080680,"owners_count":29597293,"icon_url":"https://github.com/github.png","version":null,"created_at":"2022-05-30T11:31:42.601Z","updated_at":"2026-02-18T22:25:43.180Z","status":"ssl_error","status_checked_at":"2026-02-18T22:25:42.766Z","response_time":162,"last_error":"SSL_connect returned=1 errno=0 peeraddr=140.82.121.5:443 state=error: unexpected eof while reading","robots_txt_status":"success","robots_txt_updated_at":"2025-07-24T06:49:26.215Z","robots_txt_url":"https://github.com/robots.txt","online":false,"can_crawl_api":true,"host_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub","repositories_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repositories","repository_names_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/repository_names","owners_url":"https://repos.ecosyste.ms/api/v1/hosts/GitHub/owners"}},"keywords":["computer-vision","deep-learning","detection","object-detection","object-localisation"],"created_at":"2024-07-30T20:00:55.996Z","updated_at":"2026-02-18T22:37:08.017Z","avatar_url":"https://github.com/amusi.png","language":null,"funding_links":[],"categories":["Uncategorized","Deep Learning","Awesome Lists","Others","Computer Vision","CV","Table of Contents","object detection","Object Detection","Summary","对象检测_分割","References and other awesome lists","Appendix: Object Detection for Natural Scene","References","Computer version","Multimodal, Vision-Language, and Generative AI"],"sub_categories":["Uncategorized","Problems","资源传输下载","RS Application","**[Tutorials/Blogs]**","Computer Vision"],"readme":"# object-detection\r\n\r\n[TOC]\r\n\r\nThis is a list of awesome articles about object detection. If you want to read the paper according to time, you can refer to [Date](Date.md).\r\n\r\n- R-CNN\r\n- Fast R-CNN\r\n- Faster R-CNN\r\n- Mask R-CNN\r\n- Light-Head R-CNN\r\n- Cascade R-CNN\r\n- SPP-Net\r\n- YOLO\r\n- YOLOv2\r\n- YOLOv3\r\n- YOLT\r\n- SSD\r\n- DSSD\r\n- FSSD\r\n- ESSD\r\n- MDSSD\r\n- Pelee\r\n- Fire SSD\r\n- R-FCN\r\n- FPN\r\n- DSOD\r\n- RetinaNet\r\n- MegDet\r\n- RefineNet\r\n- DetNet\r\n- SSOD\r\n- CornerNet\r\n- M2Det\r\n- 3D Object Detection\r\n- ZSD（Zero-Shot Object Detection）\r\n- OSD（One-Shot object Detection）\r\n- Weakly Supervised Object Detection\r\n- Softer-NMS\r\n- 2018\r\n- 2019\r\n- Other\r\n\r\nBased on handong1587's github: https://handong1587.github.io/deep_learning/2015/10/09/object-detection.html\r\n\r\n# Survey\r\n\r\n**Imbalance Problems in Object Detection: A Review**\r\n\r\n- intro: under review at TPAMI\r\n- arXiv: \u003chttps://arxiv.org/abs/1909.00169\u003e\r\n\r\n**Recent Advances in Deep Learning for Object Detection**\r\n\r\n- intro: From 2013 (OverFeat) to 2019 (DetNAS)\r\n- arXiv: \u003chttps://arxiv.org/abs/1908.03673\u003e\r\n\r\n**A Survey of Deep Learning-based Object Detection**\r\n\r\n- intro：From Fast R-CNN to NAS-FPN\r\n\r\n- arXiv：\u003chttps://arxiv.org/abs/1907.09408\u003e\r\n\r\n**Object Detection in 20 Years: A Survey**\r\n\r\n- intro：This work has been submitted to the IEEE TPAMI for possible publication\r\n- arXiv：\u003chttps://arxiv.org/abs/1905.05055\u003e\r\n\r\n**《Recent Advances in Object Detection in the Age of Deep Convolutional Neural Networks》**\r\n\r\n- intro: awesome\r\n\r\n\r\n- arXiv: https://arxiv.org/abs/1809.03193\r\n\r\n**《Deep Learning for Generic Object Detection: A Survey》**\r\n\r\n- intro: Submitted to IJCV 2018\r\n- arXiv: https://arxiv.org/abs/1809.02165\r\n\r\n# Papers\u0026Codes\r\n\r\n## R-CNN\r\n\r\n**Rich feature hierarchies for accurate object detection and semantic segmentation**\r\n\r\n- intro: R-CNN\r\n- arxiv: \u003chttp://arxiv.org/abs/1311.2524\u003e\r\n- supp: \u003chttp://people.eecs.berkeley.edu/~rbg/papers/r-cnn-cvpr-supp.pdf\u003e\r\n- slides: \u003chttp://www.image-net.org/challenges/LSVRC/2013/slides/r-cnn-ilsvrc2013-workshop.pdf\u003e\r\n- slides: \u003chttp://www.cs.berkeley.edu/~rbg/slides/rcnn-cvpr14-slides.pdf\u003e\r\n- github: \u003chttps://github.com/rbgirshick/rcnn\u003e\r\n- notes: \u003chttp://zhangliliang.com/2014/07/23/paper-note-rcnn/\u003e\r\n- caffe-pr(\"Make R-CNN the Caffe detection example\"): \u003chttps://github.com/BVLC/caffe/pull/482\u003e\r\n\r\n## Fast R-CNN\r\n\r\n**Fast R-CNN**\r\n\r\n- arxiv: \u003chttp://arxiv.org/abs/1504.08083\u003e\r\n- slides: \u003chttp://tutorial.caffe.berkeleyvision.org/caffe-cvpr15-detection.pdf\u003e\r\n- github: \u003chttps://github.com/rbgirshick/fast-rcnn\u003e\r\n- github(COCO-branch): \u003chttps://github.com/rbgirshick/fast-rcnn/tree/coco\u003e\r\n- webcam demo: \u003chttps://github.com/rbgirshick/fast-rcnn/pull/29\u003e\r\n- notes: \u003chttp://zhangliliang.com/2015/05/17/paper-note-fast-rcnn/\u003e\r\n- notes: \u003chttp://blog.csdn.net/linj_m/article/details/48930179\u003e\r\n- github(\"Fast R-CNN in MXNet\"): \u003chttps://github.com/precedenceguo/mx-rcnn\u003e\r\n- github: \u003chttps://github.com/mahyarnajibi/fast-rcnn-torch\u003e\r\n- github: \u003chttps://github.com/apple2373/chainer-simple-fast-rnn\u003e\r\n- github: \u003chttps://github.com/zplizzi/tensorflow-fast-rcnn\u003e\r\n\r\n**A-Fast-RCNN: Hard Positive Generation via Adversary for Object Detection**\r\n\r\n- intro: CVPR 2017\r\n- arxiv: \u003chttps://arxiv.org/abs/1704.03414\u003e\r\n- paper: \u003chttp://abhinavsh.info/papers/pdfs/adversarial_object_detection.pdf\u003e\r\n- github(Caffe): \u003chttps://github.com/xiaolonw/adversarial-frcnn\u003e\r\n\r\n## Faster R-CNN\r\n\r\n**Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks**\r\n\r\n- intro: NIPS 2015\r\n- arxiv: \u003chttp://arxiv.org/abs/1506.01497\u003e\r\n- gitxiv: \u003chttp://www.gitxiv.com/posts/8pfpcvefDYn2gSgXk/faster-r-cnn-towards-real-time-object-detection-with-region\u003e\r\n- slides: \u003chttp://web.cs.hacettepe.edu.tr/~aykut/classes/spring2016/bil722/slides/w05-FasterR-CNN.pdf\u003e\r\n- github(official, Matlab): \u003chttps://github.com/ShaoqingRen/faster_rcnn\u003e\r\n- github(Caffe): \u003chttps://github.com/rbgirshick/py-faster-rcnn\u003e\r\n- github(MXNet): \u003chttps://github.com/msracver/Deformable-ConvNets/tree/master/faster_rcnn\u003e\r\n- github(PyTorch--recommend): \u003chttps://github.com//jwyang/faster-rcnn.pytorch\u003e\r\n- github: \u003chttps://github.com/mitmul/chainer-faster-rcnn\u003e\r\n- github(Torch):: \u003chttps://github.com/andreaskoepf/faster-rcnn.torch\u003e\r\n- github(Torch):: \u003chttps://github.com/ruotianluo/Faster-RCNN-Densecap-torch\u003e\r\n- github(TensorFlow): \u003chttps://github.com/smallcorgi/Faster-RCNN_TF\u003e\r\n- github(TensorFlow): \u003chttps://github.com/CharlesShang/TFFRCNN\u003e\r\n- github(C++ demo): \u003chttps://github.com/YihangLou/FasterRCNN-Encapsulation-Cplusplus\u003e\r\n- github(Keras): \u003chttps://github.com/yhenon/keras-frcnn\u003e\r\n- github: \u003chttps://github.com/Eniac-Xie/faster-rcnn-resnet\u003e\r\n- github(C++): \u003chttps://github.com/D-X-Y/caffe-faster-rcnn/tree/dev\u003e\r\n\r\n**R-CNN minus R**\r\n\r\n- intro: BMVC 2015\r\n- arxiv: \u003chttp://arxiv.org/abs/1506.06981\u003e\r\n\r\n**Faster R-CNN in MXNet with distributed implementation and data parallelization**\r\n\r\n- github: \u003chttps://github.com/dmlc/mxnet/tree/master/example/rcnn\u003e\r\n\r\n**Contextual Priming and Feedback for Faster R-CNN**\r\n\r\n- intro: ECCV 2016. Carnegie Mellon University\r\n- paper: \u003chttp://abhinavsh.info/context_priming_feedback.pdf\u003e\r\n- poster: \u003chttp://www.eccv2016.org/files/posters/P-1A-20.pdf\u003e\r\n\r\n**An Implementation of Faster RCNN with Study for Region Sampling**\r\n\r\n- intro: Technical Report, 3 pages. CMU\r\n- arxiv: \u003chttps://arxiv.org/abs/1702.02138\u003e\r\n- github: \u003chttps://github.com/endernewton/tf-faster-rcnn\u003e\r\n- github: https://github.com/ruotianluo/pytorch-faster-rcnn\r\n\r\n**Interpretable R-CNN**\r\n\r\n- intro: North Carolina State University \u0026 Alibaba\r\n- keywords: AND-OR Graph (AOG)\r\n- arxiv: \u003chttps://arxiv.org/abs/1711.05226\u003e\r\n\r\n**Domain Adaptive Faster R-CNN for Object Detection in the Wild**\r\n\r\n- intro: CVPR 2018. ETH Zurich \u0026 ESAT/PSI\r\n- arxiv: \u003chttps://arxiv.org/abs/1803.03243\u003e\r\n\r\n## Mask R-CNN\r\n\r\n- arxiv: \u003chttp://arxiv.org/abs/1703.06870\u003e\r\n- github(Keras): https://github.com/matterport/Mask_RCNN\r\n- github(Caffe2): https://github.com/facebookresearch/Detectron\r\n- github(Pytorch): \u003chttps://github.com/wannabeOG/Mask-RCNN\u003e\r\n- github(MXNet): https://github.com/TuSimple/mx-maskrcnn\r\n- github(Chainer): https://github.com/DeNA/Chainer_Mask_R-CNN\r\n\r\n## Light-Head R-CNN\r\n\r\n**Light-Head R-CNN: In Defense of Two-Stage Object Detector**\r\n\r\n- intro: Tsinghua University \u0026 Megvii Inc\r\n- arxiv: \u003chttps://arxiv.org/abs/1711.07264\u003e\r\n- github(offical): https://github.com/zengarden/light_head_rcnn\r\n- github: \u003chttps://github.com/terrychenism/Deformable-ConvNets/blob/master/rfcn/symbols/resnet_v1_101_rfcn_light.py#L784\u003e\r\n\r\n## Cascade R-CNN\r\n\r\n**Cascade R-CNN: Delving into High Quality Object Detection**\r\n\r\n- arxiv: \u003chttps://arxiv.org/abs/1712.00726\u003e\r\n- github: \u003chttps://github.com/zhaoweicai/cascade-rcnn\u003e\r\n\r\n## SPP-Net\r\n\r\n**Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition**\r\n\r\n- intro: ECCV 2014 / TPAMI 2015\r\n- arxiv: \u003chttp://arxiv.org/abs/1406.4729\u003e\r\n- github: \u003chttps://github.com/ShaoqingRen/SPP_net\u003e\r\n- notes: \u003chttp://zhangliliang.com/2014/09/13/paper-note-sppnet/\u003e\r\n\r\n**DeepID-Net: Deformable Deep Convolutional Neural Networks for Object Detection**\r\n\r\n- intro: PAMI 2016\r\n- intro: an extension of R-CNN. box pre-training, cascade on region proposals, deformation layers and context representations\r\n- project page: \u003chttp://www.ee.cuhk.edu.hk/%CB%9Cwlouyang/projects/imagenetDeepId/index.html\u003e\r\n- arxiv: \u003chttp://arxiv.org/abs/1412.5661\u003e\r\n\r\n**Object Detectors Emerge in Deep Scene CNNs**\r\n\r\n- intro: ICLR 2015\r\n- arxiv: \u003chttp://arxiv.org/abs/1412.6856\u003e\r\n- paper: \u003chttps://www.robots.ox.ac.uk/~vgg/rg/papers/zhou_iclr15.pdf\u003e\r\n- paper: \u003chttps://people.csail.mit.edu/khosla/papers/iclr2015_zhou.pdf\u003e\r\n- slides: \u003chttp://places.csail.mit.edu/slide_iclr2015.pdf\u003e\r\n\r\n**segDeepM: Exploiting Segmentation and Context in Deep Neural Networks for Object Detection**\r\n\r\n- intro: CVPR 2015\r\n- project(code+data): \u003chttps://www.cs.toronto.edu/~yukun/segdeepm.html\u003e\r\n- arxiv: \u003chttps://arxiv.org/abs/1502.04275\u003e\r\n- github: \u003chttps://github.com/YknZhu/segDeepM\u003e\r\n\r\n**Object Detection Networks on Convolutional Feature Maps**\r\n\r\n- intro: TPAMI 2015\r\n- keywords: NoC\r\n- arxiv: \u003chttp://arxiv.org/abs/1504.06066\u003e\r\n\r\n**Improving Object Detection with Deep Convolutional Networks via Bayesian Optimization and Structured Prediction**\r\n\r\n- arxiv: \u003chttp://arxiv.org/abs/1504.03293\u003e\r\n- slides: \u003chttp://www.ytzhang.net/files/publications/2015-cvpr-det-slides.pdf\u003e\r\n- github: \u003chttps://github.com/YutingZhang/fgs-obj\u003e\r\n\r\n**DeepBox: Learning Objectness with Convolutional Networks**\r\n\r\n- keywords: DeepBox\r\n- arxiv: \u003chttp://arxiv.org/abs/1505.02146\u003e\r\n- github: \u003chttps://github.com/weichengkuo/DeepBox\u003e\r\n\r\n## YOLO\r\n\r\n**You Only Look Once: Unified, Real-Time Object Detection**\r\n\r\n[![img](https://camo.githubusercontent.com/e69d4118b20a42de4e23b9549f9a6ec6dbbb0814/687474703a2f2f706a7265646469652e636f6d2f6d656469612f66696c65732f6461726b6e65742d626c61636b2d736d616c6c2e706e67)](https://camo.githubusercontent.com/e69d4118b20a42de4e23b9549f9a6ec6dbbb0814/687474703a2f2f706a7265646469652e636f6d2f6d656469612f66696c65732f6461726b6e65742d626c61636b2d736d616c6c2e706e67)\r\n\r\n- arxiv: \u003chttp://arxiv.org/abs/1506.02640\u003e\r\n- code: \u003chttps://pjreddie.com/darknet/yolov1/\u003e\r\n- github: \u003chttps://github.com/pjreddie/darknet\u003e\r\n- blog: \u003chttps://pjreddie.com/darknet/yolov1/\u003e\r\n- slides: \u003chttps://docs.google.com/presentation/d/1aeRvtKG21KHdD5lg6Hgyhx5rPq_ZOsGjG5rJ1HP7BbA/pub?start=false\u0026loop=false\u0026delayms=3000\u0026slide=id.p\u003e\r\n- reddit: \u003chttps://www.reddit.com/r/MachineLearning/comments/3a3m0o/realtime_object_detection_with_yolo/\u003e\r\n- github: \u003chttps://github.com/gliese581gg/YOLO_tensorflow\u003e\r\n- github: \u003chttps://github.com/xingwangsfu/caffe-yolo\u003e\r\n- github: \u003chttps://github.com/frankzhangrui/Darknet-Yolo\u003e\r\n- github: \u003chttps://github.com/BriSkyHekun/py-darknet-yolo\u003e\r\n- github: \u003chttps://github.com/tommy-qichang/yolo.torch\u003e\r\n- github: \u003chttps://github.com/frischzenger/yolo-windows\u003e\r\n- github: \u003chttps://github.com/AlexeyAB/yolo-windows\u003e\r\n- github: \u003chttps://github.com/nilboy/tensorflow-yolo\u003e\r\n\r\n**darkflow - translate darknet to tensorflow. Load trained weights, retrain/fine-tune them using tensorflow, export constant graph def to C++**\r\n\r\n- blog: \u003chttps://thtrieu.github.io/notes/yolo-tensorflow-graph-buffer-cpp\u003e\r\n- github: \u003chttps://github.com/thtrieu/darkflow\u003e\r\n\r\n**Start Training YOLO with Our Own Data**\r\n\r\n[![img](https://camo.githubusercontent.com/2f99b692dd7ce47d7832385f3e8a6654e680d92a/687474703a2f2f6775616e6768616e2e696e666f2f626c6f672f656e2f77702d636f6e74656e742f75706c6f6164732f323031352f31322f696d616765732d34302e6a7067)](https://camo.githubusercontent.com/2f99b692dd7ce47d7832385f3e8a6654e680d92a/687474703a2f2f6775616e6768616e2e696e666f2f626c6f672f656e2f77702d636f6e74656e742f75706c6f6164732f323031352f31322f696d616765732d34302e6a7067)\r\n\r\n- intro: train with customized data and class numbers/labels. Linux / Windows version for darknet.\r\n- blog: \u003chttp://guanghan.info/blog/en/my-works/train-yolo/\u003e\r\n- github: \u003chttps://github.com/Guanghan/darknet\u003e\r\n\r\n**YOLO: Core ML versus MPSNNGraph**\r\n\r\n- intro: Tiny YOLO for iOS implemented using CoreML but also using the new MPS graph API.\r\n- blog: \u003chttp://machinethink.net/blog/yolo-coreml-versus-mps-graph/\u003e\r\n- github: \u003chttps://github.com/hollance/YOLO-CoreML-MPSNNGraph\u003e\r\n\r\n**TensorFlow YOLO object detection on Android**\r\n\r\n- intro: Real-time object detection on Android using the YOLO network with TensorFlow\r\n- github: \u003chttps://github.com/natanielruiz/android-yolo\u003e\r\n\r\n**Computer Vision in iOS – Object Detection**\r\n\r\n- blog: \u003chttps://sriraghu.com/2017/07/12/computer-vision-in-ios-object-detection/\u003e\r\n- github:\u003chttps://github.com/r4ghu/iOS-CoreML-Yolo\u003e\r\n\r\n## YOLOv2\r\n\r\n**YOLO9000: Better, Faster, Stronger**\r\n\r\n- arxiv: \u003chttps://arxiv.org/abs/1612.08242\u003e\r\n- code: \u003chttp://pjreddie.com/yolo9000/\u003e    https://pjreddie.com/darknet/yolov2/\r\n- github(Chainer): \u003chttps://github.com/leetenki/YOLOv2\u003e\r\n- github(Keras): \u003chttps://github.com/allanzelener/YAD2K\u003e\r\n- github(PyTorch): \u003chttps://github.com/longcw/yolo2-pytorch\u003e\r\n- github(Tensorflow): \u003chttps://github.com/hizhangp/yolo_tensorflow\u003e\r\n- github(Windows): \u003chttps://github.com/AlexeyAB/darknet\u003e\r\n- github: \u003chttps://github.com/choasUp/caffe-yolo9000\u003e\r\n- github: \u003chttps://github.com/philipperemy/yolo-9000\u003e\r\n- github(TensorFlow): \u003chttps://github.com/KOD-Chen/YOLOv2-Tensorflow\u003e\r\n- github(Keras): \u003chttps://github.com/yhcc/yolo2\u003e\r\n- github(Keras): \u003chttps://github.com/experiencor/keras-yolo2\u003e\r\n- github(TensorFlow): \u003chttps://github.com/WojciechMormul/yolo2\u003e\r\n\r\n**darknet_scripts**\r\n\r\n- intro: Auxilary scripts to work with (YOLO) darknet deep learning famework. AKA -\u003e How to generate YOLO anchors?\r\n- github: \u003chttps://github.com/Jumabek/darknet_scripts\u003e\r\n\r\n**Yolo_mark: GUI for marking bounded boxes of objects in images for training Yolo v2**\r\n\r\n- github: \u003chttps://github.com/AlexeyAB/Yolo_mark\u003e\r\n\r\n**LightNet: Bringing pjreddie's DarkNet out of the shadows**\r\n\r\n\u003chttps://github.com//explosion/lightnet\u003e\r\n\r\n**YOLO v2 Bounding Box Tool**\r\n\r\n- intro: Bounding box labeler tool to generate the training data in the format YOLO v2 requires.\r\n- github: \u003chttps://github.com/Cartucho/yolo-boundingbox-labeler-GUI\u003e\r\n\r\n**Loss Rank Mining: A General Hard Example Mining Method for Real-time Detectors**\r\n\r\n- intro: **LRM** is the first hard example mining strategy which could fit YOLOv2 perfectly and make it better applied in series of real scenarios where both real-time rates and accurate detection are strongly demanded.\r\n- arxiv: https://arxiv.org/abs/1804.04606\r\n\r\n**Object detection at 200 Frames Per Second**\r\n\r\n- intro: faster than Tiny-Yolo-v2\r\n- arxiv: https://arxiv.org/abs/1805.06361\r\n\r\n**Event-based Convolutional Networks for Object Detection in Neuromorphic Cameras**\r\n\r\n- intro: YOLE--Object Detection in Neuromorphic Cameras\r\n- arxiv:https://arxiv.org/abs/1805.07931\r\n\r\n**OmniDetector: With Neural Networks to Bounding Boxes**\r\n\r\n- intro: a person detector on n fish-eye images of indoor scenes（NIPS 2018）\r\n- arxiv:https://arxiv.org/abs/1805.08503\r\n- datasets:https://gitlab.com/omnidetector/omnidetector\r\n\r\n## YOLOv3\r\n\r\n**YOLOv3: An Incremental Improvement**\r\n\r\n- arxiv:https://arxiv.org/abs/1804.02767\r\n- paper:https://pjreddie.com/media/files/papers/YOLOv3.pdf\r\n- code: \u003chttps://pjreddie.com/darknet/yolo/\u003e\r\n- github(Official):https://github.com/pjreddie/darknet\r\n- github:https://github.com/mystic123/tensorflow-yolo-v3\r\n- github:https://github.com/experiencor/keras-yolo3\r\n- github:https://github.com/qqwweee/keras-yolo3\r\n- github:https://github.com/marvis/pytorch-yolo3\r\n- github:https://github.com/ayooshkathuria/pytorch-yolo-v3\r\n- github:https://github.com/ayooshkathuria/YOLO_v3_tutorial_from_scratch\r\n- github:https://github.com/eriklindernoren/PyTorch-YOLOv3\r\n- github:https://github.com/ultralytics/yolov3\r\n- github:https://github.com/BobLiu20/YOLOv3_PyTorch\r\n- github:https://github.com/andy-yun/pytorch-0.4-yolov3\r\n- github:https://github.com/DeNA/PyTorch_YOLOv3\r\n\r\n## YOLT\r\n\r\n**You Only Look Twice: Rapid Multi-Scale Object Detection In Satellite Imagery**\r\n\r\n- intro: Small Object Detection\r\n\r\n\r\n- arxiv:https://arxiv.org/abs/1805.09512\r\n- github:https://github.com/avanetten/yolt\r\n\r\n## SSD\r\n\r\n**SSD: Single Shot MultiBox Detector**\r\n\r\n[![img](https://camo.githubusercontent.com/ad9b147ed3a5f48ffb7c3540711c15aa04ce49c6/687474703a2f2f7777772e63732e756e632e6564752f7e776c69752f7061706572732f7373642e706e67)](https://camo.githubusercontent.com/ad9b147ed3a5f48ffb7c3540711c15aa04ce49c6/687474703a2f2f7777772e63732e756e632e6564752f7e776c69752f7061706572732f7373642e706e67)\r\n\r\n- intro: ECCV 2016 Oral\r\n- arxiv: \u003chttp://arxiv.org/abs/1512.02325\u003e\r\n- paper: \u003chttp://www.cs.unc.edu/~wliu/papers/ssd.pdf\u003e\r\n- slides: [http://www.cs.unc.edu/%7Ewliu/papers/ssd_eccv2016_slide.pdf](http://www.cs.unc.edu/~wliu/papers/ssd_eccv2016_slide.pdf)\r\n- github(Official): \u003chttps://github.com/weiliu89/caffe/tree/ssd\u003e\r\n- video: \u003chttp://weibo.com/p/2304447a2326da963254c963c97fb05dd3a973\u003e\r\n- github: \u003chttps://github.com/zhreshold/mxnet-ssd\u003e\r\n- github: \u003chttps://github.com/zhreshold/mxnet-ssd.cpp\u003e\r\n- github: \u003chttps://github.com/rykov8/ssd_keras\u003e\r\n- github: \u003chttps://github.com/balancap/SSD-Tensorflow\u003e\r\n- github: \u003chttps://github.com/amdegroot/ssd.pytorch\u003e\r\n- github(Caffe): \u003chttps://github.com/chuanqi305/MobileNet-SSD\u003e\r\n\r\n**What's the diffience in performance between this new code you pushed and the previous code? #327**\r\n\r\n\u003chttps://github.com/weiliu89/caffe/issues/327\u003e\r\n\r\n## DSSD\r\n\r\n**DSSD : Deconvolutional Single Shot Detector**\r\n\r\n- intro: UNC Chapel Hill \u0026 Amazon Inc\r\n- arxiv: \u003chttps://arxiv.org/abs/1701.06659\u003e\r\n- github: \u003chttps://github.com/chengyangfu/caffe/tree/dssd\u003e\r\n- github: \u003chttps://github.com/MTCloudVision/mxnet-dssd\u003e\r\n- demo: \u003chttp://120.52.72.53/www.cs.unc.edu/c3pr90ntc0td/~cyfu/dssd_lalaland.mp4\u003e\r\n\r\n**Enhancement of SSD by concatenating feature maps for object detection**\r\n\r\n- intro: rainbow SSD (R-SSD)\r\n- arxiv: \u003chttps://arxiv.org/abs/1705.09587\u003e\r\n\r\n**Context-aware Single-Shot Detector**\r\n\r\n- keywords: CSSD, DiCSSD, DeCSSD, effective receptive fields (ERFs), theoretical receptive fields (TRFs)\r\n- arxiv: \u003chttps://arxiv.org/abs/1707.08682\u003e\r\n\r\n**Feature-Fused SSD: Fast Detection for Small Objects**\r\n\r\n\u003chttps://arxiv.org/abs/1709.05054\u003e\r\n\r\n## FSSD\r\n\r\n**FSSD: Feature Fusion Single Shot Multibox Detector**\r\n\r\n\u003chttps://arxiv.org/abs/1712.00960\u003e\r\n\r\n**Weaving Multi-scale Context for Single Shot Detector**\r\n\r\n- intro: WeaveNet\r\n- keywords: fuse multi-scale information\r\n- arxiv: \u003chttps://arxiv.org/abs/1712.03149\u003e\r\n\r\n## ESSD\r\n\r\n**Extend the shallow part of Single Shot MultiBox Detector via Convolutional Neural Network**\r\n\r\n\u003chttps://arxiv.org/abs/1801.05918\u003e\r\n\r\n**Tiny SSD: A Tiny Single-shot Detection Deep Convolutional Neural Network for Real-time Embedded Object Detection**\r\n\r\n\u003chttps://arxiv.org/abs/1802.06488\u003e\r\n\r\n## MDSSD\r\n\r\n**MDSSD: Multi-scale Deconvolutional Single Shot Detector for small objects**\r\n\r\n- arxiv: https://arxiv.org/abs/1805.07009\r\n\r\n## Pelee\r\n\r\n**Pelee: A Real-Time Object Detection System on Mobile Devices**\r\n\r\nhttps://github.com/Robert-JunWang/Pelee\r\n\r\n- intro: (ICLR 2018 workshop track)\r\n\r\n\r\n- arxiv: https://arxiv.org/abs/1804.06882\r\n- github: https://github.com/Robert-JunWang/Pelee\r\n\r\n## Fire SSD\r\n\r\n**Fire SSD: Wide Fire Modules based Single Shot Detector on Edge Device**\r\n\r\n- intro:low cost, fast speed and high mAP on  factor edge computing devices\r\n\r\n\r\n- arxiv:https://arxiv.org/abs/1806.05363\r\n\r\n## R-FCN\r\n\r\n**R-FCN: Object Detection via Region-based Fully Convolutional Networks**\r\n\r\n- arxiv: \u003chttp://arxiv.org/abs/1605.06409\u003e\r\n- github: \u003chttps://github.com/daijifeng001/R-FCN\u003e\r\n- github(MXNet): \u003chttps://github.com/msracver/Deformable-ConvNets/tree/master/rfcn\u003e\r\n- github: \u003chttps://github.com/Orpine/py-R-FCN\u003e\r\n- github: \u003chttps://github.com/PureDiors/pytorch_RFCN\u003e\r\n- github: \u003chttps://github.com/bharatsingh430/py-R-FCN-multiGPU\u003e\r\n- github: \u003chttps://github.com/xdever/RFCN-tensorflow\u003e\r\n\r\n**R-FCN-3000 at 30fps: Decoupling Detection and Classification**\r\n\r\n\u003chttps://arxiv.org/abs/1712.01802\u003e\r\n\r\n**Recycle deep features for better object detection**\r\n\r\n- arxiv: \u003chttp://arxiv.org/abs/1607.05066\u003e\r\n\r\n## FPN\r\n\r\n**Feature Pyramid Networks for Object Detection**\r\n\r\n- intro: Facebook AI Research\r\n- arxiv: \u003chttps://arxiv.org/abs/1612.03144\u003e\r\n\r\n**Action-Driven Object Detection with Top-Down Visual Attentions**\r\n\r\n- arxiv: \u003chttps://arxiv.org/abs/1612.06704\u003e\r\n\r\n**Beyond Skip Connections: Top-Down Modulation for Object Detection**\r\n\r\n- intro: CMU \u0026 UC Berkeley \u0026 Google Research\r\n- arxiv: \u003chttps://arxiv.org/abs/1612.06851\u003e\r\n\r\n**Wide-Residual-Inception Networks for Real-time Object Detection**\r\n\r\n- intro: Inha University\r\n- arxiv: \u003chttps://arxiv.org/abs/1702.01243\u003e\r\n\r\n**Attentional Network for Visual Object Detection**\r\n\r\n- intro: University of Maryland \u0026 Mitsubishi Electric Research Laboratories\r\n- arxiv: \u003chttps://arxiv.org/abs/1702.01478\u003e\r\n\r\n**Learning Chained Deep Features and Classifiers for Cascade in Object Detection**\r\n\r\n- keykwords: CC-Net\r\n- intro: chained cascade network (CC-Net). 81.1% mAP on PASCAL VOC 2007\r\n- arxiv: \u003chttps://arxiv.org/abs/1702.07054\u003e\r\n\r\n**DeNet: Scalable Real-time Object Detection with Directed Sparse Sampling**\r\n\r\n- intro: ICCV 2017 (poster)\r\n- arxiv: \u003chttps://arxiv.org/abs/1703.10295\u003e\r\n\r\n**Discriminative Bimodal Networks for Visual Localization and Detection with Natural Language Queries**\r\n\r\n- intro: CVPR 2017\r\n- arxiv: \u003chttps://arxiv.org/abs/1704.03944\u003e\r\n\r\n**Spatial Memory for Context Reasoning in Object Detection**\r\n\r\n- arxiv: \u003chttps://arxiv.org/abs/1704.04224\u003e\r\n\r\n**Accurate Single Stage Detector Using Recurrent Rolling Convolution**\r\n\r\n- intro: CVPR 2017. SenseTime\r\n- keywords: Recurrent Rolling Convolution (RRC)\r\n- arxiv: \u003chttps://arxiv.org/abs/1704.05776\u003e\r\n- github: \u003chttps://github.com/xiaohaoChen/rrc_detection\u003e\r\n\r\n**Deep Occlusion Reasoning for Multi-Camera Multi-Target Detection**\r\n\r\n\u003chttps://arxiv.org/abs/1704.05775\u003e\r\n\r\n**LCDet: Low-Complexity Fully-Convolutional Neural Networks for Object Detection in Embedded Systems**\r\n\r\n- intro: Embedded Vision Workshop in CVPR. UC San Diego \u0026 Qualcomm Inc\r\n- arxiv: \u003chttps://arxiv.org/abs/1705.05922\u003e\r\n\r\n**Point Linking Network for Object Detection**\r\n\r\n- intro: Point Linking Network (PLN)\r\n- arxiv: \u003chttps://arxiv.org/abs/1706.03646\u003e\r\n\r\n**Perceptual Generative Adversarial Networks for Small Object Detection**\r\n\r\n\u003chttps://arxiv.org/abs/1706.05274\u003e\r\n\r\n**Few-shot Object Detection**\r\n\r\n\u003chttps://arxiv.org/abs/1706.08249\u003e\r\n\r\n**Yes-Net: An effective Detector Based on Global Information**\r\n\r\n\u003chttps://arxiv.org/abs/1706.09180\u003e\r\n\r\n**SMC Faster R-CNN: Toward a scene-specialized multi-object detector**\r\n\r\n\u003chttps://arxiv.org/abs/1706.10217\u003e\r\n\r\n**Towards lightweight convolutional neural networks for object detection**\r\n\r\n\u003chttps://arxiv.org/abs/1707.01395\u003e\r\n\r\n**RON: Reverse Connection with Objectness Prior Networks for Object Detection**\r\n\r\n- intro: CVPR 2017\r\n- arxiv: \u003chttps://arxiv.org/abs/1707.01691\u003e\r\n- github: \u003chttps://github.com/taokong/RON\u003e\r\n\r\n**Mimicking Very Efficient Network for Object Detection**\r\n\r\n- intro: CVPR 2017. SenseTime \u0026 Beihang University\r\n- paper: \u003chttp://openaccess.thecvf.com/content_cvpr_2017/papers/Li_Mimicking_Very_Efficient_CVPR_2017_paper.pdf\u003e\r\n\r\n**Residual Features and Unified Prediction Network for Single Stage Detection**\r\n\r\n\u003chttps://arxiv.org/abs/1707.05031\u003e\r\n\r\n**Deformable Part-based Fully Convolutional Network for Object Detection**\r\n\r\n- intro: BMVC 2017 (oral). Sorbonne Universités \u0026 CEDRIC\r\n- arxiv: \u003chttps://arxiv.org/abs/1707.06175\u003e\r\n\r\n**Adaptive Feeding: Achieving Fast and Accurate Detections by Adaptively Combining Object Detectors**\r\n\r\n- intro: ICCV 2017\r\n- arxiv: \u003chttps://arxiv.org/abs/1707.06399\u003e\r\n\r\n**Recurrent Scale Approximation for Object Detection in CNN**\r\n\r\n- intro: ICCV 2017\r\n- keywords: Recurrent Scale Approximation (RSA)\r\n- arxiv: \u003chttps://arxiv.org/abs/1707.09531\u003e\r\n- github: \u003chttps://github.com/sciencefans/RSA-for-object-detection\u003e\r\n\r\n## DSOD\r\n\r\n**DSOD: Learning Deeply Supervised Object Detectors from Scratch**\r\n\r\n![img](https://user-images.githubusercontent.com/3794909/28934967-718c9302-78b5-11e7-89ee-8b514e53e23c.png)\r\n\r\n- intro: ICCV 2017. Fudan University \u0026 Tsinghua University \u0026 Intel Labs China\r\n- arxiv: \u003chttps://arxiv.org/abs/1708.01241\u003e\r\n- github: \u003chttps://github.com/szq0214/DSOD\u003e\r\n- github:https://github.com/Windaway/DSOD-Tensorflow\r\n- github:https://github.com/chenyuntc/dsod.pytorch\r\n\r\n**Learning Object Detectors from Scratch with Gated Recurrent Feature Pyramids**\r\n\r\n- arxiv:https://arxiv.org/abs/1712.00886\r\n- github:https://github.com/szq0214/GRP-DSOD\r\n\r\n**Tiny-DSOD: Lightweight Object Detection for Resource-Restricted Usages**\r\n\r\n- intro: BMVC 2018\r\n- arXiv: https://arxiv.org/abs/1807.11013\r\n\r\n**Object Detection from Scratch with Deep Supervision**\r\n\r\n- intro: This is an extended version of DSOD\r\n- arXiv: https://arxiv.org/abs/1809.09294\r\n\r\n## RetinaNet\r\n\r\n**Focal Loss for Dense Object Detection**\r\n\r\n- intro: ICCV 2017 Best student paper award. Facebook AI Research\r\n- keywords: RetinaNet\r\n- arxiv: \u003chttps://arxiv.org/abs/1708.02002\u003e\r\n\r\n**CoupleNet: Coupling Global Structure with Local Parts for Object Detection**\r\n\r\n- intro: ICCV 2017\r\n- arxiv: \u003chttps://arxiv.org/abs/1708.02863\u003e\r\n\r\n**Incremental Learning of Object Detectors without Catastrophic Forgetting**\r\n\r\n- intro: ICCV 2017. Inria\r\n- arxiv: \u003chttps://arxiv.org/abs/1708.06977\u003e\r\n\r\n**Zoom Out-and-In Network with Map Attention Decision for Region Proposal and Object Detection**\r\n\r\n\u003chttps://arxiv.org/abs/1709.04347\u003e\r\n\r\n**StairNet: Top-Down Semantic Aggregation for Accurate One Shot Detection**\r\n\r\n\u003chttps://arxiv.org/abs/1709.05788\u003e\r\n\r\n**Dynamic Zoom-in Network for Fast Object Detection in Large Images**\r\n\r\n\u003chttps://arxiv.org/abs/1711.05187\u003e\r\n\r\n**Zero-Annotation Object Detection with Web Knowledge Transfer**\r\n\r\n- intro: NTU, Singapore \u0026 Amazon\r\n- keywords: multi-instance multi-label domain adaption learning framework\r\n- arxiv: \u003chttps://arxiv.org/abs/1711.05954\u003e\r\n\r\n## MegDet\r\n\r\n**MegDet: A Large Mini-Batch Object Detector**\r\n\r\n- intro: Peking University \u0026 Tsinghua University \u0026 Megvii Inc\r\n- arxiv: \u003chttps://arxiv.org/abs/1711.07240\u003e\r\n\r\n**Receptive Field Block Net for Accurate and Fast Object Detection**\r\n\r\n- intro: RFBNet\r\n- arxiv: \u003chttps://arxiv.org/abs/1711.07767\u003e\r\n- github: \u003chttps://github.com//ruinmessi/RFBNet\u003e\r\n\r\n**An Analysis of Scale Invariance in Object Detection - SNIP**\r\n\r\n- arxiv: \u003chttps://arxiv.org/abs/1711.08189\u003e\r\n- github: \u003chttps://github.com/bharatsingh430/snip\u003e\r\n\r\n**Feature Selective Networks for Object Detection**\r\n\r\n\u003chttps://arxiv.org/abs/1711.08879\u003e\r\n\r\n**Learning a Rotation Invariant Detector with Rotatable Bounding Box**\r\n\r\n- arxiv: \u003chttps://arxiv.org/abs/1711.09405\u003e\r\n- github: \u003chttps://github.com/liulei01/DRBox\u003e\r\n\r\n**Scalable Object Detection for Stylized Objects**\r\n\r\n- intro: Microsoft AI \u0026 Research Munich\r\n- arxiv: \u003chttps://arxiv.org/abs/1711.09822\u003e\r\n\r\n**Learning Object Detectors from Scratch with Gated Recurrent Feature Pyramids**\r\n\r\n- arxiv: \u003chttps://arxiv.org/abs/1712.00886\u003e\r\n- github: \u003chttps://github.com/szq0214/GRP-DSOD\u003e\r\n\r\n**Deep Regionlets for Object Detection**\r\n\r\n- keywords: region selection network, gating network\r\n- arxiv: \u003chttps://arxiv.org/abs/1712.02408\u003e\r\n\r\n**Training and Testing Object Detectors with Virtual Images**\r\n\r\n- intro: IEEE/CAA Journal of Automatica Sinica\r\n- arxiv: \u003chttps://arxiv.org/abs/1712.08470\u003e\r\n\r\n**Large-Scale Object Discovery and Detector Adaptation from Unlabeled Video**\r\n\r\n- keywords: object mining, object tracking, unsupervised object discovery by appearance-based clustering, self-supervised detector adaptation\r\n- arxiv: \u003chttps://arxiv.org/abs/1712.08832\u003e\r\n\r\n**Spot the Difference by Object Detection**\r\n\r\n- intro: Tsinghua University \u0026 JD Group\r\n- arxiv: \u003chttps://arxiv.org/abs/1801.01051\u003e\r\n\r\n**Localization-Aware Active Learning for Object Detection**\r\n\r\n- arxiv: \u003chttps://arxiv.org/abs/1801.05124\u003e\r\n\r\n**Object Detection with Mask-based Feature Encoding**\r\n\r\n- arxiv: \u003chttps://arxiv.org/abs/1802.03934\u003e\r\n\r\n**LSTD: A Low-Shot Transfer Detector for Object Detection**\r\n\r\n- intro: AAAI 2018\r\n- arxiv: \u003chttps://arxiv.org/abs/1803.01529\u003e\r\n\r\n**Pseudo Mask Augmented Object Detection**\r\n\r\n\u003chttps://arxiv.org/abs/1803.05858\u003e\r\n\r\n**Revisiting RCNN: On Awakening the Classification Power of Faster RCNN**\r\n\r\n\u003chttps://arxiv.org/abs/1803.06799\u003e\r\n\r\n**Learning Region Features for Object Detection**\r\n\r\n- intro: Peking University \u0026 MSRA\r\n- arxiv: \u003chttps://arxiv.org/abs/1803.07066\u003e\r\n\r\n**Single-Shot Bidirectional Pyramid Networks for High-Quality Object Detection**\r\n\r\n- intro: Singapore Management University \u0026 Zhejiang University\r\n- arxiv: \u003chttps://arxiv.org/abs/1803.08208\u003e\r\n\r\n**Object Detection for Comics using Manga109 Annotations**\r\n\r\n- intro: University of Tokyo \u0026 National Institute of Informatics, Japan\r\n- arxiv: \u003chttps://arxiv.org/abs/1803.08670\u003e\r\n\r\n**Task-Driven Super Resolution: Object Detection in Low-resolution Images**\r\n\r\n- arxiv: \u003chttps://arxiv.org/abs/1803.11316\u003e\r\n\r\n**Transferring Common-Sense Knowledge for Object Detection**\r\n\r\n- arxiv: \u003chttps://arxiv.org/abs/1804.01077\u003e\r\n\r\n**Multi-scale Location-aware Kernel Representation for Object Detection**\r\n\r\n- intro: CVPR 2018\r\n- arxiv: \u003chttps://arxiv.org/abs/1804.00428\u003e\r\n- github: \u003chttps://github.com/Hwang64/MLKP\u003e\r\n\r\n\r\n**Loss Rank Mining: A General Hard Example Mining Method for Real-time Detectors**\r\n\r\n- intro: National University of Defense Technology\r\n- arxiv: https://arxiv.org/abs/1804.04606\r\n\r\n**Robust Physical Adversarial Attack on Faster R-CNN Object Detector**\r\n\r\n- arxiv: https://arxiv.org/abs/1804.05810\r\n\r\n## RefineNet\r\n\r\n**Single-Shot Refinement Neural Network for Object Detection**\r\n\r\n- intro: CVPR 2018\r\n\r\n- arxiv: \u003chttps://arxiv.org/abs/1711.06897\u003e\r\n- github: \u003chttps://github.com/sfzhang15/RefineDet\u003e\r\n- github: https://github.com/lzx1413/PytorchSSD\r\n- github: https://github.com/ddlee96/RefineDet_mxnet\r\n- github: https://github.com/MTCloudVision/RefineDet-Mxnet\r\n\r\n## DetNet\r\n\r\n**DetNet: A Backbone network for Object Detection**\r\n\r\n- intro: Tsinghua University \u0026 Face++\r\n- arxiv: https://arxiv.org/abs/1804.06215\r\n\r\n\r\n## SSOD\r\n\r\n**Self-supervisory Signals for Object Discovery and Detection**\r\n\r\n- Google Brain\r\n- arxiv:https://arxiv.org/abs/1806.03370\r\n\r\n## CornerNet\r\n\r\n**CornerNet: Detecting Objects as Paired Keypoints**\r\n\r\n- intro: ECCV 2018\r\n- arXiv: https://arxiv.org/abs/1808.01244\r\n- github: \u003chttps://github.com/umich-vl/CornerNet\u003e\r\n\r\n## M2Det\r\n\r\n**M2Det: A Single-Shot Object Detector based on Multi-Level Feature Pyramid Network**\r\n\r\n- intro: AAAI 2019\r\n- arXiv: https://arxiv.org/abs/1811.04533\r\n- github: https://github.com/qijiezhao/M2Det\r\n\r\n## 3D Object Detection\r\n\r\n**3D Backbone Network for 3D Object Detection**\r\n\r\n- arXiv: https://arxiv.org/abs/1901.08373\r\n\r\n**LMNet: Real-time Multiclass Object Detection on CPU using 3D LiDARs**\r\n\r\n- arxiv: https://arxiv.org/abs/1805.04902\r\n- github: https://github.com/CPFL/Autoware/tree/feature/cnn_lidar_detection\r\n\r\n\r\n## ZSD（Zero-Shot Object Detection）\r\n\r\n**Zero-Shot Detection**\r\n\r\n- intro: Australian National University\r\n- keywords: YOLO\r\n- arxiv: \u003chttps://arxiv.org/abs/1803.07113\u003e\r\n\r\n**Zero-Shot Object Detection**\r\n\r\n- arxiv: https://arxiv.org/abs/1804.04340\r\n\r\n**Zero-Shot Object Detection: Learning to Simultaneously Recognize and Localize Novel Concepts**\r\n\r\n- arxiv: https://arxiv.org/abs/1803.06049\r\n\r\n**Zero-Shot Object Detection by Hybrid Region Embedding**\r\n\r\n- arxiv: https://arxiv.org/abs/1805.06157\r\n\r\n## OSD（One-Shot Object Detection）\r\n\r\n**Comparison Network for One-Shot Conditional Object Detection**\r\n\r\n- arXiv: https://arxiv.org/abs/1904.02317\r\n\r\n**One-Shot Object Detection**\r\n\r\nRepMet: Representative-based metric learning for classification and one-shot object detection\r\n\r\n- intro: IBM Research AI\r\n- arxiv:https://arxiv.org/abs/1806.04728\r\n- github: TODO\r\n\r\n## Weakly Supervised Object Detection\r\n\r\n**Weakly Supervised Object Detection in Artworks**\r\n\r\n- intro: ECCV 2018 Workshop Computer Vision for Art Analysis\r\n- arXiv: https://arxiv.org/abs/1810.02569\r\n- Datasets: https://wsoda.telecom-paristech.fr/downloads/dataset/IconArt_v1.zip\r\n\r\n**Cross-Domain Weakly-Supervised Object Detection through Progressive Domain Adaptation**\r\n\r\n- intro: CVPR 2018\r\n- arXiv: https://arxiv.org/abs/1803.11365\r\n- homepage: https://naoto0804.github.io/cross_domain_detection/\r\n- paper: http://openaccess.thecvf.com/content_cvpr_2018/html/Inoue_Cross-Domain_Weakly-Supervised_Object_CVPR_2018_paper.html\r\n- github: https://github.com/naoto0804/cross-domain-detection\r\n\r\n## Softer-NMS\r\n\r\n**《Softer-NMS: Rethinking Bounding Box Regression for Accurate Object Detection》**\r\n\r\n- intro: CMU \u0026 Face++\r\n- arXiv: https://arxiv.org/abs/1809.08545\r\n- github: https://github.com/yihui-he/softer-NMS\r\n\r\n## 2019\r\n\r\n**Feature Selective Anchor-Free Module for Single-Shot Object Detection**\r\n\r\n- intro: CVPR 2019\r\n\r\n- arXiv: https://arxiv.org/abs/1903.00621\r\n\r\n**Object Detection based on Region Decomposition and Assembly**\r\n\r\n- intro: AAAI 2019\r\n\r\n- arXiv: https://arxiv.org/abs/1901.08225\r\n\r\n**Bottom-up Object Detection by Grouping Extreme and Center Points**\r\n\r\n- intro: one stage 43.2% on COCO test-dev\r\n- arXiv: https://arxiv.org/abs/1901.08043\r\n- github: https://github.com/xingyizhou/ExtremeNet\r\n\r\n**ORSIm Detector: A Novel Object Detection Framework in Optical Remote Sensing Imagery Using Spatial-Frequency Channel Features**\r\n\r\n- intro: IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING\r\n\r\n- arXiv: https://arxiv.org/abs/1901.07925\r\n\r\n**Consistent Optimization for Single-Shot Object Detection**\r\n\r\n- intro: improves RetinaNet from 39.1 AP to 40.1 AP on COCO datase\r\n\r\n- arXiv: https://arxiv.org/abs/1901.06563\r\n\r\n**Learning Pairwise Relationship for Multi-object Detection in Crowded Scenes**\r\n\r\n- arXiv: https://arxiv.org/abs/1901.03796\r\n\r\n**RetinaMask: Learning to predict masks improves state-of-the-art single-shot detection for free**\r\n\r\n- arXiv: https://arxiv.org/abs/1901.03353\r\n- github: https://github.com/chengyangfu/retinamask\r\n\r\n**Region Proposal by Guided Anchoring**\r\n\r\n- intro: CUHK - SenseTime Joint Lab\r\n- arXiv: https://arxiv.org/abs/1901.03278\r\n\r\n**Scale-Aware Trident Networks for Object Detection**\r\n\r\n- intro: mAP of **48.4** on the COCO dataset\r\n- arXiv: https://arxiv.org/abs/1901.01892\r\n\r\n## 2018\r\n\r\n**Large-Scale Object Detection of Images from Network Cameras in Variable Ambient Lighting Conditions**\r\n\r\n- arXiv: https://arxiv.org/abs/1812.11901\r\n\r\n**Strong-Weak Distribution Alignment for Adaptive Object Detection**\r\n\r\n- arXiv: https://arxiv.org/abs/1812.04798\r\n\r\n**AutoFocus: Efficient Multi-Scale Inference**\r\n\r\n- intro: AutoFocus obtains an **mAP of 47.9%** (68.3% at 50% overlap) on the **COCO test-dev** set while processing **6.4 images per second on a Titan X (Pascal) GPU** \r\n- arXiv: https://arxiv.org/abs/1812.01600\r\n\r\n**NOTE-RCNN: NOise Tolerant Ensemble RCNN for Semi-Supervised Object Detection**\r\n\r\n- intro: Google Could\r\n- arXiv: https://arxiv.org/abs/1812.00124\r\n\r\n**SPLAT: Semantic Pixel-Level Adaptation Transforms for Detection**\r\n\r\n- intro: UC Berkeley\r\n- arXiv: https://arxiv.org/abs/1812.00929\r\n\r\n**Grid R-CNN**\r\n\r\n- intro: SenseTime\r\n- arXiv: https://arxiv.org/abs/1811.12030\r\n\r\n**Deformable ConvNets v2: More Deformable, Better Results**\r\n\r\n- intro: Microsoft Research Asia\r\n\r\n- arXiv: https://arxiv.org/abs/1811.11168\r\n\r\n**Anchor Box Optimization for Object Detection**\r\n\r\n- intro: Microsoft Research\r\n- arXiv: https://arxiv.org/abs/1812.00469\r\n\r\n**Efficient Coarse-to-Fine Non-Local Module for the Detection of Small Objects**\r\n\r\n- intro: https://arxiv.org/abs/1811.12152\r\n\r\n**NOTE-RCNN: NOise Tolerant Ensemble RCNN for Semi-Supervised Object Detection**\r\n\r\n- arXiv: https://arxiv.org/abs/1812.00124\r\n\r\n**Learning RoI Transformer for Detecting Oriented Objects in Aerial Images**\r\n\r\n- arXiv: https://arxiv.org/abs/1812.00155\r\n\r\n**Integrated Object Detection and Tracking with Tracklet-Conditioned Detection**\r\n\r\n- intro: Microsoft Research Asia\r\n- arXiv: https://arxiv.org/abs/1811.11167\r\n\r\n**Deep Regionlets: Blended Representation and Deep Learning for Generic Object Detection**\r\n\r\n- arXiv: https://arxiv.org/abs/1811.11318\r\n\r\n **Gradient Harmonized Single-stage Detector**\r\n\r\n- intro: AAAI 2019\r\n- arXiv: https://arxiv.org/abs/1811.05181\r\n\r\n**CFENet: Object Detection with Comprehensive Feature Enhancement Module**\r\n\r\n- intro: ACCV 2018\r\n- github: https://github.com/qijiezhao/CFENet\r\n\r\n**DeRPN: Taking a further step toward more general object detection**\r\n\r\n- intro: AAAI 2019\r\n- arXiv: https://arxiv.org/abs/1811.06700\r\n- github: https://github.com/HCIILAB/DeRPN\r\n\r\n**Hybrid Knowledge Routed Modules for Large-scale Object Detection**\r\n\r\n- intro: Sun Yat-Sen University \u0026 Huawei Noah’s Ark Lab\r\n- arXiv: https://arxiv.org/abs/1810.12681\r\n- github: https://github.com/chanyn/HKRM\r\n\r\n**《Receptive Field Block Net for Accurate and Fast Object Detection》**\r\n\r\n- intro: ECCV 2018\r\n- arXiv: [https://arxiv.org/abs/1711.07767](https://arxiv.org/abs/1711.07767)\r\n- github: [https://github.com/ruinmessi/RFBNet](https://github.com/ruinmessi/RFBNet)\r\n\r\n**Deep Feature Pyramid Reconfiguration for Object Detection**\r\n\r\n- intro: ECCV 2018\r\n- arXiv: https://arxiv.org/abs/1808.07993\r\n\r\n**Unsupervised Hard Example Mining from Videos for Improved Object Detection**\r\n\r\n- intro: ECCV 2018\r\n- arXiv: https://arxiv.org/abs/1808.04285\r\n\r\n**Acquisition of Localization Confidence for Accurate Object Detection**\r\n\r\n- intro: ECCV 2018\r\n- arXiv: https://arxiv.org/abs/1807.11590\r\n- github: https://github.com/vacancy/PreciseRoIPooling\r\n\r\n**Toward Scale-Invariance and Position-Sensitive Region Proposal Networks**\r\n\r\n- intro: ECCV 2018\r\n- arXiv: https://arxiv.org/abs/1807.09528\r\n\r\n**MetaAnchor: Learning to Detect Objects with Customized Anchors**\r\n\r\n- arxiv: https://arxiv.org/abs/1807.00980\r\n\r\n**Relation Network for Object Detection**\r\n\r\n- intro: CVPR 2018\r\n- arxiv: https://arxiv.org/abs/1711.11575\r\n- github:https://github.com/msracver/Relation-Networks-for-Object-Detection\r\n\r\n**Quantization Mimic: Towards Very Tiny CNN for Object Detection**\r\n\r\n- Tsinghua University1 \u0026 The Chinese University of Hong Kong2 \u0026SenseTime3\r\n- arxiv: https://arxiv.org/abs/1805.02152\r\n\r\n**Learning Rich Features for Image Manipulation Detection**\r\n\r\n- intro: CVPR 2018 Camera Ready\r\n- arxiv: https://arxiv.org/abs/1805.04953\r\n\r\n**SNIPER: Efficient Multi-Scale Training**\r\n\r\n- arxiv:https://arxiv.org/abs/1805.09300\r\n- github:https://github.com/mahyarnajibi/SNIPER\r\n\r\n**Soft Sampling for Robust Object Detection**\r\n\r\n- intro: the robustness of object detection under the presence of missing annotations\r\n- arxiv:https://arxiv.org/abs/1806.06986\r\n\r\n**Cost-effective Object Detection: Active Sample Mining with Switchable Selection Criteria**\r\n\r\n- intro: TNNLS 2018\r\n- arxiv:https://arxiv.org/abs/1807.00147\r\n- code: http://kezewang.com/codes/ASM_ver1.zip\r\n\r\n## Other\r\n\r\n**R3-Net: A Deep Network for Multi-oriented Vehicle Detection in Aerial Images and Videos**\r\n\r\n- arxiv: https://arxiv.org/abs/1808.05560\r\n- youtube: https://youtu.be/xCYD-tYudN0\r\n\r\n# Detection Toolbox\r\n\r\n- [Detectron(FAIR)](https://github.com/facebookresearch/Detectron): Detectron is Facebook AI Research's software system that implements state-of-the-art object detection algorithms, including [Mask R-CNN](https://arxiv.org/abs/1703.06870). It is written in Python and powered by the [Caffe2](https://github.com/caffe2/caffe2) deep learning framework.\r\n- [Detectron2](https://github.com/facebookresearch/detectron2): Detectron2 is FAIR's next-generation research platform for object detection and segmentation.\r\n- [maskrcnn-benchmark(FAIR)](https://github.com/facebookresearch/maskrcnn-benchmark): Fast, modular reference implementation of Instance Segmentation and Object Detection algorithms in PyTorch.\r\n- [mmdetection(SenseTime\u0026CUHK)](https://github.com/open-mmlab/mmdetection): mmdetection is an open source object detection toolbox based on PyTorch. It is a part of the open-mmlab project developed by [Multimedia Laboratory, CUHK](http://mmlab.ie.cuhk.edu.hk/).\r\n","project_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Famusi%2Fawesome-object-detection","html_url":"https://awesome.ecosyste.ms/projects/github.com%2Famusi%2Fawesome-object-detection","lists_url":"https://awesome.ecosyste.ms/api/v1/projects/github.com%2Famusi%2Fawesome-object-detection/lists"}