https://github.com/coderonion/awesome-video-object-detection
A collection of some awesome video object detection series projects.
https://github.com/coderonion/awesome-video-object-detection
List: awesome-video-object-detection
fgfa mamba mega object-detection video-object-detection yolo yolov5 yolov7 yolov8
Last synced: 3 months ago
JSON representation
A collection of some awesome video object detection series projects.
- Host: GitHub
- URL: https://github.com/coderonion/awesome-video-object-detection
- Owner: coderonion
- Created: 2022-04-28T14:53:21.000Z (about 3 years ago)
- Default Branch: main
- Last Pushed: 2024-02-22T12:21:27.000Z (over 1 year ago)
- Last Synced: 2025-02-07T10:01:53.176Z (4 months ago)
- Topics: fgfa, mamba, mega, object-detection, video-object-detection, yolo, yolov5, yolov7, yolov8
- Homepage:
- Size: 6.84 KB
- Stars: 26
- Watchers: 3
- Forks: 1
- Open Issues: 0
-
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
- ultimate-awesome - awesome-video-object-detection - A collection of some awesome video object detection series projects. . (Other Lists / Julia Lists)
README
# Awesome-Video-Object-Detection
[](https://github.com/sindresorhus/awesome)🔥🔥🔥 This repository lists some awesome video object detection series projects.
## Contents
- [Awesome-Video-Object-Detection](#awesome-anchor-free-object-detection)
- [Summary](#summary)
- [Awesome List](#awesome-list)
- [Frameworks](#frameworks)- ## Summary
- ### Awesome List
- [zhanghengdev/awesome-video-object-detection](https://github.com/zhanghengdev/awesome-video-object-detection)
: This is a list of awesome articles about object detection from video.
- [junliang230/video_object_detection_paper](https://github.com/junliang230/video_object_detection_paper)
: record some video object detection papers and dataset (视频目标检测论文整理).
## Frameworks
- [NoScope](https://github.com/stanford-futuredata/noscope)
: "Noscope: optimizing neural network queries over video at scale". (**[arXiv 2017](https://arxiv.org/abs/1703.02529)**)
- [FGFA](https://github.com/msracver/Flow-Guided-Feature-Aggregation): "Flow-Guided Feature Aggregation for Video Object Detection". (**[ICCV 2017](https://openaccess.thecvf.com/content_iccv_2017/html/Zhu_Flow-Guided_Feature_Aggregation_ICCV_2017_paper.html)**)
- [STSN](https://github.com/lyj96/STSN)
: "Object Detection in Video with Spatiotemporal Sampling Networks". (**[ECCV 2018](https://openaccess.thecvf.com/content_ECCV_2018/html/Gedas_Bertasius_Object_Detection_in_ECCV_2018_paper.html)**)
- [MANet](https://github.com/wangshy31/MANet_for_Video_Object_Detection)
: "Fully Motion-Aware Network for Video Object Detection". (**[ECCV 2018](https://openaccess.thecvf.com/content_ECCV_2018/html/Shiyao_Wang_Fully_Motion-Aware_Network_ECCV_2018_paper.html)**)
- RDN: "Relation Distillation Networks for Video Object Detection". (**[ICCV 2019](https://openaccess.thecvf.com/content_ICCV_2019/html/Deng_Relation_Distillation_Networks_for_Video_Object_Detection_ICCV_2019_paper.html)**)
- [MGA](https://github.com/lhaof/Motion-Guided-Attention)
: "Motion Guided Attention for Video Salient Object Detection". (**[ICCV 2019](https://openaccess.thecvf.com/content_ICCV_2019/html/Li_Motion_Guided_Attention_for_Video_Salient_Object_Detection_ICCV_2019_paper.html)**)
- [SELSA](https://github.com/happywu/Sequence-Level-Semantics-Aggregation)
: "Sequence level semantics aggregation for video object detection". (**[ICCV 2019](https://openaccess.thecvf.com/content_ICCV_2019/html/Wu_Sequence_Level_Semantics_Aggregation_for_Video_Object_Detection_ICCV_2019_paper.html)**)
- [LSTS](https://github.com/jiangzhengkai/LSTS)
: "Learning Where to Focus for Efficient Video Object Detection". (**[ECCV 2020](https://link.springer.com/chapter/10.1007/978-3-030-58517-4_2)**)
- [HVR-Net](https://github.com/youthHan/HVRNet)
: "Mining Inter-Video Proposal Relations for Video Object Detection". (**[ECCV 2020](https://link.springer.com/chapter/10.1007/978-3-030-58589-1_26)**)
- [MEGA](https://github.com/Scalsol/mega.pytorch)
: "Memory enhanced global-local aggregation for video object detection". (**[CVPR 2020](https://openaccess.thecvf.com/content_CVPR_2020/html/Chen_Memory_Enhanced_Global-Local_Aggregation_for_Video_Object_Detection_CVPR_2020_paper.html)**)
- [PyramidCSA](https://github.com/guyuchao/PyramidCSA)
: "Pyramid constrained self-attention network for fast video salient object detection". (**[AAAI 2020](https://ojs.aaai.org/index.php/AAAI/article/view/6718)**)
- [REPP](https://github.com/AlbertoSabater/Robust-and-efficient-post-processing-for-video-object-detection)
: "Robust and efficient post-processing for video object detection". (**[IROS 2020](https://ieeexplore.ieee.org/abstract/document/9341600)**)
- [FFAVOD](https://github.com/hu64/FFAVOD)
: "FFAVOD: Feature fusion architecture for video object detection". (**[Pattern Recognition Letters, 2021](https://www.sciencedirect.com/science/article/abs/pii/S016786552100307X)**)
- [STMTrack](https://github.com/fzh0917/STMTrack)
: "STMTrack: Template-free Visual Tracking with Space-time Memory Networks". (**[CVPR 2021](https://openaccess.thecvf.com/content/CVPR2021/html/Fu_STMTrack_Template-Free_Visual_Tracking_With_Space-Time_Memory_Networks_CVPR_2021_paper.html)**)
- [TraDeS](https://github.com/JialianW/TraDeS)
: "Track to Detect and Segment: An Online Multi-Object Tracker". (**[CVPR 2021](https://openaccess.thecvf.com/content/CVPR2021/html/Wu_Track_To_Detect_and_Segment_An_Online_Multi-Object_Tracker_CVPR_2021_paper.html)**)
- [TROIA](https://github.com/open-mmlab/mmtracking)
: "Temporal ROI Align for Video Object Recognition". (**[AAAI 2021](https://www.aaai.org/AAAI21Papers/AAAI-3370.GongT.pdf)**)
- MeGA-CDA: "MeGA-CDA: Memory Guided Attention for Category-Aware Unsupervised Domain Adaptive Object Detection". (**[CVPR 2021](https://openaccess.thecvf.com/content/CVPR2021/html/VS_MeGA-CDA_Memory_Guided_Attention_for_Category-Aware_Unsupervised_Domain_Adaptive_Object_CVPR_2021_paper.html)**)
- [SSTVOS](https://github.com/dukebw/SSTVOS)
: "SSTVOS: Sparse Spatiotemporal Transformers for Video Object Segmentation". (**[CVPR 2021](https://openaccess.thecvf.com/content/CVPR2021/html/Duke_SSTVOS_Sparse_Spatiotemporal_Transformers_for_Video_Object_Segmentation_CVPR_2021_paper.html)**)
- [TransVOD](https://github.com/SJTU-LuHe/TransVOD)
: "End-to-End Video Object Detection with Spatial-Temporal Transformers". (**[ACM 2021](https://arxiv.org/pdf/2105.10920.pdf)**)
- [qianyuzqy/TransVOD_Lite](https://github.com/qianyuzqy/TransVOD_Lite)
: (TPAMI 2023) TransVOD:End-to-End Video Object Detection with Spatial-Temporal Transformers (implementations of TransVOD Lite).
- [VidVRD](https://github.com/Dawn-LX/VidVRD-tracklets)
: "Video Relation Detection via Tracklet based Visual Transformer". (**[ACM 2021](https://arxiv.org/pdf/2108.08669.pdf)**)
- [DSFNet](https://github.com/ChaoXiao12/Moving-object-detection-DSFNet)
: "DSFNet: Dynamic and Static Fusion Network for Moving Object Detection in Satellite Videos". (**[IEEE Geoscience and Remote Sensing Letters 2021](https://ieeexplore.ieee.org/abstract/document/9594855)**)
- [MAMBA](https://github.com/Duckduckgod/MAMBA)
: Self implementation of AAAI21 paper MAMBA for video object detection. "MAMBA: Multi-level Aggregation via Memory Bank for Video Object Detection". (**[AAAI 2021](https://www.aaai.org/AAAI21Papers/AAAI-9815.SunG.pdf)**)
- [STFT](https://github.com/lingyunwu14/STFT)
: "Multi-frame Collaboration for Effective Endoscopic Video Polyp Detection via Spatial-Temporal Feature Transformation". (**[MICCAI 2021](https://link.springer.com/chapter/10.1007/978-3-030-87240-3_29)**)
- [TAdaConv](https://github.com/alibaba-mmai-research/TAdaConv)
: "TAda! Temporally-Adaptive Convolutions for Video Understanding". (**[ICLR 2022](https://arxiv.org/abs/2110.06178)**)
- [TCTrack](https://github.com/vision4robotics/TCTrack)
: "TCTrack: Temporal Contexts for Aerial Tracking". (**[CVPR 2022](https://arxiv.org/abs/2203.01885)**)
- [SLT-Net](https://github.com/XuelianCheng/SLT-Net)
: "Implicit Motion Handling for Video Camouflaged Object Detection". (**[CVPR 2022](https://arxiv.org/abs/2203.07363)**)
- [StreamYOLO](https://github.com/yancie-yjr/StreamYOLO)
: "Real-time Object Detection for Streaming Perception". (**[CVPR 2022](https://arxiv.org/abs/2203.12338v1)**)
- [suyukun666/UFO](https://github.com/suyukun666/UFO)
: "A Unified Transformer Framework for Group-based Segmentation: Co-Segmentation, Co-Saliency Detection and Video Salient Object Detection". (**[arXiv 2022](https://arxiv.org/abs/2203.04708)**)
- QueryProp: "QueryProp: Object Query Propagation for High-Performance Video Object Detection". (**[AAAI 2022](https://www.aaai.org/AAAI22Papers/AAAI-471.HeF.pdf)**)
- [YOLOV](https://github.com/YuHengsss/YOLOV)
: "YOLOV: Making Still Image Object Detectors Great at Video Object Detection". (**[arXiv 2022](https://arxiv.org/abs/2208.09686)**)