Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
https://github.com/zhenyingfang/Awesome-Temporal-Action-Detection-Temporal-Action-Proposal-Generation
Temporal Action Detection & Weakly Supervised Temporal Action Detection & Temporal Action Proposal Generation
List: Awesome-Temporal-Action-Detection-Temporal-Action-Proposal-Generation
- Host: GitHub
- URL: https://github.com/zhenyingfang/Awesome-Temporal-Action-Detection-Temporal-Action-Proposal-Generation
- Owner: zhenyingfang
- Created: 2020-03-09T13:49:55.000Z (almost 5 years ago)
- Default Branch: master
- Last Pushed: 2024-10-21T02:16:00.000Z (2 months ago)
- Last Synced: 2024-10-21T05:34:18.027Z (2 months ago)
- Size: 398 KB
- Stars: 430
- Watchers: 21
- Forks: 36
- Open Issues: 1
Metadata Files:
- Readme: README.md
Awesome Lists containing this project
- ultimate-awesome - Awesome-Temporal-Action-Detection-Temporal-Action-Proposal-Generation - Temporal Action Detection & Weakly Supervised Temporal Action Detection & Temporal Action Proposal Generation. (Other Lists / Monkey C Lists)
- awesome_long_form_video_understanding - Github repo: Awesome Temporal Action Detection Temporal Action Proposal Generation
- awesome-ai-data-github-repos - Awesome-Temporal-Action-Detection-Temporal-Action-Proposal-Generation
README
# Awesome-Temporal-Action-Detection-Temporal-Action-Proposal-Generation [![Awesome](https://cdn.rawgit.com/sindresorhus/awesome/d7305f38d29fed78fa85652e3a63e154dd8e8829/media/badge.svg)](https://github.com/zhenyingfang/Awesome-Temporal-Action-Detection-Temporal-Action-Proposal-Generation)
Temporal Action Detection & Weakly Supervised & Semi Supervised Temporal Action Detection & Temporal Action Proposal Generation & Open-Vocabulary Temporal Action Detection

-----
**Contents**
- [Awesome-Temporal-Action-Detection-Temporal-Action-Proposal-Generation](#awesome-temporal-action-detection-temporal-action-proposal-generation)
- [**about pretrained model**](#about-pretrained-model)
- [**ActivityNet Challenge**](#activitynet-challenge)
- [**Temporal Action Proposal Generation**](#papers-temporal-action-proposal-generation)
- [2023](#2023) - [2022](#2022) - [2021](#2021) - [2020](#2020) - [2019](#2019) - [2018](#2018) - [2017](#2017) - [before](#before)
- [**Temporal Action Detection**](#papers-temporal-action-detection)
- [2024](#2024) - [2023](#2023-1) - [2022](#2022-1) - [2021](#2021-1) - [2020](#2020-1) - [2019](#2019-1) - [2018](#2018-1) - [2017](#2017-1) - [before](#before-1)
- [**Weakly Supervised Temporal Action Detection**](#papers-weakly-supervised-temporal-action-detection)
- [2024](#2024-1) - [2023](#2023-2) - [2022](#2022-2) - [2021](#2021-2) - [2020](#2020-2) - [2019](#2019-2) - [2018](#2018-2) - [2017](#2017-2)
- [**Online Action Detection**](#papers-online-action-detection)
- [2024](#2024-2) - [2023](#2023-3) - [2022](#2022-3) - [2021](#2021-3)
- [**Semi Supervised Temporal Action Detection**](#semi-supervised)
- [2024](#2024-3) - [2023](#2023-4) - [2022](#2022-4) - [2021](#2021-4) - [2019](#2019-3)
- [**Open-Vocabulary Temporal Action Detection**](#open-vocabulary-temporal-action-detection)
- [2024](#2024-4) - [2023](#2023-5) - [2022](#2022-5)
-----
# **about pretrained model**
1. (BSP) [Boundary-sensitive Pre-training for Temporal Localization in Videos](https://arxiv.org/abs/2011.10830) (ICCV 2021)
2. (TSP) [TSP: Temporally-Sensitive Pretraining of Video Encoders for Localization Tasks](https://arxiv.org/abs/2011.11479) (ICCVW 2021)
3. (UP-TAL) [Unsupervised Pre-training for Temporal Action Localization Tasks](https://arxiv.org/abs/2203.13609) (CVPR 2022) [code](https://github.com/zhang-can/UP-TAL)
4. [Contrastive Language-Action Pre-training for Temporal Localization](https://arxiv.org/abs/2204.12293) (arxiv 2022)
5. [Low-Fidelity End-to-End Video Encoder Pre-training for Temporal Action Localization](https://arxiv.org/abs/2103.15233) (NeurIPS 2021)
# **ActivityNet Challenge and talks**
1. (2021) [ActivityNet 2021](http://activity-net.org/challenges/2021/challenge.html)
2. (2021) [Applications of Transformers in Temporal Action Detection & Semi-Supervised Temporal Action Detection Based on Self-Supervised Learning](https://www.techbeat.net/talk-info?id=545) (DAMO Academy, Alibaba Group)
# **Papers: Temporal Action Proposal Generation**
## 2023
1. (MIFNet) [MIFNet: Multiple Instances Focused Temporal Action Proposal Generation](https://www.sciencedirect.com/science/article/abs/pii/S0925231223000553) (Neurocomputing 2023)
2. (SMBG) [Faster Learning of Temporal Action Proposal via Sparse Multilevel Boundary Generator](https://arxiv.org/abs/2303.03166) (arxiv 2023) [code](https://github.com/zhouyang-001/SMBG-for-temporal-action-proposal)
3. (MCBD) **Multi-Level Content-Aware Boundary Detection for Temporal Action Proposal Generation** (TIP 2023) [code](https://mic.tongji.edu.cn/ff/32/c9778a327474/page.htm)
## 2022
1. (BCNet) [Temporal Action Proposal Generation with Background Constraint](https://arxiv.org/abs/2112.07984) (AAAI 2022)
2. (PRSA-Net) [Pyramid Region-based Slot Attention Network for Temporal Action Proposal Generation](https://arxiv.org/abs/2206.10095) (BMVC 2022) [code](https://github.com/handhand123/PRSA-Net)
3. (TDN) [Modeling long-term video semantic distribution for temporal action proposal generation](https://www.sciencedirect.com/science/article/abs/pii/S0925231221017616) (Neurocomputing 2022)
4. (AOE-Net) [AOE-Net: Entities Interactions Modeling with Adaptive Attention Mechanism for Temporal Action Proposals Generation](https://arxiv.org/abs/2210.02578) (IJCV 2022)
## 2021
1. (BSN++) [BSN++: Complementary Boundary Regressor with Scale-Balanced Relation Modeling for Temporal Action Proposal Generation](https://arxiv.org/abs/2009.07641) (AAAI 2021) [Author's Zhihu](https://zhuanlan.zhihu.com/p/344065976)
2. (RTD-Net) [Relaxed Transformer Decoders for Direct Action Proposal Generation](https://arxiv.org/abs/2102.01894) (ICCV 2021) [code](https://github.com/MCG-NJU/RTD-Action) [Zhihu](https://zhuanlan.zhihu.com/p/363133304)
3. (TCANet) [Temporal Context Aggregation Network for Temporal Action Proposal Refinement](https://arxiv.org/abs/2103.13141) (CVPR 2021) [Zhihu](https://zhuanlan.zhihu.com/p/358754602)
4. [Augmented Transformer with Adaptive Graph for Temporal Action Proposal Generation](https://arxiv.org/abs/2103.16024) (arxiv 2021)
5. (TAPG) [Temporal Action Proposal Generation with Transformers](https://arxiv.org/abs/2105.12043) (arxiv 2021)
6. (AEN) [Agent-Environment Network for Temporal Action Proposal Generation](https://arxiv.org/abs/2107.08323) (ICASSP 2021)
7. (AEI) [AEI: Actors-Environment Interaction with Adaptive Attention for Temporal Action Proposals Generation](https://arxiv.org/abs/2110.11474) (BMVC 2021) [code](https://github.com/vhvkhoa/TAPG-AgentEnvInteration)
## 2020
1. **VALSE talk by Tianwei Lin** (2020.03.18) [link](https://pan.baidu.com/s/18uPJX3l69qJHaYOdeJ0IQw?errmsg=Auth+Login+Sucess&errno=0&ssnerror=0&) (7y8g)
2. (RapNet) **Accurate Temporal Action Proposal Generation with Relation-Aware Pyramid Network** (AAAI 2020) [pre-paper 2019 ActivityNet task-1 2nd](https://arxiv.org/abs/1908.03448)
3. (DBG) **Fast Learning of Temporal Action Proposal via Dense Boundary Generator** (AAAI 2020) [paper](https://arxiv.org/abs/1911.04127) [code.TensorFlow](https://github.com/TencentYoutuResearch/ActionDetection-DBG)
4. (BC-GNN) **Boundary Content Graph Neural Network for Temporal Action Proposal Generation** (ECCV 2020) [paper](https://arxiv.org/abs/2008.01432v1)
5. [Bottom-Up Temporal Action Localization with Mutual Regularization](https://arxiv.org/abs/2002.07358) (ECCV 2020) [code.TensorFlow](https://github.com/PeisenZhao/Bottom-Up-TAL-with-MR)
6. (TSI) [TSI: Temporal Scale Invariant Network for Action Proposal Generation](https://openaccess.thecvf.com/content/ACCV2020/html/Liu_TSI_Temporal_Scale_Invariant_Network_for_Action_Proposal_Generation_ACCV_2020_paper.html) (ACCV 2020)
## 2019
1. (SRG) **SRG: Snippet Relatedness-based Temporal Action Proposal Generator** (IEEE Trans 2019) [paper](https://arxiv.org/abs/1911.11306)
2. (DPP) **Deep Point-wise Prediction for Action Temporal Proposal** (ICONIP 2019) [paper](https://arxiv.org/abs/1909.07725) [code.PyTorch](https://github.com/liluxuan1997/DPP)
3. (BMN) **BMN: Boundary-Matching Network for Temporal Action Proposal Generation** (ICCV 2019) [paper](https://arxiv.org/abs/1907.09702) [code.PaddlePaddle](https://github.com/PaddlePaddle/models/tree/develop/PaddleCV/video) [code.PyTorch_unofficial](https://github.com/JJBOY/BMN-Boundary-Matching-Network)
4. (MGG) **Multi-granularity Generator for Temporal Action Proposal** (CVPR 2019) [paper](https://arxiv.org/abs/1811.11524)
5. **Investigation on Combining 3D Convolution of Image Data and Optical Flow to Generate Temporal Action Proposals** (2019 CVPR Workshop) [paper](https://arxiv.org/abs/1903.04176)
6. (CMSN) **CMSN: Continuous Multi-stage Network and Variable Margin Cosine Loss for Temporal Action Proposal Generation** (arxiv 2019) [paper](https://arxiv.org/abs/1911.06080)
7. **A high performance computing method for accelerating temporal action proposal generation** (arxiv 2019) [paper](https://arxiv.org/abs/1906.06496)
8. **Multi-Granularity Fusion Network for Proposal and Activity Localization: Submission to ActivityNet Challenge 2019 Task 1 and Task 2** (ActivityNet Challenge 2019) [paper](https://arxiv.org/abs/1907.12223)
9. [Joint Learning of Local and Global Context for Temporal Action Proposal Generation](https://ieeexplore.ieee.org/abstract/document/8941024) (TCSVT 2019)
## 2018
1. (CTAP) **CTAP: Complementary Temporal Action Proposal Generation** (ECCV 2018) [paper](https://arxiv.org/abs/1807.04821) [code.TensorFlow](https://github.com/jiyanggao/CTAP)
2. (BSN) **BSN: Boundary Sensitive Network for Temporal Action Proposal Generation** (ECCV 2018) [paper](https://arxiv.org/abs/1806.02964) [code.TensorFlow](https://github.com/wzmsltw/BSN-boundary-sensitive-network) [code.PyTorch](https://github.com/wzmsltw/BSN-boundary-sensitive-network.pytorch)
3. (SAP) **SAP: Self-Adaptive Proposal Model for Temporal Action Detection based on Reinforcement Learning** (AAAI 2018) [paper](https://github.com/hjjpku/Action_Detection_DQN/blob/master/camera%20ready.pdf) [code.Torch](https://github.com/hjjpku/Action_Detection_DQN)
## 2017
1. (TURN TAP) **TURN TAP: Temporal Unit Regression Network for Temporal Action Proposals** (ICCV 2017) [paper](https://arxiv.org/abs/1703.06189) [code.TensorFlow](https://github.com/jiyanggao/TURN-TAP)
2. (SST) **SST: Single-Stream Temporal Action Proposals** (CVPR 2017) [paper](http://vision.stanford.edu/pdf/buch2017cvpr.pdf) [code.theano](https://github.com/shyamal-b/sst/) [code.TensorFlow](https://github.com/JaywongWang/SST-Tensorflow)
3. **YoTube: Searching Action Proposal via Recurrent and Static Regression Networks** (IEEE Trans 2017) [paper](https://arxiv.org/abs/1706.08218)
4. **A Pursuit of Temporal Accuracy in General Activity Detection** (arxiv 2017) [paper](https://arxiv.org/abs/1703.02716) [code.PyTorch](https://github.com/yjxiong/action-detection)
## before
1. (DAPs) **DAPs: Deep Action Proposals for Action Understanding** (ECCV 2016) [paper](https://drive.google.com/file/d/0B0ZXjo_p8lHBcjh1WDlmYVN3R2M/view) [code](https://github.com/escorciav/deep-action-proposals)
----
# **Papers: Temporal Action Detection**
## 2024
1. (DenoiseLoc) [Boundary Denoising for Video Activity Localization](https://openreview.net/forum?id=bLpUtGyf9g) (ICLR 2024) [code](https://github.com/frostinassiky/denoiseloc)
2. (LITA) [LITA: Language Instructed Temporal-Localization Assistant](https://arxiv.org/abs/2403.19046) (arXiv 2024) [code](https://github.com/NVlabs/LITA)
3. (PLOT-TAL) (few-shot) [PLOT-TAL -- Prompt Learning with Optimal Transport for Few-Shot Temporal Action Localization](https://arxiv.org/abs/2403.18915) (Arxiv 2024)
4. [Benchmarking the Robustness of Temporal Action Detection Models Against Temporal Corruptions](https://arxiv.org/abs/2403.20254) (CVPR 2024) [code](https://github.com/Alvin-Zeng/temporal-robustness-benchmark)
5. (zero-shot) (T3AL) [Test-Time Zero-Shot Temporal Action Localization](https://arxiv.org/abs/2404.05426) (CVPR 2024) [code](https://github.com/benedettaliberatori/T3AL)
6. (UniMD) [UniMD: Towards Unifying Moment Retrieval and Temporal Action Detection](https://arxiv.org/abs/2404.04933) (ECCV 2024) [code](https://github.com/yingsen1/UniMD)
7. [Adapting Short-Term Transformers for Action Detection in Untrimmed Videos](https://arxiv.org/abs/2312.01897) (CVPR 2024)
8. (AdaTAD) [End-to-End Temporal Action Detection with 1B Parameters Across 1000 Frames](https://arxiv.org/abs/2311.17241) (CVPR 2024) [code](https://github.com/sming256/OpenTAD/tree/main/configs/adatad)
9. [Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding](https://arxiv.org/abs/2403.09626) (ECCV 2024) [code](https://github.com/OpenGVLab/video-mamba-suite)
10. (TE-TAD) [TE-TAD: Towards Full End-to-End Temporal Action Detection via Time-Aligned Coordinate Expression](https://arxiv.org/abs/2404.02405) (CVPR 2024) [code](https://github.com/Dotori-HJ/TE-TAD)
11. (ADI-Diff) [Action Detection via an Image Diffusion Process](https://arxiv.org/abs/2404.01051) (CVPR 2024)
12. (DualDETR) [Dual DETRs for Multi-Label Temporal Action Detection](https://arxiv.org/abs/2404.00653) (CVPR 2024) [code](https://github.com/MCG-NJU/DualDETR)
13. [An Effective-Efficient Approach for Dense Multi-Label Action Detection](https://arxiv.org/abs/2406.06187) (arXiv 2024)
14. (Spatio-Temporal) [End-to-End Spatio-Temporal Action Localisation with Video Transformers](https://openaccess.thecvf.com/content/CVPR2024/html/Gritsenko_End-to-End_Spatio-Temporal_Action_Localisation_with_Video_Transformers_CVPR_2024_paper.html) (CVPR 2024)
15. (DyFADet) [DyFADet: Dynamic Feature Aggregation for Temporal Action Detection](https://arxiv.org/abs/2407.03197) (ECCV 2024) [code](https://github.com/yangle15/DyFADet-pytorch)
16. (causaltad) [Harnessing Temporal Causality for Advanced Temporal Action Detection](https://arxiv.org/abs/2407.17792) (arxiv 2024) [code](https://github.com/sming256/OpenTAD/causaltad)
17. (LTP) [Long-Term Pre-training for Temporal Action Detection with Transformers](https://arxiv.org/abs/2408.13152) (arxiv 2024)
18. (Pred-DETR) [Prediction-Feedback DETR for Temporal Action Detection](https://arxiv.org/abs/2408.16729) (arxiv 2024)
19. [Introducing Gating and Context into Temporal Action Detection](https://arxiv.org/abs/2409.04205) (ECCV W 2024)
20. (ContextDet) [ContextDet: Temporal Action Detection with Adaptive Context Aggregation](https://arxiv.org/abs/2410.15279) (arXiv 2024)
## 2023
1. (AMNet) [Action-aware Masking Network with Group-based Attention for Temporal Action Localization](https://openaccess.thecvf.com/content/WACV2023/papers/Kang_Action-Aware_Masking_Network_With_Group-Based_Attention_for_Temporal_Action_Localization_WACV_2023_paper.pdf) (WACV 2023)
2. (ContextLoc++) [ContextLoc++: A Unified Context Model for Temporal Action Localization](https://ieeexplore.ieee.org/abstract/document/10018461) (TPAMI 2023)
3. [Temporal action detection with dynamic weights based on curriculum learning](https://www.sciencedirect.com/science/article/abs/pii/S0925231222015557) (Neurocomputing 2023)
4. (GAP) [Post-Processing Temporal Action Detection](https://arxiv.org/abs/2211.14924) (CVPR 2023) [code](https://github.com/sauradip/GAP)
5. (TriDet) [TriDet: Temporal Action Detection with Relative Boundary Modeling](https://arxiv.org/abs/2303.07347) (CVPR 2023) [code](https://github.com/sssste/TriDet)
- [Temporal Action Localization with Enhanced Instant Discriminability](https://arxiv.org/abs/2309.05590) (extended version)
6. (TemporalMaxer) [TemporalMaxer: Maximize Temporal Context with only Max Pooling for Temporal Action Localization](https://arxiv.org/pdf/2303.09055.pdf) (ArXiv 2023) [code](https://github.com/tuantng/temporalmaxer)
7. (DiffTAD) [DiffTAD: Temporal Action Detection with Proposal Denoising Diffusion](https://arxiv.org/abs/2303.14863) (ICCV 2023) [code](https://github.com/sauradip/DiffusionTAD)
8. [Decomposed Cross-modal Distillation for RGB-based Temporal Action Detection](https://arxiv.org/abs/2303.17285) (CVPR 2023)
9. [Boundary-Denoising for Video Activity Localization](https://arxiv.org/abs/2304.02934) (Arxiv 2023)
10. (ASL) [Action Sensitivity Learning for Temporal Action Localization](https://arxiv.org/abs/2305.15701) (ICCV 2023)
11. (MMNet) [A Multi-Modal Transformer Network for Action Detection](https://www.sciencedirect.com/science/article/pii/S0031320323004119) (Pattern Recognition 2023)
12. [Truncated attention-aware proposal networks with multi-scale dilation for temporal action detection](https://www.sciencedirect.com/science/article/pii/S0031320323003825) (Pattern Recognition 2023)
13. (MSST) [A Multitemporal Scale and Spatial–Temporal Transformer Network for Temporal Action Localization](https://ieeexplore.ieee.org/abstract/document/10120600) (IEEE Transactions on Human-Machine Systems 2023)
14. [Exploring Action Centers for Temporal Action Localization](https://ieeexplore.ieee.org/abstract/document/10058582) (TMM 2023)
15. (ETAD) [ETAD: Training Action Detection End to End on a Laptop](https://openaccess.thecvf.com/content/CVPR2023W/ECV/html/Liu_ETAD_Training_Action_Detection_End_to_End_on_a_Laptop_CVPRW_2023_paper.html) (CVPRW 2023) [code](https://github.com/sming256/ETAD)
16. (BasicTAD) [BasicTAD: an Astounding RGB-Only Baseline for Temporal Action Detection](https://arxiv.org/abs/2205.02717) (CVIU 2023) [code](https://github.com/MCG-NJU/BasicTAD)
17. (Re2TAL) [Re2TAL: Rewiring Pretrained Video Backbones for Reversible Temporal Action Localization](https://openaccess.thecvf.com/content/CVPR2023/papers/Zhao_Re2TAL_Rewiring_Pretrained_Video_Backbones_for_Reversible_Temporal_Action_Localization_CVPR_2023_paper.pdf) (CVPR 2023) [code](https://github.com/coolbay/Re2TAL)
18. (SoLa) [Soft-Landing Strategy for Alleviating the Task Discrepancy Problem in Temporal Action Localization Tasks](https://openaccess.thecvf.com/content/CVPR2023/papers/Kang_Soft-Landing_Strategy_for_Alleviating_the_Task_Discrepancy_Problem_in_Temporal_CVPR_2023_paper.pdf) (CVPR 2023)
19. (APN) [Progression-Guided Temporal Action Detection in Videos](https://arxiv.org/abs/2308.09268) (Arxiv 2023) [code](https://github.com/makecent/APN)
20. (Self-DETR) [Self-Feedback DETR for Temporal Action Detection](https://openaccess.thecvf.com/content/ICCV2023/html/Kim_Self-Feedback_DETR_for_Temporal_Action_Detection_ICCV_2023_paper.html) (ICCV 2023)
21. (UnLoc) [UnLoc: A Unified Framework for Video Localization Tasks](https://arxiv.org/abs/2308.11062) (ICCV 2023) [code](https://github.com/google-research/scenic)
22. [Benchmarking Data Efficiency and Computational Efficiency of Temporal Action Localization Models](https://arxiv.org/abs/2308.13082) (ICCV 2023 Workshop)
23. (BAPG) [Boundary-Aware Proposal Generation Method for Temporal Action Localization](https://arxiv.org/abs/2309.13810) (Arxiv 2023)
24. (MENet) [Movement Enhancement toward Multi-Scale Video Feature Representation for Temporal Action Detection](https://openaccess.thecvf.com/content/ICCV2023/html/Zhao_Movement_Enhancement_toward_Multi-Scale_Video_Feature_Representation_for_Temporal_Action_ICCV_2023_paper.html) (ICCV 2023)
25. (MRAV-FF) [Multi-Resolution Audio-Visual Feature Fusion for Temporal Action Localization](https://arxiv.org/abs/2310.03456) (Arxiv 2023)
26. (BDRC-Net) [Boundary Discretization and Reliable Classification Network for Temporal Action Detection](https://arxiv.org/abs/2310.06403) (Arxiv 2023) [code](https://github.com/zhenyingfang/BDRC-Net)
27. (STAN) [STAN: Spatial-Temporal Awareness Network for Temporal Action Detection](https://dl.acm.org/doi/abs/10.1145/3606038.3616169) (ACM MM W 2023)
28. (RefineTAD) [RefineTAD: Learning Proposal-free Refinement for Temporal Action Detection](https://dl.acm.org/doi/abs/10.1145/3581783.3611872) (ACM MM 2023)
29. [SADA: Semantic adversarial unsupervised domain adaptation for Temporal Action Localization](https://arxiv.org/abs/2312.13377) (arXiv 2023) [code](https://github.com/davidpujol/SADA)
## 2022
1. (DCAN) [DCAN: Improving Temporal Action Detection via Dual Context Aggregation](https://arxiv.org/abs/2112.03612) (AAAI 2022)
2. (TVNet) [TVNet: Temporal Voting Network for Action Localization](https://arxiv.org/pdf/2201.00434.pdf) (arxiv 2022) [code](https://github.com/hanielwang/TVNet)
3. (ActionFormer) [ActionFormer: Localizing Moments of Actions with Transformers](https://arxiv.org/abs/2202.07925) (ECCV 2022) [code](https://github.com/happyharrycn/actionformer_release)
4. (SegTAD) [SegTAD: Precise Temporal Action Detection via Semantic Segmentation](https://arxiv.org/abs/2203.01542) (arxiv 2022)
5. (OpenTAL) [OpenTAL: Towards Open Set Temporal Action Localization](https://arxiv.org/pdf/2203.05114.pdf) (CVPR 2022) [code](https://www.rit.edu/actionlab/opental)
6. (TALLFormer) [TALLFormer: Temporal Action Localization with Long-memory Transformer](https://arxiv.org/abs/2204.01680) (CVPR 2022)
7. [An Empirical Study of End-to-End Temporal Action Detection](https://arxiv.org/abs/2204.02932) (CVPR 2022) [code](https://github.com/xlliu7/E2E-TAD)
8. (BREM) [Estimation of Reliable Proposal Quality for Temporal Action Detection](https://arxiv.org/abs/2204.11695) (ACM MM 2022)
9. [Structured Attention Composition for Temporal Action Localization](https://arxiv.org/abs/2205.09956) (TIP 2022) [code](https://github.com/VividLe/Structured-Attention-Composition)
10. (RCL) [RCL: Recurrent Continuous Localization for Temporal Action Detection](https://openaccess.thecvf.com/content/CVPR2022/papers/Wang_RCL_Recurrent_Continuous_Localization_for_Temporal_Action_Detection_CVPR_2022_paper.pdf) (CVPR 2022)
11. (RefactorNet) [Learning to Refactor Action and Co-occurrence Features for Temporal Action Localization](https://openaccess.thecvf.com/content/CVPR2022/papers/Xia_Learning_To_Refactor_Action_and_Co-Occurrence_Features_for_Temporal_Action_CVPR_2022_paper.pdf) (CVPR 2022)
12. (MS-TCT) [MS-TCT: Multi-Scale Temporal ConvTransformer for Action Detection](https://openaccess.thecvf.com/content/CVPR2022/papers/Dai_MS-TCT_Multi-Scale_Temporal_ConvTransformer_for_Action_Detection_CVPR_2022_paper.pdf) (CVPR 2022) [code](https://github.com/dairui01/MS-TCT)
13. (OATD) [One-stage Action Detection Transformer](https://arxiv.org/abs/2206.10080) (EPICKITCHENS-100 2022 V. 26.35 N. 25.83)
14. [Context-aware Proposal Network for Temporal Action Detection](https://arxiv.org/abs/2206.09082) (CVPR-2022 ActivityNet Challenge winning solution)
15. [Dual relation network for temporal action localization](https://www.sciencedirect.com/science/article/abs/pii/S0031320322002060) (Pattern Recognition 2022)
16. [Learning Disentangled Classification and Localization Representations for Temporal Action Localization](https://www.aaai.org/AAAI22Papers/AAAI-926.ZhuZ.pdf) (AAAI 2022)
17. (DDM) [Progressive Attention on Multi-Level Dense Difference Maps for Generic Event Boundary Detection](https://openaccess.thecvf.com/content/CVPR2022/papers/Tang_Progressive_Attention_on_Multi-Level_Dense_Difference_Maps_for_Generic_Event_CVPR_2022_paper.pdf) (CVPR 2022) [code](https://github.com/MCG-NJU/DDM)
18. [Submission to Generic Event Boundary Detection Challenge@CVPR 2022: Local Context Modeling and Global Boundary Decoding Approach](https://arxiv.org/pdf/2206.15268.pdf) (CVPR 2022 Challenge)
19. (HTNet) [HTNet: Anchor-free Temporal Action Localization with Hierarchical Transformers](https://arxiv.org/abs/2207.09662) (arxiv 2022)
20. (STPT) [An Efficient Spatio-Temporal Pyramid Transformer for Action Detection](https://arxiv.org/abs/2207.10448) (ECCV 2022)
21. (TAGS) [Proposal-Free Temporal Action Detection with Global Segmentation Mask Learning](https://arxiv.org/abs/2207.06580) (ECCV 2022) [code](https://github.com/sauradip/TAGS)
22. [Prompting Visual-Language Models for Efficient Video Understanding](https://arxiv.org/abs/2112.04478) (ECCV 2022) [code](https://github.com/ju-chen/Efficient-Prompt)
23. (ReAct) [ReAct: Temporal Action Detection with Relational Queries](https://arxiv.org/abs/2207.07097) (ECCV 2022) [code](https://github.com/sssste/React)
24. (TadTR) [End-to-end Temporal Action Detection with Transformer](https://arxiv.org/abs/2106.10271) (TIP 2022) [code](https://github.com/xlliu7/TadTR)
25. (TAL-MTS) [Temporal Action Localization with Multi-temporal Scales](https://arxiv.org/abs/2208.07493) (arxiv 2022)
26. (AdaPerFormer) [Adaptive Perception Transformer for Temporal Action Localization](https://arxiv.org/abs/2208.11908) (arxiv 2022) [code](https://github.com/SouperO/AdaPerFormer)
27. (PointTAD) [PointTAD: Multi-Label Temporal Action Detection with Learnable Query Points](https://arxiv.org/abs/2210.11035) (NeurIPS 2022) [code](https://github.com/MCG-NJU/PointTAD) (multi action detection, eg: multiTHUMOS, charades)
28. (SoLa) [Soft-Landing Strategy for Alleviating the Task Discrepancy Problem in Temporal Action Localization Tasks](https://arxiv.org/abs/2211.06023) (arxiv 2022)
29. (Re2TAL) [Re2TAL: Rewiring Pretrained Video Backbones for Reversible Temporal Action Localization](https://arxiv.org/pdf/2211.14053.pdf) (arxiv 2022)
30. (MUPPET) [Multi-Modal Few-Shot Temporal Action Detection](https://arxiv.org/abs/2211.14905) (arxiv 2022) [code](https://github.com/sauradip/MUPPET)
31. [Deep Learning-Based Action Detection in Untrimmed Videos: A Survey](https://ieeexplore.ieee.org/document/9839464) (TPAMI 2022)
## 2021
1. (activity graph transformer) [Activity Graph Transformer for Temporal Action Localization](https://arxiv.org/abs/2101.08540) (arxiv 2021) [project](https://www.sfu.ca/~mnawhal/projects/agt.html) [code](https://github.com/Nmegha2601/activitygraph_transformer)
2. [Coarse-Fine Networks for Temporal Activity Detection in Videos](https://arxiv.org/abs/2103.01302) (CVPR 2021) [code](https://github.com/kkahatapitiya/Coarse-Fine-Networks)
3. (MLAD) [Modeling Multi-Label Action Dependencies for Temporal Action Localization](https://arxiv.org/abs/2103.03027) (CVPR 2021)
4. (PcmNet) [PcmNet: Position-Sensitive Context Modeling Network for Temporal Action Localization](https://arxiv.org/abs/2103.05270) (TIP 2021)
5. (AFSD) [Learning Salient Boundary Feature for Anchor-free Temporal Action Localization](https://arxiv.org/abs/2103.13137) (CVPR 2021) [code](https://github.com/TencentYoutuResearch/ActionDetection-AFSD?utm_source=catalyzex.com)
6. [Low-Fidelity End-to-End Video Encoder Pre-training for Temporal Action Localization](https://arxiv.org/abs/2103.15233) (arxiv 2021)
7. [Read and Attend: Temporal Localisation in Sign Language Videos](https://arxiv.org/abs/2103.16481) (CVPR 2021) (Sign Language Videos)
8. [Low Pass Filter for Anti-aliasing in Temporal Action Localization](https://arxiv.org/abs/2104.11403) (arxiv 2021)
9. [FineAction: A Fined Video Dataset for Temporal Action Localization](https://arxiv.org/abs/2105.11107) (One track of DeeperAction Workshop@ICCV2021) [Homepage](https://deeperaction.github.io/fineaction/)
10. [Three Birds with One Stone: Multi-Task Temporal Action Detection via Recycling Temporal Annotations](https://openaccess.thecvf.com/content/CVPR2021/html/Li_Three_Birds_with_One_Stone_Multi-Task_Temporal_Action_Detection_via_CVPR_2021_paper.html) (CVPR 2021)
11. [Proposal Relation Network for Temporal Action Detection](https://arxiv.org/abs/2106.11812) (CVPRW 2021)
12. [Exploring Stronger Feature for Temporal Action Localization](https://arxiv.org/abs/2106.13014) (CVPRW 2021)
13. (SRF-Net) [SRF-Net: Selective Receptive Field Network for Anchor-Free Temporal Action Detection](https://arxiv.org/abs/2106.15258) (ICASSP 2021)
14. [RGB Stream Is Enough for Temporal Action Detection](https://arxiv.org/abs/2107.04362) (arxiv 2021) [code](https://github.com/Media-Smart/vedatad?utm_source=catalyzex.com)
15. (AVFusion) [Hear Me Out: Fusional Approaches for Audio Augmented Temporal Action Localization](https://arxiv.org/pdf/2106.14118v1.pdf) (arxiv 2021) [Code](https://github.com/skelemoa/tal-hmo)
16. [Transferable Knowledge-Based Multi-Granularity Aggregation Network for Temporal Action Localization: Submission to ActivityNet Challenge 2021](https://arxiv.org/abs/2107.12618) (HACS challenge 2021)
17. [Enriching Local and Global Contexts for Temporal Action Localization](https://arxiv.org/abs/2107.12960) (ICCV 2021)
18. (CSA) [Class Semantics-based Attention for Action Detection](https://arxiv.org/abs/2109.02613) (ICCV 2021)
19. (SP-TAD) [Towards High-Quality Temporal Action Detection with Sparse Proposals](https://arxiv.org/abs/2109.08847) (arxiv 2021) [Code](https://github.com/wjn922/SP-TAD)
20. [Few-Shot Temporal Action Localization with Query Adaptive Transformer](https://arxiv.org/abs/2110.10552) (BMVC 2021) [code](https://github.com/sauradip/fewshotQAT) (Few-Shot)
21. [Graph Convolutional Module for Temporal Action Localization in Videos](https://arxiv.org/abs/2112.00302) (TPAMI 2021)
22. [MS-TCT: Multi-Scale Temporal ConvTransformer for Action Detection](https://arxiv.org/abs/2112.03902) (arxiv 2021)
23. (VSGN) [Video Self-Stitching Graph Network for Temporal Action Localization](https://arxiv.org/abs/2011.14598) (ICCV 2021) [code](https://github.com/coolbay/VSGN)
24. (MUSES) [Multi-shot Temporal Event Localization: a Benchmark](https://arxiv.org/abs/2012.09434) (CVPR 2021) [project](https://songbai.site/muses/) [code](https://github.com/xlliu7/MUSES) [dataset](https://songbai.site/muses/)
## 2020
1. (G-TAD) **G-TAD: Sub-Graph Localization for Temporal Action Detection** (CVPR 2020) [paper](https://arxiv.org/abs/1911.11462) [code.PyTorch](https://github.com/frostinassiky/gtad) [video](https://www.youtube.com/watch?v=BlPxnDcykUo)
2. (AGCN-P-3DCNNs) **Graph Attention based Proposal 3D ConvNets for Action Detection** (AAAI 2020) [paper](https://www.aaai.org/Papers/AAAI/2020GB/AAAI-LiJ.1424.pdf)
3. (PBRNet) **Progressive Boundary Refinement Network for Temporal Action Detection** (AAAI 2020) [paper](https://www.aaai.org/Papers/AAAI/2020GB/AAAI-LiuQ.4870.pdf)
4. (TsaNet) **Scale Matters: Temporal Scale Aggregation Network for Precise Action Localization in Untrimmed Videos** (ICME 2020) [paper](https://arxiv.org/abs/1908.00707)
5. **Constraining Temporal Relationship for Action Localization** (arxiv 2020) [paper](https://arxiv.org/abs/2002.07358)
6. (CBR-Net) **CBR-Net: Cascade Boundary Refinement Network for Action Detection: Submission to ActivityNet Challenge 2020 (Task 1)** (ActivityNet Challenge 2020) [paper](https://arxiv.org/abs/2006.07526v2)
7. [Temporal Action Localization with Variance-Aware Networks](https://arxiv.org/abs/2008.11254) (arxiv 2020)
8. [Boundary Uncertainty in a Single-Stage Temporal Action Localization Network](https://arxiv.org/abs/2008.11170) (arxiv 2020, Tech report)
9. [Revisiting Anchor Mechanisms for Temporal Action Localization](https://arxiv.org/abs/2008.09837) (TIP 2020) [code.PyTorch](https://github.com/VividLe/A2Net?utm_source=catalyzex.com)
10. (C-TCN) [Deep Concept-wise Temporal Convolutional Networks for Action Localization](https://arxiv.org/abs/1908.09442) (ACM MM 2020) [code.PaddlePaddle](https://github.com/PaddlePaddle/models/tree/develop/PaddleCV/video)
11. (MLTPN) [Multi-Level Temporal Pyramid Network for Action Detection](https://arxiv.org/abs/2008.03270) (PRCV 2020)
12. (SALAD) [SALAD: Self-Assessment Learning for Action Detection](https://arxiv.org/abs/2011.06958) (arxiv 2020)
## 2019
1. (CMS-RC3D) **Contextual Multi-Scale Region Convolutional 3D Network for Activity Detection** (ICCVBIC 2019) [paper](https://arxiv.org/abs/1801.09184)
2. (TGM) **Temporal Gaussian Mixture Layer for Videos** (ICML 2019) [paper](https://arxiv.org/abs/1803.06316) [code.PyTorch](https://github.com/piergiaj/tgm-icml19)
3. (Decouple-SSAD) **Decoupling Localization and Classification in Single Shot Temporal Action Detection** (ICME 2019) [paper](https://arxiv.org/abs/1904.07442) [code.TensorFlow](https://github.com/HYPJUDY/Decouple-SSAD)
4. **Exploring Feature Representation and Training strategies in Temporal Action Localization** (ICIP 2019) [paper](https://arxiv.org/abs/1905.10608)
5. (PGCN) **Graph Convolutional Networks for Temporal Action Localization** (ICCV 2019) [paper](https://arxiv.org/abs/1909.03252) [code.PyTorch](https://github.com/Alvin-Zeng/PGCN)
6. (S-2D-TAN) **Learning Sparse 2D Temporal Adjacent Networks for Temporal Action Localization** (ICCV 2019) (*winner solution for the HACS Temporal Action Localization Challenge at ICCV 2019*) [paper](https://arxiv.org/abs/1912.03612)
- (2D-TAN) **Learning 2D Temporal Adjacent Networks for Moment Localization with Natural Language** (AAAI 2020) [paper](https://arxiv.org/abs/1912.03590) [code.PyTorch](https://github.com/microsoft/2D-TAN)
7. (LCDC) **Learning Motion in Feature Space: Locally-Consistent Deformable Convolution Networks for Fine-Grained Action Detection** (ICCV 2019) [paper](https://arxiv.org/abs/1811.08815) [slide](https://knmac.github.io/projects/lcdc/LCDC_slides_extended.pdf) [code.TensorFlow](https://github.com/knmac/LCDC_release)
8. (BLP) **BLP -- Boundary Likelihood Pinpointing Networks for Accurate Temporal Action Localization** (ICASSP 2019) [paper](https://arxiv.org/abs/1811.02189)
9. (GTAN) **Gaussian Temporal Awareness Networks for Action Localization** (CVPR 2019) [paper](https://arxiv.org/abs/1909.03877)
10. **Temporal Action Localization using Long Short-Term Dependency** (arxiv 2019) [paper](https://arxiv.org/abs/1911.01060)
11. **Relation Attention for Temporal Action Localization** (IEEE Trans TMM 2019) [paper](https://ieeexplore.ieee.org/document/8933113/versions)
12. (AFO-TAD) **AFO-TAD: Anchor-free One-Stage Detector for Temporal Action Detection** (arxiv 2019) [paper](https://arxiv.org/abs/1910.08250)
13. (DBS) **Video Imprint Segmentation for Temporal Action Detection in Untrimmed Videos** (AAAI 2019) [paper](https://www.aaai.org/ojs/index.php/AAAI/article/view/4846)

## 2018
1. **Diagnosing Error in Temporal Action Detectors** (ECCV 2018) [paper](http://openaccess.thecvf.com/content_ECCV_2018/papers/Humam_Alwassel_Diagnosing_Error_in_ECCV_2018_paper.pdf)
2. (ETP) **Precise Temporal Action Localization by Evolving Temporal Proposals** (ICMR 2018) [paper](https://arxiv.org/abs/1804.04803)
3. (Action Search) **Action Search: Spotting Actions in Videos and Its Application to Temporal Action Localization** (ECCV 2018) [paper](https://arxiv.org/abs/1706.04269) [code.TensorFlow](https://github.com/HumamAlwassel/action-search)
4. (TAL-Net) **Rethinking the Faster R-CNN Architecture for Temporal Action Localization** (CVPR 2018) [paper](https://arxiv.org/abs/1804.07667)
5. **One-shot Action Localization by Learning Sequence Matching Network** (CVPR 2018) [paper](http://www.porikli.com/mysite/pdfs/porikli%202018%20-%20One-shot%20action%20localization%20by%20learning%20sequence%20matching%20network.pdf)
6. **Temporal Action Detection by Joint Identification-Verification** (arxiv 2018) [paper](https://arxiv.org/abs/1810.08375)
7. (TPC) **Exploring Temporal Preservation Networks for Precise Temporal Action Localization** (AAAI 2018) [paper](https://arxiv.org/abs/1708.03280)
8. (SAP) **A Self-Adaptive Proposal Model for Temporal Action Detection based on Reinforcement Learning** (AAAI 2018) [paper](https://arxiv.org/abs/1706.07251) [code.Torch](https://github.com/hjjpku/Action_Detection_DQN)

## 2017
1. (TCN) **Temporal Context Network for Activity Localization in Videos** (ICCV 2017) [paper](https://arxiv.org/abs/1708.02349) [code.caffe](https://github.com/vdavid70619/TCN)
2. (SSN) **Temporal Action Detection with Structured Segment Networks** (ICCV 2017) [paper](https://arxiv.org/abs/1704.06228) [code.PyTorch](https://github.com/yjxiong/action-detection)
3. (R-C3D) **R-C3D: Region Convolutional 3D Network for Temporal Activity Detection** (ICCV 2017) [paper](https://arxiv.org/abs/1703.07814) [code.caffe](https://github.com/VisionLearningGroup/R-C3D) [code.PyTorch](https://github.com/sunnyxiaohu/R-C3D.pytorch)
4. (TCNs) **Temporal Convolutional Networks for Action Segmentation and Detection** (CVPR 2017) [paper](https://arxiv.org/abs/1611.05267) [code.TensorFlow](https://github.com/colincsl/TemporalConvolutionalNetworks)
5. (SMS) **Temporal Action Localization by Structured Maximal Sums** (CVPR 2017) [paper](https://arxiv.org/abs/1704.04671) [code](https://github.com/shallowyuan/struct-max-sum)
6. (SCC) **SCC: Semantic Context Cascade for Efficient Action Detection** (CVPR 2017) [paper](http://openaccess.thecvf.com/content_cvpr_2017/papers/Heilbron_SCC_Semantic_Context_CVPR_2017_paper.pdf)
7. (CDC) **CDC: Convolutional-De-Convolutional Networks for Precise Temporal Action Localization in Untrimmed Videos** (CVPR 2017) [paper](https://arxiv.org/abs/1703.01515) [code](https://bitbucket.org/columbiadvmm/cdc/src/master/) [project](http://www.ee.columbia.edu/ln/dvmm/researchProjects/cdc/cdc.html)
8. (SS-TAD) **End-to-End, Single-Stream Temporal Action Detection in Untrimmed Videos** (BMVC 2017) [paper](http://vision.stanford.edu/pdf/buch2017bmvc.pdf) [code.PyTorch](https://github.com/shyamal-b/ss-tad/)
9. (CBR) **Cascaded Boundary Regression for Temporal Action Detection** (BMVC 2017) [paper](https://arxiv.org/abs/1705.01180) [code.TensorFlow](https://github.com/jiyanggao/CBR)
10. (SSAD) **Single Shot Temporal Action Detection** (ACM MM 2017) [paper](https://arxiv.org/abs/1710.06236)

## before
1. (PSDF) **Temporal Action Localization with Pyramid of Score Distribution Features** (CVPR 2016) [paper](https://www.zpascal.net/cvpr2016/Yuan_Temporal_Action_Localization_CVPR_2016_paper.pdf)
2. **Temporal Action Detection using a Statistical Language Model** (CVPR 2016) [paper](https://www.zpascal.net/cvpr2016/Richard_Temporal_Action_Detection_CVPR_2016_paper.pdf) [code](https://github.com/alexanderrichard/squirrel)
3. (S-CNN) **Temporal Action Localization in Untrimmed Videos via Multi-stage CNNs** (CVPR 2016) [paper](https://arxiv.org/abs/1601.02129) [code](https://github.com/zhengshou/scnn/) [project](http://www.ee.columbia.edu/ln/dvmm/researchProjects/cdc/scnn.html)
4. **End-to-end Learning of Action Detection from Frame Glimpses in Videos** (CVPR 2016) [paper](https://arxiv.org/abs/1511.06984) [code](https://github.com/syyeung/frameglimpses)

----

# **Papers: Weakly Supervised Temporal Action Detection**

## 2024
1. [Weakly-Supervised Temporal Action Localization by Inferring Snippet-Feature Affinity](https://arxiv.org/abs/2303.12332) (AAAI 2024)
2. (HR-Pro) [HR-Pro: Point-supervised Temporal Action Localization via Hierarchical Reliability Propagation](https://arxiv.org/abs/2308.12608) (AAAI 2024) [code](https://github.com/pipixin321/HR-Pro)
3. [STAT: Towards Generalizable Temporal Action Localization](https://arxiv.org/abs/2404.13311) (Arxiv 2024)
4. (TSPNet) [Realigning Confidence with Temporal Saliency Information for Point-level Weakly-Supervised Temporal Action Localization](https://openaccess.thecvf.com/content/CVPR2024/html/Xia_Realigning_Confidence_with_Temporal_Saliency_Information_for_Point-Level_Weakly-Supervised_Temporal_CVPR_2024_paper.html) (CVPR 2024) [code](https://github.com/zyxia1009/CVPR2024-TSPNet)
5. (M2PT) [Weakly-Supervised Temporal Action Localization with Multi-Modal Plateau Transformers](https://openaccess.thecvf.com/content/CVPR2024W/L3D-IVU/html/Hu_Weakly-Supervised_Temporal_Action_Localization_with_Multi-Modal_Plateau_Transformers_CVPRW_2024_paper.html) (CVPR Workshop 2024)
6. (EPNet) [Ensemble Prototype Network For Weakly Supervised Temporal Action Localization](https://ieeexplore.ieee.org/abstract/document/10479157) (TNNLS 2024)
7. (FuSTAL) [Full-Stage Pseudo Label Quality Enhancement for Weakly-supervised Temporal Action Localization](https://arxiv.org/abs/2407.08971) (arXiv 2024) [code](https://github.com/fqhank/FuSTAL)
8. (PVLR) [Probabilistic Vision-Language Representation for Weakly Supervised Temporal Action Localization](https://arxiv.org/abs/2408.05955) (ACM MM 2024) [code](https://github.com/sejong-rcv/PVLR)
9. (zero-shot) [Towards Completeness: A Generalizable Action Proposal Generator for Zero-Shot Temporal Action Localization](https://arxiv.org/abs/2408.13777) (ICPR 2024) [code](https://github.com/Run542968/GAP)
10. (SMBD) [Stepwise Multi-grained Boundary Detector for Point-supervised Temporal Action Localization](https://eccv.ecva.net/virtual/2024/poster/390) (ECCV 2024)
11. [Zero-shot Action Localization via the Confidence of Large Vision-Language Models](https://arxiv.org/abs/2410.14340) (arXiv 2024)

## 2023
1. (ASCN) [A Novel Action Saliency and Context-Aware Network for Weakly-Supervised Temporal Action Localization](https://ieeexplore.ieee.org/abstract/document/10007033) (TMM 2023)
2. (TFE-DCN) [Temporal Feature Enhancement Dilated Convolution Network for Weakly-Supervised Temporal Action Localization](https://openaccess.thecvf.com/content/WACV2023/html/Zhou_Temporal_Feature_Enhancement_Dilated_Convolution_Network_for_Weakly-Supervised_Temporal_Action_WACV_2023_paper.html) (WACV 2023)
3. (JCDNet) [JCDNet: Joint of Common and Definite phases Network for Weakly Supervised Temporal Action Localization](https://arxiv.org/abs/2303.17294) (Arxiv 2023)
4. (P-MIL) [Proposal-Based Multiple Instance Learning for Weakly-Supervised Temporal Action Localization](https://openaccess.thecvf.com/content/CVPR2023/html/Ren_Proposal-Based_Multiple_Instance_Learning_for_Weakly-Supervised_Temporal_Action_Localization_CVPR_2023_paper.html) (CVPR 2023) [code](https://github.com/RenHuan1999/CVPR2023_P-MIL)
5. [Two-Stream Networks for Weakly-Supervised Temporal Action Localization With Semantic-Aware Mechanisms](https://openaccess.thecvf.com/content/CVPR2023/html/Wang_Two-Stream_Networks_for_Weakly-Supervised_Temporal_Action_Localization_With_Semantic-Aware_Mechanisms_CVPR_2023_paper.html) (CVPR 2023)
6. [Boosting Weakly-Supervised Temporal Action Localization with Text Information](https://arxiv.org/abs/2305.00607) (CVPR 2023) [code](https://github.com/lgzlIlIlI/Boosting-WTAL)
7. (PivoTAL) [PivoTAL: Prior-Driven Supervision for Weakly-Supervised Temporal Action Localization](https://openaccess.thecvf.com/content/CVPR2023/html/Rizve_PivoTAL_Prior-Driven_Supervision_for_Weakly-Supervised_Temporal_Action_Localization_CVPR_2023_paper.html) (CVPR 2023)
8. [Improving Weakly Supervised Temporal Action Localization by Bridging Train-Test Gap in Pseudo Labels](https://openaccess.thecvf.com/content/CVPR2023/papers/Zhou_Improving_Weakly_Supervised_Temporal_Action_Localization_by_Bridging_Train-Test_Gap_CVPR_2023_paper.pdf) (CVPR 2023) [code](https://github.com/zhou745/GauFuse_WSTAL)
9. (MTP) [Multiple Temporal Pooling Mechanisms for Weakly Supervised Temporal Action Localization](https://dl.acm.org/doi/10.1145/3567828) (TOMM 2023)
10. (VQK-Net) [Video-Specific Query-Key Attention Modeling for Weakly-Supervised Temporal Action Localization](https://arxiv.org/abs/2305.04186)
11. (DFE) [Dual-Feature Enhancement for Weakly Supervised Temporal Action Localization](https://ieeexplore.ieee.org/abstract/document/10096383) (ICASSP 2023)
12. (FBA-Net) [Collaborative Foreground, Background, and Action Modeling Network for Weakly Supervised Temporal Action Localization](https://ieeexplore.ieee.org/abstract/document/10115434) (TCSVT 2023)
13. (Bi-SCC) [Weakly Supervised Temporal Action Localization With Bidirectional Semantic Consistency Constraint](https://ieeexplore.ieee.org/abstract/document/10115234) (TNNLS 2023)
14. (F3-Net) [Feature Weakening, Contextualization, and Discrimination for Weakly Supervised Temporal Action Localization](https://ieeexplore.ieee.org/abstract/document/10091234) (TMM 2023) [code](https://moniruzzamanmd.github.io/F3-Net/)
15. (LPR) [Learning Proposal-aware Re-ranking for Weakly-supervised Temporal Action Localization](https://ieeexplore.ieee.org/abstract/document/10144792) (TCSVT 2023)
16. (STCL-Net) [Semantic and Temporal Contextual Correlation Learning for Weakly-Supervised Temporal Action Localization](https://ieeexplore.ieee.org/abstract/document/10155179) (TPAMI 2023)
17. [Distilling Vision-Language Pre-training to Collaborate with Weakly-Supervised Temporal Action Localization](https://arxiv.org/abs/2212.09335) (CVPR 2023)
18. [Weakly-Supervised Action Localization by Hierarchically-structured Latent Attention Modeling](https://arxiv.org/abs/2308.09946) (ICCV 2023)
19. [Cross-Video Contextual Knowledge Exploration and Exploitation for Ambiguity Reduction in Weakly Supervised Temporal Action Localization](https://arxiv.org/abs/2308.12609) (TCSVT 2023)
20. (SPL-Loc) [Sub-action Prototype Learning for Point-level Weakly-supervised Temporal Action Localization](https://arxiv.org/abs/2309.09060) (arXiv 2023)
21. (DDG-Net) [DDG-Net: Discriminability-Driven Graph Network for Weakly-supervised Temporal Action Localization](https://openaccess.thecvf.com/content/ICCV2023/html/Tang_DDG-Net_Discriminability-Driven_Graph_Network_for_Weakly-supervised_Temporal_Action_Localization_ICCV_2023_paper.html) (ICCV 2023) [code](https://github.com/XiaojunTang22/ICCV2023-DDGNet)
22. [Proposal-based Temporal Action Localization with Point-level Supervision](https://arxiv.org/abs/2310.05511) (BMVC 2023)
23. (LPR) [LPR: learning point-level temporal action localization through re-training](https://link.springer.com/article/10.1007/s00530-023-01128-4) (MMSJ 2023)
24. (POTLoc) [POTLoc: Pseudo-Label Oriented Transformer for Point-Supervised Temporal Action Localization](https://arxiv.org/abs/2310.13585) (arXiv 2023)
25. (ADM-Loc) [ADM-Loc: Actionness Distribution Modeling for Point-supervised Temporal Action Localization](https://arxiv.org/abs/2311.15916) (arXiv 2023)
26. [Revisiting Foreground and Background Separation in Weakly-supervised Temporal Action Localization: A Clustering-based Approach](https://openaccess.thecvf.com/content/ICCV2023/html/Liu_Revisiting_Foreground_and_Background_Separation_in_Weakly-supervised_Temporal_Action_Localization_ICCV_2023_paper.html) (ICCV 2023) [code](https://github.com/Qinying-Liu/CASE)
## 2022
1. (ACGNet) [ACGNet: Action Complement Graph Network for Weakly-supervised Temporal Action Localization](https://arxiv.org/pdf/2112.10977.pdf) (AAAI 2022)
2. (RSKP) [Weakly Supervised Temporal Action Localization via Representative Snippet Knowledge Propagation](https://arxiv.org/pdf/2203.02925.pdf) (CVPR 2022) [code](https://github.com/LeonHLJ/RSKP)
3. (ASM-Loc) [ASM-Loc: Action-aware Segment Modeling for Weakly-Supervised Temporal Action Localization](https://arxiv.org/abs/2203.15187) (CVPR 2022) [code](https://github.com/boheumd/ASM-Loc)
4. (FTCL) [Fine-grained Temporal Contrastive Learning for Weakly-supervised Temporal Action Localization](https://arxiv.org/abs/2203.16800) (CVPR 2022) [code](https://github.com/MengyuanChen21/CVPR2022-FTCL)
5. (C3BN) [Convex Combination Consistency between Neighbors for Weakly-supervised Action Localization](https://arxiv.org/abs/2205.00400) (arxiv 2022)
6. (DCC) [Exploring Denoised Cross-video Contrast for Weakly-supervised Temporal Action Localization](https://openaccess.thecvf.com/content/CVPR2022/papers/Li_Exploring_Denoised_Cross-Video_Contrast_for_Weakly-Supervised_Temporal_Action_Localization_CVPR_2022_paper.pdf) (CVPR 2022)
7. (HAAN) [Weakly-Supervised Temporal Action Detection for Fine-Grained Videos with Hierarchical Atomic Actions](https://arxiv.org/abs/2207.11805) (ECCV 2022) [code](https://github.com/lizhi1104/HAAN)
8. (STALE) (**Zero-Shot**) [Zero-Shot Temporal Action Detection via Vision-Language Prompting](https://arxiv.org/abs/2207.08184) (ECCV 2022) [code](https://github.com/sauradip/stale)
9. (SMEN) [Slow Motion Matters: A Slow Motion Enhanced Network for Weakly Supervised Temporal Action Localization](https://arxiv.org/abs/2211.11324) (TCSVT 2022)
10. [Dilation-Erosion for Single-Frame Supervised Temporal Action Localization](https://arxiv.org/abs/2212.06348) (arxiv 2022)
11. (AMS) [Adaptive Mutual Supervision for Weakly-Supervised Temporal Action Localization](https://arxiv.org/abs/2104.02357) (TMM 2022)
12. (DELU) [Dual-Evidential Learning for Weakly-supervised Temporal Action Localization](https://link.springer.com/chapter/10.1007/978-3-031-19772-7_12) (ECCV 2022) [code](https://github.com/MengyuanChen21/ECCV2022-DELU)

## 2021
1. (HAM-Net) [A Hybrid Attention Mechanism for Weakly-Supervised Temporal Action Localization](https://arxiv.org/abs/2101.00545) (AAAI 2021)
2. [Cross-Attentional Audio-Visual Fusion for Weakly-Supervised Action Localization](https://openreview.net/forum?id=hWr3e3r-oH5) (ICLR 2021)
3. [Weakly-supervised Temporal Action Localization by Uncertainty Modeling](https://arxiv.org/abs/2006.07006) (AAAI 2021) [code](https://github.com/Pilhyeon/WTAL-Uncertainty-Modeling)
4. (TS-PCA) [The Blessings of Unlabeled Background in Untrimmed Videos](https://arxiv.org/abs/2103.13183) (CVPR 2021) [code](https://github.com/aliyun/The-Blessings-of-Unlabeled-Background-in-Untrimmed-Videos)
5. (ACSNet) [ACSNet: Action-Context Separation Network for Weakly Supervised Temporal Action Localization](https://arxiv.org/abs/2103.15088) (AAAI 2021)
6. (CoLA) [CoLA: Weakly-Supervised Temporal Action Localization with Snippet Contrastive Learning](https://arxiv.org/abs/2103.16392) (CVPR 2021)
7. [Weakly Supervised Temporal Action Localization Through Learning Explicit Subspaces for Action and Context](https://arxiv.org/abs/2103.16155) (AAAI 2021)
8. [ACM-Net: Action Context Modeling Network for Weakly-Supervised Temporal Action Localization](https://arxiv.org/abs/2104.02967) (arxiv 2021, submitted to TIP) [code](https://github.com/ispc-lab/ACM-Net)
9. (AUMN) [Action Unit Memory Network for Weakly Supervised Temporal Action Localization](https://arxiv.org/abs/2104.14135) (CVPR 2021)
10. (ASL) [Weakly Supervised Action Selection Learning in Video](https://arxiv.org/abs/2105.02439) (CVPR 2021)
11. (ActShufNet) [Action Shuffling for Weakly Supervised Temporal Localization](https://arxiv.org/abs/2105.04208) (arxiv 2021)
12. [Few-Shot Action Localization without Knowing Boundaries](https://arxiv.org/abs/2106.04150) (arxiv 2021)
13. [Uncertainty Guided Collaborative Training for Weakly Supervised Temporal Action Detection](https://openaccess.thecvf.com/content/CVPR2021/html/Yang_Uncertainty_Guided_Collaborative_Training_for_Weakly_Supervised_Temporal_Action_Detection_CVPR_2021_paper.html) (CVPR 2021)
14. [Two-Stream Consensus Network: Submission to HACS Challenge 2021 Weakly-Supervised Learning Track](https://arxiv.org/abs/2106.10829) (CVPRW 2021)
15. [Weakly-Supervised Temporal Action Localization Through Local-Global Background Modeling](https://arxiv.org/abs/2106.11811) (CVPRW 2021)
16. [Cross-modal Consensus Network for Weakly Supervised Temporal Action Localization](https://arxiv.org/abs/2107.12589) (ACM MM 2021) [code](https://github.com/harlanhong/MM2021-CO2-Net)
17. [Learning Action Completeness from Points for Weakly-supervised Temporal Action Localization](https://arxiv.org/abs/2108.05029) (ICCV 2021) [code](https://github.com/Pilhyeon/Learning-Action-Completeness-from-Points)
18. [Deep Motion Prior for Weakly-Supervised Temporal Action Localization](https://arxiv.org/abs/2108.05607) (submitted to TIP 2021) [project](https://sites.google.com/view/mengcao/publication/dmp-net?authuser=0)
19. [Foreground-Action Consistency Network for Weakly Supervised Temporal Action Localization](https://openaccess.thecvf.com/content/ICCV2021/papers/Huang_Foreground-Action_Consistency_Network_for_Weakly_Supervised_Temporal_Action_Localization_ICCV_2021_paper.pdf) (ICCV 2021)
20. (BackTAL) [Background-Click Supervision for Temporal Action Localization](https://arxiv.org/abs/2111.12449) (TPAMI 2021) [code](https://github.com/VividLe/BackTAL)
21. (ACN) [Action Coherence Network for Weakly-Supervised Temporal Action Localization](https://ieeexplore.ieee.org/abstract/document/9404867) (TMM 2021)

## 2020
1. (WSGN) **Weakly Supervised Gaussian Networks for Action Detection** (WACV 2020) [paper](https://arxiv.org/abs/1904.07774)
2. **Weakly Supervised Temporal Action Localization Using Deep Metric Learning** (WACV 2020) [paper](https://arxiv.org/abs/2001.07793)
3. **Action Graphs: Weakly-supervised Action Localization with Graph Convolution Networks** (WACV 2020) [paper](https://arxiv.org/abs/2002.01449)
4. (DGAM) **Weakly-Supervised Action Localization by Generative Attention Modeling** (CVPR 2020) [paper](https://arxiv.org/abs/2003.12424) [code.PyTorch](https://github.com/bfshi/DGAM-Weakly-Supervised-Action-Localization)
5. (EM-MIL) **Weakly-Supervised Action Localization with Expectation-Maximization Multi-Instance Learning** (ECCV 2020) [paper](https://arxiv.org/abs/2004.00163)
6. **Relational Prototypical Network for Weakly Supervised Temporal Action Localization** (AAAI 2020) [paper](https://aaai.org/Papers/AAAI/2020GB/AAAI-HuangL.1235.pdf)
7. (BaS-Net) **Background Suppression Network for Weakly-supervised Temporal Action Localization** (AAAI 2020) [paper](https://arxiv.org/abs/1911.09963) [code.PyTorch](https://github.com/Pilhyeon/BaSNet-pytorch)
8. **Background Modeling via Uncertainty Estimation for Weakly-supervised Action Localization** (arxiv 2020) [paper](https://arxiv.org/abs/2006.07006) [code.PyTorch](https://github.com/Pilhyeon/Background-Modeling-via-Uncertainty-Estimation)
9. (A2CL-PT) **Adversarial Background-Aware Loss for Weakly-supervised Temporal Activity Localization** (ECCV 2020) [paper](https://arxiv.org/abs/2007.06643) [code.PyTorch](https://github.com/MichiganCOG/A2CL-PT)
10. **Weakly Supervised Temporal Action Localization with Segment-Level Labels** (arxiv 2020)
11. (ECM) **Equivalent Classification Mapping for Weakly Supervised Temporal Action Localization** (arxiv 2020 -> TPAMI 2022) [paper](https://arxiv.org/abs/2008.07728v1)
12. [Two-Stream Consensus Network for Weakly-Supervised Temporal Action Localization](https://arxiv.org/abs/2010.11594v1) (ECCV 2020 spotlight)
13. [Learning Temporal Co-Attention Models for Unsupervised Video Action Localization](https://openaccess.thecvf.com/content_CVPR_2020/html/Gong_Learning_Temporal_Co-Attention_Models_for_Unsupervised_Video_Action_Localization_CVPR_2020_paper.html) (CVPR 2020)
14. [Action Completeness Modeling with Background Aware Networks for Weakly-Supervised Temporal Action Localization](https://dl.acm.org/doi/abs/10.1145/3394171.3413687) (ACM MM 2020)
15. (D2-Net) [D2-Net: Weakly-Supervised Action Localization via Discriminative Embeddings and Denoised Activations](https://arxiv.org/abs/2012.06440) (arxiv 2020) (THUMOS'14 mAP@0.5: 35.9)
16. (SF-Net) [SF-Net: Single-Frame Supervision for Temporal Action Localization](https://arxiv.org/abs/2003.06845) (ECCV 2020) [code.PyTorch](https://github.com/Flowerfan/SF-Net)
17. [Point-Level Temporal Action Localization: Bridging Fully-supervised Proposals to Weakly-supervised Losses](https://arxiv.org/abs/2012.08236) (arxiv 2020)
18. [Transferable Knowledge-Based Multi-Granularity Fusion Network for Weakly Supervised Temporal Action Detection](https://ieeexplore.ieee.org/abstract/document/9105103/keywords#keywords) (TMM 2020)
19. [ActionBytes: Learning From Trimmed Videos to Localize Actions](https://openaccess.thecvf.com/content_CVPR_2020/html/Jain_ActionBytes_Learning_From_Trimmed_Videos_to_Localize_Actions_CVPR_2020_paper.html) (CVPR 2020)

## 2019
1. (AdapNet) **AdapNet: Adaptability Decomposing Encoder-Decoder Network for Weakly Supervised Action Recognition and Localization** (IEEE Transactions on Neural Networks and Learning Systems) [paper](https://arxiv.org/abs/1911.11961)
2. **Breaking Winner-Takes-All: Iterative-Winners-Out Networks for Weakly Supervised Temporal Action Localization** (IEEE Transactions on Image Processing) [paper](https://tanmingkui.github.io/files/publications/Breaking.pdf)
3. **Weakly-Supervised Temporal Localization via Occurrence Count Learning** (ICML 2019) [paper](https://arxiv.org/abs/1905.07293) [code.TensorFlow](https://github.com/SchroeterJulien/ICML-2019-Weakly-Supervised-Temporal-Localization-via-Occurrence-Count-Learning)
4. (MAAN) **Marginalized Average Attentional Network for Weakly-Supervised Learning** (ICLR 2019) [paper](https://arxiv.org/abs/1905.08586) [code.PyTorch](https://github.com/yyuanad/MAAN)
5. **Weakly-supervised Action Localization with Background Modeling** (ICCV 2019) [paper](https://arxiv.org/abs/1908.06552)
6. (TSM) **Temporal Structure Mining for Weakly Supervised Action Detection** (ICCV 2019) [paper](http://openaccess.thecvf.com/content_ICCV_2019/papers/Yu_Temporal_Structure_Mining_for_Weakly_Supervised_Action_Detection_ICCV_2019_paper.pdf)
7. (CleanNet) **Weakly Supervised Temporal Action Localization through Contrast based Evaluation Networks** (ICCV 2019) [paper](http://openaccess.thecvf.com/content_ICCV_2019/html/Liu_Weakly_Supervised_Temporal_Action_Localization_Through_Contrast_Based_Evaluation_Networks_ICCV_2019_paper.html)
8. (3C-Net) **3C-Net: Category Count and Center Loss for Weakly-Supervised Action Localization** (ICCV 2019) [paper](https://arxiv.org/abs/1908.08216) [code.PyTorch](https://github.com/naraysa/3c-net)
9. (CMCS) **Completeness Modeling and Context Separation for Weakly Supervised Temporal Action Localization** (CVPR 2019) [paper](http://openaccess.thecvf.com/content_CVPR_2019/papers/Liu_Completeness_Modeling_and_Context_Separation_for_Weakly_Supervised_Temporal_Action_CVPR_2019_paper.pdf) [code.PyTorch](https://github.com/Finspire13/CMCS-Temporal-Action-Localization)
10. (RefineLoc) **RefineLoc: Iterative Refinement for Weakly-Supervised Action Localization** (arxiv 2019) [paper](https://arxiv.org/abs/1904.00227) [homepage](http://humamalwassel.com/publication/refineloc/)
11. (ASSG) **Adversarial Seeded Sequence Growing for Weakly-Supervised Temporal Action Localization** (ACM MM 2019) [paper](https://arxiv.org/abs/1908.02422)
12. (TSRNet) **Learning Transferable Self-attentive Representations for Action Recognition in Untrimmed Videos with Weak Supervision** (AAAI 2019) [paper](https://arxiv.org/abs/1902.07370)
13. (STAR) **Segregated Temporal Assembly Recurrent Networks for Weakly Supervised Multiple Action Detection** (AAAI 2019) [paper](https://arxiv.org/abs/1811.07460)

## 2018
1. [Weakly Supervised Temporal Action Detection with Shot-Based Temporal Pooling Network](https://link.springer.com/chapter/10.1007/978-3-030-04212-7_37) (ICONIP 2018)
2. (W-TALC) [W-TALC: Weakly-supervised Temporal Activity Localization and Classification](https://arxiv.org/abs/1807.10418) (ECCV 2018) [code.PyTorch](https://github.com/sujoyp/wtalc-pytorch?utm_source=catalyzex.com)
3. (AutoLoc) [AutoLoc: Weakly-supervised Temporal Action Localization](https://arxiv.org/abs/1807.08333) (ECCV 2018) [code](https://github.com/zhengshou/AutoLoc?utm_source=catalyzex.com)
4. (STPN) [Weakly Supervised Action Localization by Sparse Temporal Pooling Network](https://arxiv.org/abs/1712.05080) (CVPR 2018) [code](https://github.com/demianzhang/weakly-action-localization?utm_source=catalyzex.com)
5. [Step-by-step Erasion, One-by-one Collection: A Weakly Supervised Temporal Action Detector](https://arxiv.org/abs/1807.02929) (ACM MM 2018)
6. (CPMN) [Cascaded Pyramid Mining Network for Weakly Supervised Temporal Action Localization](https://arxiv.org/abs/1810.11794) (ACCV 2018)

## 2017
1. (Hide-and-Seek) [Hide-and-Seek: Forcing a Network to be Meticulous for Weakly-supervised Object and Action Localization](https://arxiv.org/abs/1704.04232) (ICCV 2017)
2. (UntrimmedNets) [UntrimmedNets for Weakly Supervised Action Recognition and Detection](https://arxiv.org/abs/1703.03329) (CVPR 2017) [code](https://github.com/wanglimin/UntrimmedNet)

----

# **Papers: Online Action Detection**

## 2024
1. (JOADAA) [JOADAA: joint online action detection and action anticipation](https://arxiv.org/abs/2309.06130) (WACV 2024)
2. [Object Aware Egocentric Online Action Detection](https://arxiv.org/abs/2406.01079) (CVPRW 2024)
3. [ActionSwitch: Class-agnostic Detection of Simultaneous Actions in Streaming Videos](https://arxiv.org/abs/2407.12987) (ECCV 2024)
4. (MATR) [Online Temporal Action Localization with Memory-Augmented Transformer](https://arxiv.org/abs/2408.02957) (ECCV 2024) [code](https://skhcjh231.github.io/MATR_project/)
5. (HAT) [HAT: History-Augmented Anchor Transformer for Online Temporal Action Localization](https://arxiv.org/abs/2408.06437) (ECCV 2024) [code](https://github.com/sakibreza/ECCV24-HAT/)

## 2023
1. (recognition) (GliTr) [GliTr: Glimpse Transformers with Spatiotemporal Consistency for Online Action Prediction](https://arxiv.org/abs/2210.13605) (WACV 2023)
2. (E2E-LOAD) [E2E-LOAD: End-to-End Long-form Online Action Detection](https://openaccess.thecvf.com/content/ICCV2023/html/Cao_E2E-LOAD_End-to-End_Long-form_Online_Action_Detection_ICCV_2023_paper.html) (ICCV 2023) [code](https://github.com/sqiangcao99/E2E-LOAD)
3. (MiniROAD) [MiniROAD: Minimal RNN Framework for Online Action Detection](https://openaccess.thecvf.com/content/ICCV2023/html/An_MiniROAD_Minimal_RNN_Framework_for_Online_Action_Detection_ICCV_2023_paper.html) (ICCV 2023) [code](https://github.com/jbistanbul/MiniROAD)
4. (MAT) [Memory-and-Anticipation Transformer for Online Action Understanding](https://openaccess.thecvf.com/content/ICCV2023/html/Wang_Memory-and-Anticipation_Transformer_for_Online_Action_Understanding_ICCV_2023_paper.html) (ICCV 2023) [code](https://github.com/Echo0125/Memory-and-Anticipation-Transformer)
5. [Online Action Detection with Learning Future Representations by Contrastive Learning](https://ieeexplore.ieee.org/abstract/document/10220027) (ICME 2023)
6. (HCM) [HCM: Online Action Detection With Hard Video Clip Mining](https://ieeexplore.ieee.org/abstract/document/10246422) (TMM 2023)
7. (DFAformer) [DFAformer: A Dual Filtering Auxiliary Transformer for Efficient Online Action Detection in Streaming Videos](https://link.springer.com/chapter/10.1007/978-981-99-8537-1_11) (PRCV 2023)

## 2022
1. (Colar) [Colar: Effective and Efficient Online Action Detection by Consulting Exemplars](https://arxiv.org/abs/2203.01057) (CVPR 2022) [code](https://github.com/VividLe/Online-Action-Detection)
2. (GateHUB) [GateHUB: Gated History Unit with Background Suppression for Online Action Detection](https://arxiv.org/abs/2206.04668) (CVPR 2022)
3. [A Circular Window-based Cascade Transformer for Online Action Detection](https://arxiv.org/pdf/2208.14209.pdf) (TPAMI 2022)
4. (TeSTra) [Real-time Online Video Detection with Temporal Smoothing Transformers](https://arxiv.org/abs/2209.09236) (ECCV 2022) [code](https://github.com/zhaoyue-zephyrus/TeSTra)
5. (SimOn) [SimOn: A Simple Framework for Online Temporal Action Localization](https://arxiv.org/pdf/2211.04905.pdf) (arxiv 2022) [code](https://github.com/TuanTNG/SimOn)
6. (survey) [Online human action detection and anticipation in videos: A survey](https://www.sciencedirect.com/science/article/abs/pii/S0925231222003617?via%3Dihub)
7. [Uncertainty-Based Spatial-Temporal Attention for Online Action Detection](https://www.ecva.net/papers/eccv_2022/papers_ECCV/papers/136640068.pdf) (ECCV 2022)
8. (PPKD) [Privileged Knowledge Distillation for Online Action Detection](https://arxiv.org/abs/2011.09158) (PR 2022)

## 2021
1. (WOAD) [WOAD: Weakly Supervised Online Action Detection in Untrimmed Videos](https://openaccess.thecvf.com/content/CVPR2021/html/Gao_WOAD_Weakly_Supervised_Online_Action_Detection_in_Untrimmed_Videos_CVPR_2021_paper.html) (CVPR 2021)
2. (OadTR) [OadTR: Online Action Detection with Transformers](https://arxiv.org/abs/2106.11149) (ICCV 2021) [code](https://github.com/wangxiang1230/OadTR)
3. (LSTR) [Long Short-Term Transformer for Online Action Detection](https://arxiv.org/abs/2107.03377) (NeurIPS 2021) [code](https://github.com/amazon-science/long-short-term-transformer)
4. [pre awesome](https://github.com/wangxiang1230/Awesome-Online-Action-Detection)

----

# **Semi-Supervised**

## 2024
1. (APL) [Towards Adaptive Pseudo-label Learning for Semi-Supervised Temporal Action Localization](https://arxiv.org/abs/2407.07673) (ECCV 2024)

## 2023
1. (NPL) [Learning from Noisy Pseudo Labels for Semi-Supervised Temporal Action Localization](https://openaccess.thecvf.com/content/ICCV2023/html/Xia_Learning_from_Noisy_Pseudo_Labels_for_Semi-Supervised_Temporal_Action_Localization_ICCV_2023_paper.html) (ICCV 2023) [code](https://github.com/kunnxia/NPL)

## 2022
1. (AL-STAL) [Active Learning with Effective Scoring Functions for Semi-Supervised Temporal Action Localization](https://arxiv.org/abs/2208.14856) (Displays 2022)
2. (SPOT) [Semi-Supervised Temporal Action Detection with Proposal-Free Masking](https://arxiv.org/abs/2207.07059) (ECCV 2022) [code](https://github.com/sauradip/SPOT)

## 2021
1. (SSTAP) [Self-Supervised Learning for Semi-Supervised Temporal Action Proposal](https://arxiv.org/abs/2104.03214) (CVPR 2021) [code](https://github.com/wangxiang1230/SSTAP)
2. [Temporal Action Detection with Multi-level Supervision](https://openaccess.thecvf.com/content/ICCV2021/papers/Shi_Temporal_Action_Detection_With_Multi-Level_Supervision_ICCV_2021_paper.pdf) (ICCV 2021) [code](https://github.com/bfshi/SSAD_OSAD)
3. (KFC) [KFC: An Efficient Framework for Semi-Supervised Temporal Action Localization](https://ieeexplore.ieee.org/abstract/document/9500051) (TIP 2021)

## 2019
1. [Learning Temporal Action Proposals With Fewer Labels](https://arxiv.org/abs/1910.01286) (ICCV 2019)
2. (TTC-Loc) [Towards Train-Test Consistency for Semi-supervised Temporal Action Localization](https://arxiv.org/abs/1910.11285v3) (arxiv 2019)

----

# **Open-Vocabulary Temporal Action Detection**

## 2024
1. [One-Stage Open-Vocabulary Temporal Action Detection Leveraging Temporal Multi-scale and Action Label Features](https://arxiv.org/abs/2404.19542) (FG 2024)
2. [Open-Vocabulary Temporal Action Localization using Multimodal Guidance](https://arxiv.org/abs/2406.15556) (arXiv 2024)
3. (OV-TAL) [Exploring Scalability of Self-Training for Open-Vocabulary Temporal Action Localization](https://arxiv.org/abs/2407.07024) (arXiv 2024) [code](https://github.com/HYUNJS/STOV-TAL)
4. [Open-vocabulary Temporal Action Localization using VLMs](https://arxiv.org/abs/2408.17422) (arXiv 2024)
5. (OV-OAD) [Does Video-Text Pretraining Help Open-Vocabulary Online Action Detection?](https://nips.cc/virtual/2024/poster/95303) (NeurIPS 2024)

## 2023
1. (CELL) [Cascade Evidential Learning for Open-World Weakly-Supervised Temporal Action Localization](https://openaccess.thecvf.com/content/CVPR2023/html/Chen_Cascade_Evidential_Learning_for_Open-World_Weakly-Supervised_Temporal_Action_Localization_CVPR_2023_paper.html) (CVPR 2023)
2. (OW-TAL) [OW-TAL: Learning Unknown Human Activities for Open-World Temporal Action Localization](https://www.sciencedirect.com/science/article/pii/S0031320322005076) (PR 2023)

## 2022
1. [Open-Vocabulary Temporal Action Detection with Off-the-Shelf Image-Text Features](https://arxiv.org/pdf/2212.10596.pdf) (arxiv 2022)
2. (OpenTAL) [OpenTAL: Towards Open Set Temporal Action Localization](https://arxiv.org/pdf/2203.05114.pdf) (CVPR 2022) [code](https://www.rit.edu/actionlab/opental)