Ecosyste.ms: Awesome
An open API service indexing awesome lists of open source software.
Awesome-Temporal-Action-Detection-Temporal-Action-Proposal-Generation
Temporal Action Detection & Weakly Supervised Temporal Action Detection & Temporal Action Proposal Generation
https://github.com/zhenyingfang/Awesome-Temporal-Action-Detection-Temporal-Action-Proposal-Generation
Last synced: 5 days ago
JSON representation
-
2023
- TemporalMaxer: Maximize Temporal Context with only Max Pooling for Temporal Action Localization
- LPR: learning point-level temporal action localization through re-training
- DFAformer: A Dual Filtering Auxiliary Transformer for Efficient Online Action Detection in Streaming Videos
- OW-TAL: Learning Unknown Human Activities for Open-World Temporal Action Localization
- Boundary-Denoising for Video Activity Localization
- Movement Enhancement toward Multi-Scale Video Feature Representation for Temporal Action Detection
- Multi-Resolution Audio-Visual Feature Fusion for Temporal Action Localization
- Revisiting Foreground and Background Separation in Weakly-supervised Temporal Action Localization: A Clustering-based Approach - Liu/CASE)
- GliTr: Glimpse Transformers with Spatiotemporal Consistency for Online Action Prediction
- E2E-LOAD: End-to-End Long-form Online Action Detection - LOAD)
- MiniROAD: Minimal RNN Framework for Online Action Detection
- Memory-and-Anticipation Transformer for Online Action Understanding - and-Anticipation-Transformer)
- Online Action Detection with Learning Future Representations by Contrastive Learning
- HCM: Online Action Detection With Hard Video Clip Mining
- DFAformer: A Dual Filtering Auxiliary Transformer for Efficient Online Action Detection in Streaming Videos
- Learning from Noisy Pseudo Labels for Semi-Supervised Temporal Action Localization
- ContextLoc++: A Unified Context Model for Temporal Action Localization
- Temporal action detection with dynamic weights based on curriculum learning
- Post-Processing Temporal Action Detection
- TriDet: Temporal Action Detection with Relative Boundary Modeling
- ContextLoc++: A Unified Context Model for Temporal Action Localization
- Temporal action detection with dynamic weights based on curriculum learning
- Post-Processing Temporal Action Detection
- TriDet: Temporal Action Detection with Relative Boundary Modeling
- Temporal Action Localization with Enhanced Instant Discriminability
- TemporalMaxer: Maximize Temporal Context with only Max Pooling for Temporal Action Localization
- DiffTAD: Temporal Action Detection with Proposal Denoising Diffusion
- Decomposed Cross-modal Distillation for RGB-based Temporal Action Detection
- BasicTAD: an Astounding RGB-Only Baseline for Temporal Action Detection - NJU/BasicTAD)
- Re2TAL: Rewiring Pretrained Video Backbones for Reversible Temporal Action Localization
- Soft-Landing Strategy for Alleviating the Task Discrepancy Problem in Temporal Action Localization Tasks
- Progression-Guided Temporal Action Detection in Videos
- Action Sensitivity Learning for Temporal Action Localization
- A Multi-Modal Transformer Network for Action Detection
- Truncated attention-aware proposal networks with multi-scale dilation for temporal action detection
- A Multitemporal Scale and Spatial–Temporal Transformer Network for Temporal Action Localization - Machine Systems 2023)
- Exploring Action Centers for Temporal Action Localization
- ETAD: Training Action Detection End to End on a Laptop
- Video-Specific Query-Key Attention Modeling for Weakly-Supervised Temporal Action Localization
- Faster Learning of Temporal Action Proposal via Sparse Multilevel Boundary Generator - 001/SMBG-for-temporal-action-proposal)
- Action-aware Masking Network with Group-based Attention for Temporal Action Localization
- MIFNet: Multiple Instances Focused Temporal Action Proposal Generation
- Faster Learning of Temporal Action Proposal via Sparse Multilevel Boundary Generator - 001/SMBG-for-temporal-action-proposal)
- Action-aware Masking Network with Group-based Attention for Temporal Action Localization
- SADA: Semantic adversarial unsupervised domain adaptation for Temporal Action Localization
- A Novel Action Saliency and Context-Aware Network for Weakly-Supervised Temporal Action Localization
- Temporal Feature Enhancement Dilated Convolution Network for Weakly-Supervised Temporal Action Localization
- JCDNet: Joint of Common and Definite phases Network for Weakly Supervised Temporal Action Localization
- Proposal-Based Multiple Instance Learning for Weakly-Supervised Temporal Action Localization - MIL)
- Two-Stream Networks for Weakly-Supervised Temporal Action Localization With Semantic-Aware Mechanisms
- Boosting Weakly-Supervised Temporal Action Localization with Text Information - WTAL)
- PivoTAL: Prior-Driven Supervision for Weakly-Supervised Temporal Action Localization
- Improving Weakly Supervised Temporal Action Localization by Bridging Train-Test Gap in Pseudo Labels
- Memory-and-Anticipation Transformer for Online Action Understanding - and-Anticipation-Transformer)
- Online Action Detection with Learning Future Representations by Contrastive Learning
- HCM: Online Action Detection With Hard Video Clip Mining
- DFAformer: A Dual Filtering Auxiliary Transformer for Efficient Online Action Detection in Streaming Videos
- Learning from Noisy Pseudo Labels for Semi-Supervised Temporal Action Localization
- Cascade Evidential Learning for Open-World Weakly-Supervised Temporal Action Localization
- OW-TAL: Learning Unknown Human Activities for Open-World Temporal Action Localization
- Temporal Action Localization with Enhanced Instant Discriminability
- TemporalMaxer: Maximize Temporal Context with only Max Pooling for Temporal Action Localization
- DiffTAD: Temporal Action Detection with Proposal Denoising Diffusion
- Decomposed Cross-modal Distillation for RGB-based Temporal Action Detection
- Boundary-Denoising for Video Activity Localization
- Action Sensitivity Learning for Temporal Action Localization
- A Multi-Modal Transformer Network for Action Detection
- Truncated attention-aware proposal networks with multi-scale dilation for temporal action detection
- A Multitemporal Scale and Spatial–Temporal Transformer Network for Temporal Action Localization - Machine Systems 2023)
- Exploring Action Centers for Temporal Action Localization
- ETAD: Training Action Detection End to End on a Laptop
- Re2TAL: Rewiring Pretrained Video Backbones for Reversible Temporal Action Localization
- Soft-Landing Strategy for Alleviating the Task Discrepancy Problem in Temporal Action Localization Tasks
- Progression-Guided Temporal Action Detection in Videos
- Self-Feedback DETR for Temporal Action Detection
- UnLoc: A Unified Framework for Video Localization Tasks - research/scenic)
- Benchmarking Data Efficiency and Computational Efficiency of Temporal Action Localization Models
- Boundary-Aware Proposal Generation Method for Temporal Action Localization
- Movement Enhancement toward Multi-Scale Video Feature Representation for Temporal Action Detection
- Boundary Discretization and Reliable Classification Network for Temporal Action Detection - Net)
- Dual-Feature Enhancement for Weakly Supervised Temporal Action Localization
- Collaborative Foreground, Background, and Action Modeling Network for Weakly Supervised Temporal Action Localization
- Weakly Supervised Temporal Action Localization With Bidirectional Semantic Consistency Constraint
- Feature Weakening, Contextualization, and Discrimination for Weakly Supervised Temporal Action Localization - Net/)
- Learning Proposal-aware Re-ranking for Weakly-supervised Temporal Action Localization
- Semantic and Temporal Contextual Correlation Learning for Weakly-Supervised Temporal Action Localization
- Distilling Vision-Language Pre-training to Collaborate with Weakly-Supervised Temporal Action Localization
- Weakly-Supervised Action Localization by Hierarchically-structured Latent Attention Modeling
- Cross-Video Contextual Knowledge Exploration and Exploitation for Ambiguity Reduction in Weakly Supervised Temporal Action Localization
- Sub-action Prototype Learning for Point-level Weakly-supervised Temporal Action Localization
- DDG-Net: Discriminability-Driven Graph Network for Weakly-supervised Temporal Action Localization - DDGNet)
- Proposal-based Temporal Action Localization with Point-level Supervision
- LPR: learning point-level temporal action localization through re-training
- POTLoc: Pseudo-Label Oriented Transformer for Point-Supervised Temporal Action Localization
- ADM-Loc: Actionness Distribution Modeling for Point-supervised Temporal Action Localization
- Self-Feedback DETR for Temporal Action Detection
- UnLoc: A Unified Framework for Video Localization Tasks - research/scenic)
- Benchmarking Data Efficiency and Computational Efficiency of Temporal Action Localization Models
- Boundary-Aware Proposal Generation Method for Temporal Action Localization
- Boundary Discretization and Reliable Classification Network for Temporal Action Detection - Net)
- SADA: Semantic adversarial unsupervised domain adaptation for Temporal Action Localization
- A Novel Action Saliency and Context-Aware Network for Weakly-Supervised Temporal Action Localization
- Temporal Feature Enhancement Dilated Convolution Network for Weakly-Supervised Temporal Action Localization
- JCDNet: Joint of Common and Definite phases Network for Weakly Supervised Temporal Action Localization
- Proposal-Based Multiple Instance Learning for Weakly-Supervised Temporal Action Localization - MIL)
- Two-Stream Networks for Weakly-Supervised Temporal Action Localization With Semantic-Aware Mechanisms
- Boosting Weakly-Supervised Temporal Action Localization with Text Information - WTAL)
- PivoTAL: Prior-Driven Supervision for Weakly-Supervised Temporal Action Localization
- Improving Weakly Supervised Temporal Action Localization by Bridging Train-Test Gap in Pseudo Labels
- Video-Specific Query-Key Attention Modeling for Weakly-Supervised Temporal Action Localization
- Dual-Feature Enhancement for Weakly Supervised Temporal Action Localization
- Collaborative Foreground, Background, and Action Modeling Network for Weakly Supervised Temporal Action Localization
- Weakly Supervised Temporal Action Localization With Bidirectional Semantic Consistency Constraint
- Feature Weakening, Contextualization, and Discrimination for Weakly Supervised Temporal Action Localization - Net/)
- Learning Proposal-aware Re-ranking for Weakly-supervised Temporal Action Localization
- Semantic and Temporal Contextual Correlation Learning for Weakly-Supervised Temporal Action Localization
- Distilling Vision-Language Pre-training to Collaborate with Weakly-Supervised Temporal Action Localization
- Weakly-Supervised Action Localization by Hierarchically-structured Latent Attention Modeling
- Cross-Video Contextual Knowledge Exploration and Exploitation for Ambiguity Reduction in Weakly Supervised Temporal Action Localization
- Sub-action Prototype Learning for Point-level Weakly-supervised Temporal Action Localization
- DDG-Net: Discriminability-Driven Graph Network for Weakly-supervised Temporal Action Localization - DDGNet)
- Proposal-based Temporal Action Localization with Point-level Supervision
- LPR: learning point-level temporal action localization through re-training
- POTLoc: Pseudo-Label Oriented Transformer for Point-Supervised Temporal Action Localization
- ADM-Loc: Actionness Distribution Modeling for Point-supervised Temporal Action Localization
- Revisiting Foreground and Background Separation in Weakly-supervised Temporal Action Localization: A Clustering-based Approach - Liu/CASE)
- GliTr: Glimpse Transformers with Spatiotemporal Consistency for Online Action Prediction
- E2E-LOAD: End-to-End Long-form Online Action Detection - LOAD)
- MiniROAD: Minimal RNN Framework for Online Action Detection
- Cascade Evidential Learning for Open-World Weakly-Supervised Temporal Action Localization
- OW-TAL: Learning Unknown Human Activities for Open-World Temporal Action Localization
-
2022
- TVNet: Temporal Voting Network for Action Localization
- ACGNet: Action Complement Graph Network for Weakly-supervised Temporal Action Localization
- Submission to Generic Event Boundary Detection Challenge@CVPR 2022: Local Context Modeling and Global Boundary Decoding Approach
- Re2TAL: Rewiring Pretrained Video Backbones for Reversible Temporal Action Localization
- Weakly Supervised Temporal Action Localization via Representative Snippet Knowledge Propagation
- Dual-Evidential Learning for Weakly-supervised Temporal Action Localization - DELU)
- A Circular Window-based Cascade Transformer for Online Action Detection
- SimOn: A Simple Framework for Online Temporal Action Localization
- Open-Vocabulary Temporal Action Detection with Off-the-Shelf ImageText Features
- Multi-Modal Few-Shot Temporal Action Detection
- RCL: Recurrent Continuous Localization for Temporal Action Detection
- Learning to Refactor Action and Co-occurrence Features for Temporal Action Localization
- MS-TCT: Multi-Scale Temporal ConvTransformer for Action Detection - TCT)
- One-stage Action Detection Transformer - 100 2022 V. 26.35 N. 25.83)
- Context-aware Proposal Network for Temporal Action Detection - 2022 ActivityNet Challenge winning solution)
- Dual relation network for temporal action localization
- Learning Disentangled Classification and Localization Representations for Temporal Action Localization
- Progressive Attention on Multi-Level Dense Difference Maps for Generic Event Boundary Detection - NJU/DDM)
- Submission to Generic Event Boundary Detection Challenge@CVPR 2022: Local Context Modeling and Global Boundary Decoding Approach
- HTNet: Anchor-free Temporal Action Localization with Hierarchical Transformers
- An Efficient Spatio-Temporal Pyramid Transformer for Action Detection
- Proposal-Free Temporal Action Detection with Global Segmentation Mask Learning
- Prompting Visual-Language Models for Efficient Video Understanding - chen/Efficient-Prompt)
- ReAct: Temporal Action Detection with Relational Queries
- End-to-end Temporal Action Detection with Transformer
- Temporal Action Localization with Multi-temporal Scales
- Adaptive Perception Transformer for Temporal Action Localization
- PointTAD: Multi-Label Temporal Action Detection with Learnable Query Points - NJU/PointTAD) (multi action detection, eg: multiTHUMOS, charades)
- Soft-Landing Strategy for Alleviating the Task Discrepancy Problem in Temporal Action Localization Tasks
- Re2TAL: Rewiring Pretrained Video Backbones for Reversible Temporal Action Localization
- Deep Learning-Based Action Detection in Untrimmed Videos: A Survey
- ACGNet: Action Complement Graph Network for Weakly-supervised Temporal Action Localization
- Weakly Supervised Temporal Action Localization via Representative Snippet Knowledge Propagation
- ASM-Loc: Action-aware Segment Modeling for Weakly-Supervised Temporal Action Localization - Loc)
- Fine-grained Temporal Contrastive Learning for Weakly-supervised Temporal Action Localization - FTCL)
- Convex Combination Consistency between Neighbors for Weakly-supervised Action Localization
- Exploring Denoised Cross-video Contrast for Weakly-supervised Temporal Action Localization
- Weakly-Supervised Temporal Action Detection for Fine-Grained Videos with Hierarchical Atomic Actions
- Zero-Shot Temporal Action Detection via Vision-Language Prompting
- Slow Motion Matters: A Slow Motion Enhanced Network for Weakly Supervised Temporal Action Localization
- Dilation-Erosion for Single-Frame Supervised Temporal Action Localization
- Colar: Effective and Efficient Online Action Detection by Consulting Exemplars - Action-Detection)
- Adaptive Mutual Supervision for Weakly-Supervised Temporal Action Localization
- Dual-Evidential Learning for Weakly-supervised Temporal Action Localization - DELU)
- GateHUB: Gated History Unit with Background Suppression for Online Action Detection
- A Circular Window-based Cascade Transformer for Online Action Detection
- Real-time Online Video Detection with Temporal Smoothing Transformers - zephyrus/TeSTra)
- Temporal Action Localization with Multi-temporal Scales
- Adaptive Perception Transformer for Temporal Action Localization
- PointTAD: Multi-Label Temporal Action Detection with Learnable Query Points - NJU/PointTAD) (multi action detection, eg: multiTHUMOS, charades)
- Soft-Landing Strategy for Alleviating the Task Discrepancy Problem in Temporal Action Localization Tasks
- Re2TAL: Rewiring Pretrained Video Backbones for Reversible Temporal Action Localization
- Multi-Modal Few-Shot Temporal Action Detection
- Deep Learning-Based Action Detection in Untrimmed Videos: A Survey
- ACGNet: Action Complement Graph Network for Weakly-supervised Temporal Action Localization
- Weakly Supervised Temporal Action Localization via Representative Snippet Knowledge Propagation
- ASM-Loc: Action-aware Segment Modeling for Weakly-Supervised Temporal Action Localization - Loc)
- Fine-grained Temporal Contrastive Learning for Weakly-supervised Temporal Action Localization - FTCL)
- Convex Combination Consistency between Neighbors for Weakly-supervised Action Localization
- Exploring Denoised Cross-video Contrast for Weakly-supervised Temporal Action Localization
- Weakly-Supervised Temporal Action Detection for Fine-Grained Videos with Hierarchical Atomic Actions
- Zero-Shot Temporal Action Detection via Vision-Language Prompting
- Slow Motion Matters: A Slow Motion Enhanced Network for Weakly Supervised Temporal Action Localization
- Dilation-Erosion for Single-Frame Supervised Temporal Action Localization
- Context-aware Proposal Network for Temporal Action Detection - 2022 ActivityNet Challenge winning solution)
- Dual relation network for temporal action localization
- Learning Disentangled Classification and Localization Representations for Temporal Action Localization
- Progressive Attention on Multi-Level Dense Difference Maps for Generic Event Boundary Detection - NJU/DDM)
- Submission to Generic Event Boundary Detection Challenge@CVPR 2022: Local Context Modeling and Global Boundary Decoding Approach
- HTNet: Anchor-free Temporal Action Localization with Hierarchical Transformers
- An Efficient Spatio-Temporal Pyramid Transformer for Action Detection
- Proposal-Free Temporal Action Detection with Global Segmentation Mask Learning
- Prompting Visual-Language Models for Efficient Video Understanding - chen/Efficient-Prompt)
- ReAct: Temporal Action Detection with Relational Queries
- End-to-end Temporal Action Detection with Transformer
- Active Learning with Effective Scoring Functions for Semi-Supervised Temporal Action Localization
- Semi-Supervised Temporal Action Detection with Proposal-Free Masking
- Open-Vocabulary Temporal Action Detection with Off-the-Shelf ImageText Features
- Temporal Action Proposal Generation with Background Constraint
- Pyramid Region-based Slot Attention Network for Temporal Action Proposal Generation - Net)
- Modeling long-term video semantic distribution for temporal action proposal generation
- AOE-Net: Entities Interactions Modeling with Adaptive Attention Mechanism for Temporal Action Proposals Generation
- DCAN: Improving Temporal Action Detection via Dual Context Aggregation
- TVNet: Temporal Voting Network for Action Localization
- ActionFormer: Localizing Moments of Actions with Transformers
- Temporal Action Proposal Generation with Background Constraint
- Pyramid Region-based Slot Attention Network for Temporal Action Proposal Generation - Net)
- Modeling long-term video semantic distribution for temporal action proposal generation
- AOE-Net: Entities Interactions Modeling with Adaptive Attention Mechanism for Temporal Action Proposals Generation
- DCAN: Improving Temporal Action Detection via Dual Context Aggregation
- TVNet: Temporal Voting Network for Action Localization
- ActionFormer: Localizing Moments of Actions with Transformers
- SegTAD: Precise Temporal Action Detection via Semantic Segmentation
- OpenTAL: Towards Open Set Temporal Action Localization
- TALLFormer: Temporal Action Localization with Long-memory Transformer
- An Empirical Study of End-to-End Temporal Action Detection - TAD)
- Estimation of Reliable Proposal Quality for Temporal Action Detection
- Structured Attention Composition for Temporal Action Localization - Attention-Composition)
- RCL: Recurrent Continuous Localization for Temporal Action Detection
- Learning to Refactor Action and Co-occurrence Features for Temporal Action Localization
- MS-TCT: Multi-Scale Temporal ConvTransformer for Action Detection - TCT)
- One-stage Action Detection Transformer - 100 2022 V. 26.35 N. 25.83)
- Uncertainty-Based Spatial-Temporal Attention for Online Action Detection
- Active Learning with Effective Scoring Functions for Semi-Supervised Temporal Action Localization
- Semi-Supervised Temporal Action Detection with Proposal-Free Masking
- Open-Vocabulary Temporal Action Detection with Off-the-Shelf ImageText Features
- SegTAD: Precise Temporal Action Detection via Semantic Segmentation
- OpenTAL: Towards Open Set Temporal Action Localization
- TALLFormer: Temporal Action Localization with Long-memory Transformer
- An Empirical Study of End-to-End Temporal Action Detection - TAD)
- Estimation of Reliable Proposal Quality for Temporal Action Detection
- Structured Attention Composition for Temporal Action Localization - Attention-Composition)
- Adaptive Mutual Supervision for Weakly-Supervised Temporal Action Localization
- Dual-Evidential Learning for Weakly-supervised Temporal Action Localization - DELU)
- Colar: Effective and Efficient Online Action Detection by Consulting Exemplars - Action-Detection)
- GateHUB: Gated History Unit with Background Suppression for Online Action Detection
- A Circular Window-based Cascade Transformer for Online Action Detection
- Real-time Online Video Detection with Temporal Smoothing Transformers - zephyrus/TeSTra)
- SimOn: A Simple Framework for Online Temporal Action Localization
- Online human action detection and anticipation in videos: A survey
- SimOn: A Simple Framework for Online Temporal Action Localization
- Online human action detection and anticipation in videos: A survey
- Uncertainty-Based Spatial-Temporal Attention for Online Action Detection
-
2021
- Hear Me Out: Fusional Approaches for Audio Augmented Temporal Action Localization - hmo)
- FineAction: A Fined Video Dataset for Temporal Action Localization
- Three Birds with One Stone: Multi-Task Temporal Action Detection via Recycling Temporal Annotations
- Proposal Relation Network for Temporal Action Detection
- Exploring Stronger Feature for Temporal Action Localization
- Coarse-Fine Networks for Temporal Activity Detection in Videos - Fine-Networks)
- Three Birds with One Stone: Multi-Task Temporal Action Detection via Recycling Temporal Annotations
- Proposal Relation Network for Temporal Action Detection
- Exploring Stronger Feature for Temporal Action Localization
- BSN++: Complementary Boundary Regressor with Scale-Balanced RelationModeling for Temporal Action Proposal Generation
- Relaxed Transformer Decoders for Direct Action Proposal Generation - NJU/RTD-Action) [Zhihu](https://zhuanlan.zhihu.com/p/363133304)
- Temporal Context Aggregation Network for Temporal Action Proposal Refinement
- Augmented Transformer with Adaptive Graph for Temporal Action Proposal Generation
- Temporal Action Proposal Generation with Transformers
- Agent-Environment Network for Temporal Action Proposal Generation
- AEI: Actors-Environment Interaction with Adaptive Attention for Temporal Action Proposals Generation - AgentEnvInteration)
- Activity Graph Transformer for Temporal Action Localization
- Learning Salient Boundary Feature for Anchor-free Temporal Action Localization - AFSD?utm_source=catalyzex.com)
- Low-Fidelity End-to-End Video Encoder Pre-training for Temporal Action Localization
- Read and Attend: Temporal Localisation in Sign Language Videos
- BSN++: Complementary Boundary Regressor with Scale-Balanced RelationModeling for Temporal Action Proposal Generation
- Relaxed Transformer Decoders for Direct Action Proposal Generation - NJU/RTD-Action) [Zhihu](https://zhuanlan.zhihu.com/p/363133304)
- Temporal Context Aggregation Network for Temporal Action Proposal Refinement
- Augmented Transformer with Adaptive Graph for Temporal Action Proposal Generation
- Temporal Action Proposal Generation with Transformers
- Agent-Environment Network for Temporal Action Proposal Generation
- AEI: Actors-Environment Interaction with Adaptive Attention for Temporal Action Proposals Generation - AgentEnvInteration)
- Activity Graph Transformer for Temporal Action Localization
- Coarse-Fine Networks for Temporal Activity Detection in Videos - Fine-Networks)
- Modeling Multi-Label Action Dependencies for Temporal Action Localization
- Enriching Local and Global Contexts for Temporal Action Localization
- Class Semantics-based Attention for Action Detection
- Video Self-Stitching Graph Network for Temporal Action Localization
- Multi-shot Temporal Event Localization: a Benchmark
- A Hybrid Attention Mechanism for Weakly-Supervised Temporal Action Localization
- Cross-Attentional Audio-Visual Fusion for Weakly-Supervised Action Localization
- The Blessings of Unlabeled Background in Untrimmed Videos - Blessings-of-Unlabeled-Background-in-Untrimmed-Videos)
- ACSNet: Action-Context Separation Network for Weakly Supervised Temporal Action Localization
- CoLA: Weakly-Supervised Temporal Action Localization with Snippet Contrastive Learning
- Background-Click Supervision for Temporal Action Localization
- Action Coherence Network for Weakly-Supervised Temporal Action Localization
- WOAD: Weakly Supervised Online Action Detection in Untrimmed Videos
- OadTR: Online Action Detection with Transformers
- Long Short-Term Transformer for Online Action Detection - science/long-short-term-transformer)
- pre awesome
- Self-Supervised Learning for Semi-Supervised Temporal Action Proposal
- Temporal Action Detection with Multi-level Supervision
- Low-Fidelity End-to-End Video Encoder Pre-training for Temporal Action Localization
- Read and Attend: Temporal Localisation in Sign Language Videos
- Towards High-Quality Temporal Action Detection with Sparse Proposals - TAD)
- Few-Shot Temporal Action Localization with Query Adaptive Transformer - Shot)
- Graph Convolutional Module for Temporal Action Localization in Videos
- MS-TCT: Multi-Scale Temporal ConvTransformer for Action Detection
- Multi-shot Temporal Event Localization: a Benchmark
- A Hybrid Attention Mechanism for Weakly-Supervised Temporal Action Localization
- Cross-Attentional Audio-Visual Fusion for Weakly-Supervised Action Localization
- Weakly-supervised Temporal Action Localization by Uncertainty Modeling - Uncertainty-Modeling)
- The Blessings of Unlabeled Background in Untrimmed Videos - Blessings-of-Unlabeled-Background-in-Untrimmed-Videos)
- ACSNet: Action-Context Separation Network for Weakly Supervised Temporal Action Localization
- CoLA: Weakly-Supervised Temporal Action Localization with Snippet Contrastive Learning
- Weakly Supervised Temporal Action Localization Through Learning Explicit Subspaces for Action and Context
- ACM-Net: Action Context Modeling Network for Weakly-Supervised Temporal Action Localization - lab/ACM-Net)
- Action Unit Memory Network for Weakly Supervised Temporal Action Localization
- Weakly Supervised Action Selection Learning in Video
- Action Shuffling for Weakly Supervised Temporal Localization
- Few-Shot Action Localization without Knowing Boundaries
- Uncertainty Guided Collaborative Training for Weakly Supervised Temporal Action Detection
- Weakly-Supervised Temporal Action Localization Through Local-Global Background Modeling
- Cross-modal Consensus Network for Weakly Supervised Temporal Action Localization - CO2-Net)
- Learning Action Completeness from Points for Weakly-supervised Temporal Action Localization - Action-Completeness-from-Points)
- Deep Motion Prior for Weakly-Supervised Temporal Action Localization - net?authuser=0)
- Foreground-Action Consistency Network for Weakly Supervised Temporal Action Localization
- Modeling Multi-Label Action Dependencies for Temporal Action Localization
- PcmNet: Position-Sensitive Context Modeling Network for Temporal Action Localization
- Background-Click Supervision for Temporal Action Localization
- Action Coherence Network for Weakly-Supervised Temporal Action Localization
- WOAD: Weakly Supervised Online Action Detection in Untrimmed Videos
- OadTR: Online Action Detection with Transformers
- Long Short-Term Transformer for Online Action Detection - science/long-short-term-transformer)
- pre awesome
- Self-Supervised Learning for Semi-Supervised Temporal Action Proposal
- Temporal Action Detection with Multi-level Supervision
- KFC: An Efficient Framework for Semi-Supervised Temporal Action Localization
- Low Pass Filter for Anti-aliasing in Temporal Action Localization
- Class Semantics-based Attention for Action Detection
- Towards High-Quality Temporal Action Detection with Sparse Proposals - TAD)
- Few-Shot Temporal Action Localization with Query Adaptive Transformer - Shot)
- Graph Convolutional Module for Temporal Action Localization in Videos
- MS-TCT: Multi-Scale Temporal ConvTransformer for Action Detection
- Video Self-Stitching Graph Network for Temporal Action Localization
- Weakly-Supervised Temporal Action Localization Through Local-Global Background Modeling
- Cross-modal Consensus Network for Weakly Supervised Temporal Action Localization - CO2-Net)
- Learning Action Completeness from Points for Weakly-supervised Temporal Action Localization - Action-Completeness-from-Points)
- Deep Motion Prior for Weakly-Supervised Temporal Action Localization - net?authuser=0)
- Foreground-Action Consistency Network for Weakly Supervised Temporal Action Localization
- KFC: An Efficient Framework for Semi-Supervised Temporal Action Localization
- SRF-Net: Selective Receptive Field Network for Anchor-Free Temporal Action Detection
- RGB Stream Is Enough for Temporal Action Detection - Smart/vedatad?utm_source=catalyzex.com)
- Hear Me Out: Fusional Approaches for Audio Augmented Temporal Action Localization - hmo)
- Transferable Knowledge-Based Multi-Granularity Aggregation Network for Temporal Action Localization: Submission to ActivityNet Challenge 2021
- Two-Stream Consensus Network: Submission to HACS Challenge 2021Weakly-Supervised Learning Track
- SRF-Net: Selective Receptive Field Network for Anchor-Free Temporal Action Detection
- RGB Stream Is Enough for Temporal Action Detection - Smart/vedatad?utm_source=catalyzex.com)
- Hear Me Out: Fusional Approaches for Audio Augmented Temporal Action Localization - hmo)
- Transferable Knowledge-Based Multi-Granularity Aggregation Network for Temporal Action Localization: Submission to ActivityNet Challenge 2021
- Enriching Local and Global Contexts for Temporal Action Localization
- Weakly Supervised Temporal Action Localization Through Learning Explicit Subspaces for Action and Context
- ACM-Net: Action Context Modeling Network for Weakly-Supervised Temporal Action Localization - lab/ACM-Net)
- Action Unit Memory Network for Weakly Supervised Temporal Action Localization
- Weakly Supervised Action Selection Learning in Video
- Action Shuffling for Weakly Supervised Temporal Localization
- Few-Shot Action Localization without Knowing Boundaries
- Uncertainty Guided Collaborative Training for Weakly Supervised Temporal Action Detection
-
2020
- paper - Modeling-via-Uncertainty-Estimation)
- paper
- paper
- Temporal Action Localization with Variance-Aware Networks
- Boundary Uncertainty in a Single-Stage Temporal Action Localization Network
- Revisiting Anchor Mechanisms for Temporal Action Localization
- Deep Concept-wise Temporal Convolutional Networks for Action Localization
- Multi-Level Temporal Pyramid Network for Action Detection
- SALAD: Self-Assessment Learning for Action Detection
- paper
- paper
- link
- pre-paper 2019 ActivityNet task-1 2nd
- paper - DBG)
- paper
- TSI: Temporal Scale Invariant Network for Action Proposal Generation
- paper
- paper
- paper
- paper
- Transferable Knowledge-Based Multi-Granularity Fusion Network for Weakly Supervised Temporal Action Detection
- paper - DBG)
- paper
- Bottom-Up Temporal Action Localization with Mutual Regularization - Up-TAL-with-MR)
- TSI: Temporal Scale Invariant Network for Action Proposal Generation
- paper
- paper
- link
- pre-paper 2019 ActivityNet task-1 2nd
- paper
- paper
- paper
- Temporal Action Localization with Variance-Aware Networks
- Boundary Uncertainty in a Single-Stage Temporal Action Localization Network
- Revisiting Anchor Mechanisms for Temporal Action Localization
- Deep Concept-wise Temporal Convolutional Networks for Action Localization
- Multi-Level Temporal Pyramid Network for Action Detection
- SALAD: Self-Assessment Learning for Action Detection
- paper
- paper
- paper
- paper - Weakly-Supervised-Action-Localization)
- paper
- paper
- paper - pytorch)
- paper - PT)
- paper
- Two-Stream Consensus Network for Weakly-Supervised Temporal Action Localization
- Learning Temporal Co-Attention Models for Unsupervised Video Action Localization
- D2-Net: Weakly-Supervised Action Localization via Discriminative Embeddingsand Denoised Activations
- SF-Net: Single-Frame Supervision for Temporal Action Localization - Net)
- Point-Level Temporal Action Localization: Bridging Fully-supervised Proposals to Weakly-supervised Losses
- ActionBytes: Learning From Trimmed Videos to Localize Actions
- paper
- paper - Weakly-Supervised-Action-Localization)
- paper
- paper - pytorch)
- paper - PT)
- paper
- Two-Stream Consensus Network for Weakly-Supervised Temporal Action Localization
- Learning Temporal Co-Attention Models for Unsupervised Video Action Localization
- D2-Net: Weakly-Supervised Action Localization via Discriminative Embeddingsand Denoised Activations
- SF-Net: Single-Frame Supervision for Temporal Action Localization - Net)
- Point-Level Temporal Action Localization: Bridging Fully-supervised Proposals to Weakly-supervised Losses
- Transferable Knowledge-Based Multi-Granularity Fusion Network for Weakly Supervised Temporal Action Detection
- ActionBytes: Learning From Trimmed Videos to Localize Actions
-
2019
- paper
- paper - SSAD)
- paper - icml19)
- paper
- paper - Zeng/PGCN)
- paper
- paper - TAN)
- paper
- paper
- paper
- paper
- paper
- paper
- paper
- paper - Boundary-Matching-Network)
- paper
- paper
- paper
- paper
- paper
- Joint Learning of Local and Global Context for Temporal Action Proposal Generation
- paper
- paper
- paper
- paper
- paper - 2019-Weakly-Supervised-Temporal-Localization-via-Occurrence-Count-Learning)
- paper
- paper
- paper
- paper - 2019-Weakly-Supervised-Temporal-Localization-via-Occurrence-Count-Learning)
- paper
- paper
- paper
- paper
- paper - net)
- paper - Temporal-Action-Localization)
- paper
- paper
- paper
- paper
- Learning Temporal Action Proposals With Fewer Labels
- Towards Train-Test Consistency for Semi-supervised Temporal Action Localization
- paper
- paper - Boundary-Matching-Network)
- paper
- paper
- paper
- paper
- paper
- Joint Learning of Local and Global Context for Temporal Action Proposal Generation
- paper
- paper - icml19)
- paper - SSAD)
- paper
- paper - Zeng/PGCN)
- paper
- paper - TAN)
- paper
- paper
- paper
- paper
- paper
- paper
- paper
- paper
- paper
-
2018
- paper
- paper - boundary-sensitive-network) [code.PyTorch](https://github.com/wzmsltw/BSN-boundary-sensitive-network.pytorch)
- paper
- paper
- paper
- paper - search)
- paper
- paper
- paper
- paper
- paper
- Weakly Supervised Temporal Action Detection with Shot-Based Temporal Pooling Network
- W-TALC: Weakly-supervised Temporal Activity Localization and Classification - pytorch?utm_source=catalyzex.com)
- AutoLoc: Weakly-supervised Temporal Action Localization
- Weakly Supervised Action Localization by Sparse Temporal Pooling Network - action-localization?utm_source=catalyzex.com)
- Step-by-step Erasion, One-by-one Collection: A Weakly Supervised Temporal Action Detector
- Cascaded Pyramid Mining Network for Weakly Supervised Temporal Action Localization
-
2017
- paper - TAP)
- paper - b/sst/) [code.TensorFlow](https://github.com/JaywongWang/SST-Tensorflow)
- paper
- paper - detection)
- paper
- paper - detection)
- paper - C3D) [code.PyTorch](https://github.com/sunnyxiaohu/R-C3D.pytorch)
- paper
- paper - max-sum)
- paper
- paper
- paper - b/ss-tad/)
- paper
- paper
- UntrimmedNets for Weakly Supervised Action Recognition and Detection
-
before
-
2024
- Boundary Denoising for Video Activity Localization
- LITA: Language Instructed Temporal-Localization Assistant
- PLOT-TAL -- Prompt Learning with Optimal Transport for Few-Shot Temporal Action Localization
- Benchmarking the Robustness of Temporal Action Detection Models Against Temporal Corruptions - Zeng/temporal-robustness-benchmark)
- Test-Time Zero-Shot Temporal Action Localization
- UniMD: Towards Unifying Moment Retrieval and Temporal Action Detection
- Adapting Short-Term Transformers for Action Detection in Untrimmed Videos
- End-to-End Temporal Action Detection with 1B Parameters Across 1000 Frames
- Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding - mamba-suite)
- TE-TAD: Towards Full End-to-End Temporal Action Detection via Time-Aligned Coordinate Expression - HJ/TE-TAD)
- Action Detection via an Image Diffusion Process
- Dual DETRs for Multi-Label Temporal Action Detection - NJU/DualDETR)
- An Effective-Efficient Approach for Dense Multi-Label Action Detection
- End-to-End Spatio-Temporal Action Localisation with Video Transformers
- DyFADet: Dynamic Feature Aggregation for Temporal Action Detection - pytorch)
- Harnessing Temporal Causality for Advanced Temporal Action Detection
- Long-Term Pre-training for Temporal Action Detection with Transformers
- Prediction-Feedback DETR for Temporal Action Detection
- Introducing Gating and Context into Temporal Action Detection
- ContextDet: Temporal Action Detection with Adaptive Context Aggregation
- Weakly-Supervised Temporal Action Localization by Inferring Snippet-Feature Affinity
- HR-Pro: Point-supervised Temporal Action Localization via Hierarchical Reliability Propagation - Pro)
- STAT: Towards Generalizable Temporal Action Localization
- Realigning Confidence with Temporal Saliency Information for Point-level Weakly-Supervised Temporal Action Localization - TSPNet)
- Weakly-Supervised Temporal Action Localization with Multi-Modal Plateau Transformers
- Ensemble Prototype Network For Weakly Supervised Temporal Action Localization
- Full-Stage Pseudo Label Quality Enhancement for Weakly-supervised Temporal Action Localization
- Probabilistic Vision-Language Representation for Weakly Supervised Temporal Action Localization - rcv/PVLR)
- Towards Completeness: A Generalizable Action Proposal Generator for Zero-Shot Temporal Action Localization
- Stepwise Multi-grained Boundary Detector for Point-supervised Temporal Action Localization
- Zero-shot Action Localization via the Confidence of Large Vision-Language Models
- JOADAA: joint online action detection and action anticipation
- Object Aware Egocentric Online Action Detection
- ActionSwitch: Class-agnostic Detection of Simultaneous Actions in Streaming Videos
- Online Temporal Action Localization with Memory-Augmented Transformer
- HAT: History-Augmented Anchor Transformer for Online Temporal Action Localization - HAT/)
- Towards Adaptive Pseudo-label Learning for Semi-Supervised Temporal Action Localization
- One-Stage Open-Vocabulary Temporal Action Detection Leveraging Temporal Multi-scale and Action Label Features
- Open-Vocabulary Temporal Action Localization using Multimodal Guidance
- Exploring Scalability of Self-Training for Open-Vocabulary Temporal Action Localization - TAL)
- Open-vocabulary Temporal Action Localization using VLMs
- Does Video-Text Pretraining Help Open-Vocabulary Online Action Detection?