TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Fast Learning of Temporal Action Proposal via Dense Bounda...

Fast Learning of Temporal Action Proposal via Dense Boundary Generator

Chuming Lin, Jian Li, Yabiao Wang, Ying Tai, Donghao Luo, Zhipeng Cui, Chengjie Wang, Jilin Li, Feiyue Huang, Rongrong Ji

2019-11-11regressionOptical Flow EstimationGeneral ClassificationTemporal Action Localization
PaperPDFCodeCodeCode

Abstract

Generating temporal action proposals remains a very challenging problem, where the main issue lies in predicting precise temporal proposal boundaries and reliable action confidence in long and untrimmed real-world videos. In this paper, we propose an efficient and unified framework to generate temporal action proposals named Dense Boundary Generator (DBG), which draws inspiration from boundary-sensitive methods and implements boundary classification and action completeness regression for densely distributed proposals. In particular, the DBG consists of two modules: Temporal boundary classification (TBC) and Action-aware completeness regression (ACR). The TBC aims to provide two temporal boundary confidence maps by low-level two-stream features, while the ACR is designed to generate an action completeness score map by high-level action-aware features. Moreover, we introduce a dual stream BaseNet (DSB) to encode RGB and optical flow information, which helps to capture discriminative boundary and actionness features. Extensive experiments on popular benchmarks ActivityNet-1.3 and THUMOS14 demonstrate the superiority of DBG over the state-of-the-art proposal generator (e.g., MGG and BMN). Our code will be made available upon publication.

Results

TaskDatasetMetricValueModel
VideoFineActionmAP6.75DBG (i3d feature)
VideoFineActionmAP IOU@0.510.65DBG (i3d feature)
VideoFineActionmAP IOU@0.756.43DBG (i3d feature)
VideoFineActionmAP IOU@0.952.5DBG (i3d feature)
Temporal Action LocalizationFineActionmAP6.75DBG (i3d feature)
Temporal Action LocalizationFineActionmAP IOU@0.510.65DBG (i3d feature)
Temporal Action LocalizationFineActionmAP IOU@0.756.43DBG (i3d feature)
Temporal Action LocalizationFineActionmAP IOU@0.952.5DBG (i3d feature)
Zero-Shot LearningFineActionmAP6.75DBG (i3d feature)
Zero-Shot LearningFineActionmAP IOU@0.510.65DBG (i3d feature)
Zero-Shot LearningFineActionmAP IOU@0.756.43DBG (i3d feature)
Zero-Shot LearningFineActionmAP IOU@0.952.5DBG (i3d feature)
Action LocalizationFineActionmAP6.75DBG (i3d feature)
Action LocalizationFineActionmAP IOU@0.510.65DBG (i3d feature)
Action LocalizationFineActionmAP IOU@0.756.43DBG (i3d feature)
Action LocalizationFineActionmAP IOU@0.952.5DBG (i3d feature)

Related Papers

Language Integration in Fine-Tuning Multimodal Large Language Models for Image-Based Regression2025-07-20Channel-wise Motion Features for Efficient Motion Segmentation2025-07-17Neural Network-Guided Symbolic Regression for Interpretable Descriptor Discovery in Perovskite Catalysts2025-07-16Imbalanced Regression Pipeline Recommendation2025-07-16Second-Order Bounds for [0,1]-Valued Regression via Betting Loss2025-07-16DVFL-Net: A Lightweight Distilled Video Focal Modulation Network for Spatio-Temporal Action Recognition2025-07-16Sparse Regression Codes exploit Multi-User Diversity without CSI2025-07-15An Efficient Approach for Muscle Segmentation and 3D Reconstruction Using Keypoint Tracking in MRI Scan2025-07-11