TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Temporal Convolution Based Action Proposal: Submission to ...

Temporal Convolution Based Action Proposal: Submission to ActivityNet 2017

Tianwei Lin, Xu Zhao, Zheng Shou

2017-07-21Action ClassificationAction LocalizationGeneral ClassificationTemporal Action Localization
PaperPDF

Abstract

In this notebook paper, we describe our approach in the submission to the temporal action proposal (task 3) and temporal action localization (task 4) of ActivityNet Challenge hosted at CVPR 2017. Since the accuracy in action classification task is already very high (nearly 90% in ActivityNet dataset), we believe that the main bottleneck for temporal action localization is the quality of action proposals. Therefore, we mainly focus on the temporal action proposal task and propose a new proposal model based on temporal convolutional network. Our approach achieves the state-of-the-art performances on both temporal action proposal task and temporal action localization task.

Results

TaskDatasetMetricValueModel
VideoActivityNet-1.3AR@10073.01Lin et al.
VideoActivityNet-1.3AUC (test)64.8Lin et al.
VideoActivityNet-1.3AUC (val)64.4Lin et al.
Temporal Action LocalizationActivityNet-1.3AR@10073.01Lin et al.
Temporal Action LocalizationActivityNet-1.3AUC (test)64.8Lin et al.
Temporal Action LocalizationActivityNet-1.3AUC (val)64.4Lin et al.
Zero-Shot LearningActivityNet-1.3AR@10073.01Lin et al.
Zero-Shot LearningActivityNet-1.3AUC (test)64.8Lin et al.
Zero-Shot LearningActivityNet-1.3AUC (val)64.4Lin et al.
Action LocalizationActivityNet-1.3AR@10073.01Lin et al.
Action LocalizationActivityNet-1.3AUC (test)64.8Lin et al.
Action LocalizationActivityNet-1.3AUC (val)64.4Lin et al.

Related Papers

DVFL-Net: A Lightweight Distilled Video Focal Modulation Network for Spatio-Temporal Action Recognition2025-07-16Including Semantic Information via Word Embeddings for Skeleton-based Action Recognition2025-06-23SurgBench: A Unified Large-Scale Benchmark for Surgical Video Analysis2025-06-09From Play to Replay: Composed Video Retrieval for Temporally Fine-Grained Videos2025-06-05Zero-Shot Temporal Interaction Localization for Egocentric Videos2025-06-04A Review on Coarse to Fine-Grained Animal Action Recognition2025-06-01LLM-powered Query Expansion for Enhancing Boundary Prediction in Language-driven Action Localization2025-05-30Spatio-Temporal Joint Density Driven Learning for Skeleton-Based Action Recognition2025-05-29