TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/BSN: Boundary Sensitive Network for Temporal Action Propos...

BSN: Boundary Sensitive Network for Temporal Action Proposal Generation

Tianwei Lin, Xu Zhao, Haisheng Su, Chongjing Wang, Ming Yang

2018-06-08ECCV 2018 9Action DetectionTemporal Action Proposal GenerationTemporal Action Localization
PaperPDFCodeCodeCodeCode(official)CodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCode

Abstract

Temporal action proposal generation is an important yet challenging problem, since temporal proposals with rich action content are indispensable for analysing real-world videos with long duration and high proportion irrelevant content. This problem requires methods not only generating proposals with precise temporal boundaries, but also retrieving proposals to cover truth action instances with high recall and high overlap using relatively fewer proposals. To address these difficulties, we introduce an effective proposal generation method, named Boundary-Sensitive Network (BSN), which adopts "local to global" fashion. Locally, BSN first locates temporal boundaries with high probabilities, then directly combines these boundaries as proposals. Globally, with Boundary-Sensitive Proposal feature, BSN retrieves proposals by evaluating the confidence of whether a proposal contains an action within its region. We conduct experiments on two challenging datasets: ActivityNet-1.3 and THUMOS14, where BSN outperforms other state-of-the-art temporal action proposal generation methods with high recall and high temporal precision. Finally, further experiments demonstrate that by combining existing action classifiers, our method significantly improves the state-of-the-art temporal action detection performance.

Results

TaskDatasetMetricValueModel
VideoActivityNet-1.3mAP30.03BSN
VideoActivityNet-1.3mAP IOU@0.546.45BSN
VideoActivityNet-1.3mAP IOU@0.7529.96BSN
VideoActivityNet-1.3mAP IOU@0.958.02BSN
VideoTHUMOS’14mAP IOU@0.353.5BSN UNet
VideoTHUMOS’14mAP IOU@0.445BSN UNet
VideoTHUMOS’14mAP IOU@0.536.9BSN UNet
VideoTHUMOS’14mAP IOU@0.628.4BSN UNet
VideoTHUMOS’14mAP IOU@0.720BSN UNet
VideoTHUMOS' 14AR@10046.06BSN + Soft-NMS
VideoTHUMOS' 14AR@100064.52BSN + Soft-NMS
VideoTHUMOS' 14AR@20053.21BSN + Soft-NMS
VideoTHUMOS' 14AR@5037.46BSN + Soft-NMS
VideoTHUMOS' 14AR@50060.64BSN + Soft-NMS
VideoActivityNet-1.3AR@10074.16BSN
VideoActivityNet-1.3AUC (test)66.26BSN
VideoActivityNet-1.3AUC (val)66.17BSN
Temporal Action LocalizationActivityNet-1.3mAP30.03BSN
Temporal Action LocalizationActivityNet-1.3mAP IOU@0.546.45BSN
Temporal Action LocalizationActivityNet-1.3mAP IOU@0.7529.96BSN
Temporal Action LocalizationActivityNet-1.3mAP IOU@0.958.02BSN
Temporal Action LocalizationTHUMOS’14mAP IOU@0.353.5BSN UNet
Temporal Action LocalizationTHUMOS’14mAP IOU@0.445BSN UNet
Temporal Action LocalizationTHUMOS’14mAP IOU@0.536.9BSN UNet
Temporal Action LocalizationTHUMOS’14mAP IOU@0.628.4BSN UNet
Temporal Action LocalizationTHUMOS’14mAP IOU@0.720BSN UNet
Temporal Action LocalizationTHUMOS' 14AR@10046.06BSN + Soft-NMS
Temporal Action LocalizationTHUMOS' 14AR@100064.52BSN + Soft-NMS
Temporal Action LocalizationTHUMOS' 14AR@20053.21BSN + Soft-NMS
Temporal Action LocalizationTHUMOS' 14AR@5037.46BSN + Soft-NMS
Temporal Action LocalizationTHUMOS' 14AR@50060.64BSN + Soft-NMS
Temporal Action LocalizationActivityNet-1.3AR@10074.16BSN
Temporal Action LocalizationActivityNet-1.3AUC (test)66.26BSN
Temporal Action LocalizationActivityNet-1.3AUC (val)66.17BSN
Zero-Shot LearningActivityNet-1.3mAP30.03BSN
Zero-Shot LearningActivityNet-1.3mAP IOU@0.546.45BSN
Zero-Shot LearningActivityNet-1.3mAP IOU@0.7529.96BSN
Zero-Shot LearningActivityNet-1.3mAP IOU@0.958.02BSN
Zero-Shot LearningTHUMOS’14mAP IOU@0.353.5BSN UNet
Zero-Shot LearningTHUMOS’14mAP IOU@0.445BSN UNet
Zero-Shot LearningTHUMOS’14mAP IOU@0.536.9BSN UNet
Zero-Shot LearningTHUMOS’14mAP IOU@0.628.4BSN UNet
Zero-Shot LearningTHUMOS’14mAP IOU@0.720BSN UNet
Zero-Shot LearningTHUMOS' 14AR@10046.06BSN + Soft-NMS
Zero-Shot LearningTHUMOS' 14AR@100064.52BSN + Soft-NMS
Zero-Shot LearningTHUMOS' 14AR@20053.21BSN + Soft-NMS
Zero-Shot LearningTHUMOS' 14AR@5037.46BSN + Soft-NMS
Zero-Shot LearningTHUMOS' 14AR@50060.64BSN + Soft-NMS
Zero-Shot LearningActivityNet-1.3AR@10074.16BSN
Zero-Shot LearningActivityNet-1.3AUC (test)66.26BSN
Zero-Shot LearningActivityNet-1.3AUC (val)66.17BSN
Activity RecognitionTHUMOS’14mAP@0.353.5BSN
Activity RecognitionTHUMOS’14mAP@0.445BSN
Activity RecognitionTHUMOS’14mAP@0.536.9BSN
Action LocalizationActivityNet-1.3mAP30.03BSN
Action LocalizationActivityNet-1.3mAP IOU@0.546.45BSN
Action LocalizationActivityNet-1.3mAP IOU@0.7529.96BSN
Action LocalizationActivityNet-1.3mAP IOU@0.958.02BSN
Action LocalizationTHUMOS’14mAP IOU@0.353.5BSN UNet
Action LocalizationTHUMOS’14mAP IOU@0.445BSN UNet
Action LocalizationTHUMOS’14mAP IOU@0.536.9BSN UNet
Action LocalizationTHUMOS’14mAP IOU@0.628.4BSN UNet
Action LocalizationTHUMOS’14mAP IOU@0.720BSN UNet
Action LocalizationTHUMOS' 14AR@10046.06BSN + Soft-NMS
Action LocalizationTHUMOS' 14AR@100064.52BSN + Soft-NMS
Action LocalizationTHUMOS' 14AR@20053.21BSN + Soft-NMS
Action LocalizationTHUMOS' 14AR@5037.46BSN + Soft-NMS
Action LocalizationTHUMOS' 14AR@50060.64BSN + Soft-NMS
Action LocalizationActivityNet-1.3AR@10074.16BSN
Action LocalizationActivityNet-1.3AUC (test)66.26BSN
Action LocalizationActivityNet-1.3AUC (val)66.17BSN
Action RecognitionTHUMOS’14mAP@0.353.5BSN
Action RecognitionTHUMOS’14mAP@0.445BSN
Action RecognitionTHUMOS’14mAP@0.536.9BSN

Related Papers

DVFL-Net: A Lightweight Distilled Video Focal Modulation Network for Spatio-Temporal Action Recognition2025-07-16CBF-AFA: Chunk-Based Multi-SSL Fusion for Automatic Fluency Assessment2025-06-25MultiHuman-Testbench: Benchmarking Image Generation for Multiple Humans2025-06-25Including Semantic Information via Word Embeddings for Skeleton-based Action Recognition2025-06-23Distributed Activity Detection for Cell-Free Hybrid Near-Far Field Communications2025-06-17Zero-Shot Temporal Interaction Localization for Egocentric Videos2025-06-04Speaker Diarization with Overlapping Community Detection Using Graph Attention Networks and Label Propagation Algorithm2025-06-03Attention Is Not Always the Answer: Optimizing Voice Activity Detection with Simple Feature Fusion2025-06-02