TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Robust and efficient post-processing for video object dete...

Robust and efficient post-processing for video object detection

Alberto Sabater, Luis Montesano, Ana C. Murillo

2020-09-23Video Object DetectionObject RecognitionAutonomous Drivingobject-detectionObject Detection
PaperPDFCode

Abstract

Object recognition in video is an important task for plenty of applications, including autonomous driving perception, surveillance tasks, wearable devices or IoT networks. Object recognition using video data is more challenging than using still images due to blur, occlusions or rare object poses. Specific video detectors with high computational cost or standard image detectors together with a fast post-processing algorithm achieve the current state-of-the-art. This work introduces a novel post-processing pipeline that overcomes some of the limitations of previous post-processing methods by introducing a learning-based similarity evaluation between detections across frames. Our method improves the results of state-of-the-art specific video detectors, specially regarding fast moving objects, and presents low resource requirements. And applied to efficient still image detectors, such as YOLO, provides comparable results to much more computationally intensive detectors.

Results

TaskDatasetMetricValueModel
Object DetectionImageNet VIDMAP 84.2REPP + SELSA (ResNet-101)
Object DetectionImageNet VIDMAP 80.1REPP + FGFA
Object DetectionImageNet VIDMAP 75.1REPP + YOLOv3
Object DetectionImageNet VIDMAP 68.6YOLOv3
3DImageNet VIDMAP 84.2REPP + SELSA (ResNet-101)
3DImageNet VIDMAP 80.1REPP + FGFA
3DImageNet VIDMAP 75.1REPP + YOLOv3
3DImageNet VIDMAP 68.6YOLOv3
2D ClassificationImageNet VIDMAP 84.2REPP + SELSA (ResNet-101)
2D ClassificationImageNet VIDMAP 80.1REPP + FGFA
2D ClassificationImageNet VIDMAP 75.1REPP + YOLOv3
2D ClassificationImageNet VIDMAP 68.6YOLOv3
2D Object DetectionImageNet VIDMAP 84.2REPP + SELSA (ResNet-101)
2D Object DetectionImageNet VIDMAP 80.1REPP + FGFA
2D Object DetectionImageNet VIDMAP 75.1REPP + YOLOv3
2D Object DetectionImageNet VIDMAP 68.6YOLOv3
16kImageNet VIDMAP 84.2REPP + SELSA (ResNet-101)
16kImageNet VIDMAP 80.1REPP + FGFA
16kImageNet VIDMAP 75.1REPP + YOLOv3
16kImageNet VIDMAP 68.6YOLOv3

Related Papers

GEMINUS: Dual-aware Global and Scene-Adaptive Mixture-of-Experts for End-to-End Autonomous Driving2025-07-19AGENTS-LLM: Augmentative GENeration of Challenging Traffic Scenarios with an Agentic LLM Framework2025-07-18World Model-Based End-to-End Scene Generation for Accident Anticipation in Autonomous Driving2025-07-17Orbis: Overcoming Challenges of Long-Horizon Prediction in Driving World Models2025-07-17Channel-wise Motion Features for Efficient Motion Segmentation2025-07-17LaViPlan : Language-Guided Visual Path Planning with RLVR2025-07-17A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17