TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/FFAVOD: Feature Fusion Architecture for Video Object Detec...

FFAVOD: Feature Fusion Architecture for Video Object Detection

Hughes Perreault, Guillaume-Alexandre Bilodeau, Nicolas Saunier, Maguelonne Héritier

2021-09-15Video Object Detectionobject-detectionObject Detection
PaperPDFCode(official)

Abstract

A significant amount of redundancy exists between consecutive frames of a video. Object detectors typically produce detections for one image at a time, without any capabilities for taking advantage of this redundancy. Meanwhile, many applications for object detection work with videos, including intelligent transportation systems, advanced driver assistance systems and video surveillance. Our work aims at taking advantage of the similarity between video frames to produce better detections. We propose FFAVOD, standing for feature fusion architecture for video object detection. We first introduce a novel video object detection architecture that allows a network to share feature maps between nearby frames. Second, we propose a feature fusion module that learns to merge feature maps to enhance them. We show that using the proposed architecture and the fusion module can improve the performance of three base object detectors on two object detection benchmarks containing sequences of moving road users. Additionally, to further increase performance, we propose an improvement to the SpotNet attention module. Using our architecture on the improved SpotNet detector, we obtain the state-of-the-art performance on the UA-DETRAC public benchmark as well as on the UAVDT dataset. Code is available at https://github.com/hu64/FFAVOD.

Results

TaskDatasetMetricValueModel
Object DetectionUAVDTmAP53.76FFAVOD-SpotNet with U-Net
Object DetectionUA-DETRACmAP88.1FFAVOD-SpotNet with U-Net
3DUAVDTmAP53.76FFAVOD-SpotNet with U-Net
3DUA-DETRACmAP88.1FFAVOD-SpotNet with U-Net
2D ClassificationUAVDTmAP53.76FFAVOD-SpotNet with U-Net
2D ClassificationUA-DETRACmAP88.1FFAVOD-SpotNet with U-Net
2D Object DetectionUAVDTmAP53.76FFAVOD-SpotNet with U-Net
2D Object DetectionUA-DETRACmAP88.1FFAVOD-SpotNet with U-Net
16kUAVDTmAP53.76FFAVOD-SpotNet with U-Net
16kUA-DETRACmAP88.1FFAVOD-SpotNet with U-Net

Related Papers

A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge2025-07-08Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations2025-07-07