TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Lifting Multi-View Detection and Tracking to the Bird's Ey...

Lifting Multi-View Detection and Tracking to the Bird's Eye View

Torben Teepe, Philipp Wolters, Johannes Gilg, Fabian Herzog, Gerhard Rigoll

2024-03-193D Object RecognitionMultiview DetectionMulti-Object TrackingObject RecognitionObject Tracking
PaperPDFCode(official)

Abstract

Taking advantage of multi-view aggregation presents a promising solution to tackle challenges such as occlusion and missed detection in multi-object tracking and detection. Recent advancements in multi-view detection and 3D object recognition have significantly improved performance by strategically projecting all views onto the ground plane and conducting detection analysis from a Bird's Eye View. In this paper, we compare modern lifting methods, both parameter-free and parameterized, to multi-view aggregation. Additionally, we present an architecture that aggregates the features of multiple times steps to learn robust detection and combines appearance- and motion-based cues for tracking. Most current tracking approaches either focus on pedestrians or vehicles. In our work, we combine both branches and add new challenges to multi-view detection with cross-scene setups. Our method generalizes to three public datasets across two domains: (1) pedestrian: Wildtrack and MultiviewX, and (2) roadside perception: Synthehicle, achieving state-of-the-art performance in detection and tracking. https://github.com/tteepe/TrackTacular

Results

TaskDatasetMetricValueModel
Multi-Object TrackingMultiviewXIDF185.6TrackTacular (Bilinear Sampling)
Multi-Object TrackingMultiviewXMOTA92.4TrackTacular (Bilinear Sampling)
Multi-Object TrackingWildtrackIDF195.3TrackTacular (Bilinear Sampling)
Multi-Object TrackingWildtrackMOTA91.8TrackTacular (Bilinear Sampling)
Object TrackingMultiviewXIDF185.6TrackTacular (Bilinear Sampling)
Object TrackingMultiviewXMOTA92.4TrackTacular (Bilinear Sampling)
Object TrackingWildtrackIDF195.3TrackTacular (Bilinear Sampling)
Object TrackingWildtrackMOTA91.8TrackTacular (Bilinear Sampling)
Object DetectionWildtrackMODA93.2TrackTacular (Depth Splatting)
Object DetectionWildtrackMODP77.5TrackTacular (Depth Splatting)
Object DetectionWildtrackRecall95.8TrackTacular (Depth Splatting)
Object DetectionMultiviewXMODA96.5TrackTacular (Bilinear Sampling)
Object DetectionMultiviewXMODP75TrackTacular (Bilinear Sampling)
Object DetectionMultiviewXRecall97.1TrackTacular (Bilinear Sampling)
3DWildtrackMODA93.2TrackTacular (Depth Splatting)
3DWildtrackMODP77.5TrackTacular (Depth Splatting)
3DWildtrackRecall95.8TrackTacular (Depth Splatting)
3DMultiviewXMODA96.5TrackTacular (Bilinear Sampling)
3DMultiviewXMODP75TrackTacular (Bilinear Sampling)
3DMultiviewXRecall97.1TrackTacular (Bilinear Sampling)
3D Object DetectionWildtrackMODA93.2TrackTacular (Depth Splatting)
3D Object DetectionWildtrackMODP77.5TrackTacular (Depth Splatting)
3D Object DetectionWildtrackRecall95.8TrackTacular (Depth Splatting)
3D Object DetectionMultiviewXMODA96.5TrackTacular (Bilinear Sampling)
3D Object DetectionMultiviewXMODP75TrackTacular (Bilinear Sampling)
3D Object DetectionMultiviewXRecall97.1TrackTacular (Bilinear Sampling)
2D ClassificationWildtrackMODA93.2TrackTacular (Depth Splatting)
2D ClassificationWildtrackMODP77.5TrackTacular (Depth Splatting)
2D ClassificationWildtrackRecall95.8TrackTacular (Depth Splatting)
2D ClassificationMultiviewXMODA96.5TrackTacular (Bilinear Sampling)
2D ClassificationMultiviewXMODP75TrackTacular (Bilinear Sampling)
2D ClassificationMultiviewXRecall97.1TrackTacular (Bilinear Sampling)
2D Object DetectionWildtrackMODA93.2TrackTacular (Depth Splatting)
2D Object DetectionWildtrackMODP77.5TrackTacular (Depth Splatting)
2D Object DetectionWildtrackRecall95.8TrackTacular (Depth Splatting)
2D Object DetectionMultiviewXMODA96.5TrackTacular (Bilinear Sampling)
2D Object DetectionMultiviewXMODP75TrackTacular (Bilinear Sampling)
2D Object DetectionMultiviewXRecall97.1TrackTacular (Bilinear Sampling)
16kWildtrackMODA93.2TrackTacular (Depth Splatting)
16kWildtrackMODP77.5TrackTacular (Depth Splatting)
16kWildtrackRecall95.8TrackTacular (Depth Splatting)
16kMultiviewXMODA96.5TrackTacular (Bilinear Sampling)
16kMultiviewXMODP75TrackTacular (Bilinear Sampling)
16kMultiviewXRecall97.1TrackTacular (Bilinear Sampling)

Related Papers

MVA 2025 Small Multi-Object Tracking for Spotting Birds Challenge: Dataset, Methods, and Results2025-07-17YOLOv8-SMOT: An Efficient and Robust Framework for Real-Time Small Object Tracking via Slice-Assisted Training and Adaptive Association2025-07-16HiM2SAM: Enhancing SAM2 with Hierarchical Motion Estimation and Memory Optimization towards Long-term Tracking2025-07-10GeoMag: A Vision-Language Model for Pixel-level Fine-Grained Remote Sensing Image Parsing2025-07-08Robustifying 3D Perception through Least-Squares Multi-Agent Graphs Object Tracking2025-07-07Out-of-distribution detection in 3D applications: a review2025-07-01UMDATrack: Unified Multi-Domain Adaptive Tracking Under Adverse Weather Conditions2025-07-01Mamba-FETrack V2: Revisiting State Space Model for Frame-Event based Visual Object Tracking2025-06-30