Kinematic 3D Object Detection in Monocular Video

Garrick Brazil, Gerard Pons-Moll, Xiaoming Liu, Bernt Schiele

Published: 2020-07-19 (ECCV 2020)
Tasks: Monocular 3D Object Detection, Vehicle Pose Estimation, object-detection, 3D Object Detection, Object Detection

Abstract

Perceiving the physical world in 3D is fundamental for self-driving applications. Although temporal motion is an invaluable resource to human vision for detection, tracking, and depth perception, such features have not been thoroughly utilized in modern 3D object detectors. In this work, we propose a novel method for monocular video-based 3D object detection which carefully leverages kinematic motion to improve precision of 3D localization. Specifically, we first propose a novel decomposition of object orientation as well as a self-balancing 3D confidence. We show that both components are critical to enable our kinematic model to work effectively. Collectively, using only a single model, we efficiently leverage 3D kinematics from monocular videos to improve the overall localization precision in 3D object detection while also producing useful by-products of scene dynamics (ego-motion and per-object velocity). We achieve state-of-the-art performance on monocular 3D object detection and the Bird's Eye View tasks within the KITTI self-driving dataset.
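
To make the idea of using kinematic motion concrete, here is a minimal sketch of a generic constant-velocity Kalman filter that smooths a per-frame position estimate (for example, an object's depth) and recovers a velocity as a by-product. This is a textbook filter offered only as an illustration; it is not the paper's kinematic model, and the class name, state layout, and noise values (`q`, `r`) are all assumptions.

```python
from dataclasses import dataclass


@dataclass
class ConstantVelocityFilter:
    """Hypothetical 1D constant-velocity Kalman filter (illustration only)."""
    z: float            # estimated position, e.g. depth in metres
    v: float = 0.0      # estimated velocity
    # Covariance P = [[p_zz, p_zv], [p_zv, p_vv]]
    p_zz: float = 1.0
    p_zv: float = 0.0
    p_vv: float = 1.0
    q: float = 0.01     # assumed process noise added to each variance per step

    def predict(self, dt: float) -> None:
        # State transition x <- F x with F = [[1, dt], [0, 1]]
        self.z += self.v * dt
        # Covariance P <- F P F^T + Q (Q = q * I, an assumption)
        p_zz = self.p_zz + 2 * dt * self.p_zv + dt * dt * self.p_vv + self.q
        p_zv = self.p_zv + dt * self.p_vv
        self.p_zz, self.p_zv = p_zz, p_zv
        self.p_vv += self.q

    def update(self, z_meas: float, r: float) -> None:
        # Only the position is observed: H = [1, 0]
        s = self.p_zz + r              # innovation variance
        k_z = self.p_zz / s            # Kalman gain for position
        k_v = self.p_zv / s            # Kalman gain for velocity
        resid = z_meas - self.z
        self.z += k_z * resid
        self.v += k_v * resid
        # Covariance P <- (I - K H) P
        p_zz = (1 - k_z) * self.p_zz
        p_zv = (1 - k_z) * self.p_zv
        p_vv = self.p_vv - k_v * self.p_zv
        self.p_zz, self.p_zv, self.p_vv = p_zz, p_zv, p_vv


# Usage: track an object receding at 2 m/s, observed once per second.
track = ConstantVelocityFilter(z=10.0)
for t in range(1, 101):
    track.predict(dt=1.0)
    track.update(z_meas=10.0 + 2.0 * t, r=0.25)
```

In a sketch like this, the measurement variance `r` is the natural place for a per-detection confidence to enter: a more trusted detection would get a smaller `r`. How the paper itself couples confidence and motion is not described in the abstract.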

Results

| Task | Dataset | Metric | Value | Model |
| --- | --- | --- | --- | --- |
| Pose Estimation | KITTI Cars Hard | Average Orientation Similarity | 34.81 | Kinematic3D |
| Object Detection | Rope3D | AP@0.7 | 17.74 | Kinematic3D+(G) |
| Object Detection | KITTI Cars Moderate | AP Medium | 12.72 | Kinematic3D |
| 3D | Rope3D | AP@0.7 | 17.74 | Kinematic3D+(G) |
| 3D | KITTI Cars Moderate | AP Medium | 12.72 | Kinematic3D |
| 3D | KITTI Cars Hard | Average Orientation Similarity | 34.81 | Kinematic3D |
| 3D Object Detection | Rope3D | AP@0.7 | 17.74 | Kinematic3D+(G) |
| 3D Object Detection | KITTI Cars Moderate | AP Medium | 12.72 | Kinematic3D |
| 2D Classification | Rope3D | AP@0.7 | 17.74 | Kinematic3D+(G) |
| 2D Classification | KITTI Cars Moderate | AP Medium | 12.72 | Kinematic3D |
| 2D Object Detection | Rope3D | AP@0.7 | 17.74 | Kinematic3D+(G) |
| 2D Object Detection | KITTI Cars Moderate | AP Medium | 12.72 | Kinematic3D |
| 1 Image, 2*2 Stitchi | KITTI Cars Hard | Average Orientation Similarity | 34.81 | Kinematic3D |
| 16k | Rope3D | AP@0.7 | 17.74 | Kinematic3D+(G) |
| 16k | KITTI Cars Moderate | AP Medium | 12.72 | Kinematic3D |
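
For readers unfamiliar with the Average Orientation Similarity metric listed above, the following sketch shows the per-detection term it is built on, assuming the standard KITTI definition: a predicted heading is scored as (1 + cos Δθ) / 2, so a perfect heading scores 1 and a reversed one scores 0. The function names are hypothetical, and the full benchmark metric additionally averages this score over recall thresholds and penalizes unmatched detections, which the sketch omits.

```python
import math


def orientation_similarity(pred_yaw: float, gt_yaw: float) -> float:
    """Cosine-based similarity in [0, 1] between two headings in radians."""
    return (1.0 + math.cos(pred_yaw - gt_yaw)) / 2.0


def mean_orientation_similarity(pairs):
    """Average similarity over matched (predicted, ground-truth) yaw pairs.

    Simplification: the benchmark metric also averages over recall levels;
    this helper only averages over an already-matched set of detections.
    """
    if not pairs:
        return 0.0
    return sum(orientation_similarity(p, g) for p, g in pairs) / len(pairs)
```

The benchmark reports the final score as a percentage, which is how a value such as 34.81 should be read.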

Related Papers

A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains (2025-07-17)
RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images (2025-07-17)
Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection (2025-07-17)
Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis (2025-07-17)
Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios (2025-07-16)
Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping (2025-07-15)
ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge (2025-07-08)
Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations (2025-07-07)