TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Categorical Depth Distribution Network for Monocular 3D Ob...

Categorical Depth Distribution Network for Monocular 3D Object Detection

Cody Reading, Ali Harakeh, Julia Chae, Steven L. Waslander

2021-03-01CVPR 2021 1Autonomous VehiclesMonocular 3D Object DetectionDepth Estimationobject-detection3D Object DetectionObject Detection
PaperPDFCode(official)Code

Abstract

Monocular 3D object detection is a key problem for autonomous vehicles, as it provides a solution with simple configuration compared to typical multi-sensor systems. The main challenge in monocular 3D detection lies in accurately predicting object depth, which must be inferred from object and scene cues due to the lack of direct range measurement. Many methods attempt to directly estimate depth to assist in 3D detection, but show limited performance as a result of depth inaccuracy. Our proposed solution, Categorical Depth Distribution Network (CaDDN), uses a predicted categorical depth distribution for each pixel to project rich contextual feature information to the appropriate depth interval in 3D space. We then use the computationally efficient bird's-eye-view projection and single-stage detector to produce the final output bounding boxes. We design CaDDN as a fully differentiable end-to-end approach for joint depth estimation and object detection. We validate our approach on the KITTI 3D object detection benchmark, where we rank 1st among published monocular methods. We also provide the first monocular 3D detection results on the newly released Waymo Open Dataset. We provide a code release for CaDDN which is made available.

Results

TaskDatasetMetricValueModel
Object DetectionKITTI Cars EasyAP Easy19.17CaDDN
Object DetectionKITTI Cyclist ModerateAP Medium3.41CaDDN
Object DetectionKITTI Cyclist HardAP Hard3.3CaDDN
Object DetectionKITTI Cars ModerateAP Medium13.41CaDDN
Object DetectionKITTI Pedestrian EasyAP Easy12.87CaDDN
Object DetectionKITTI Pedestrian HardAP Hard6.76CaDDN
Object DetectionKITTI Pedestrian ModerateAP Medium8.14CaDDN
Object DetectionKITTI Cars HardAP Hard11.46CaDDN
Object DetectionKITTI Cyclist EasyAP Easy7CaDDN
3DKITTI Cars EasyAP Easy19.17CaDDN
3DKITTI Cyclist ModerateAP Medium3.41CaDDN
3DKITTI Cyclist HardAP Hard3.3CaDDN
3DKITTI Cars ModerateAP Medium13.41CaDDN
3DKITTI Pedestrian EasyAP Easy12.87CaDDN
3DKITTI Pedestrian HardAP Hard6.76CaDDN
3DKITTI Pedestrian ModerateAP Medium8.14CaDDN
3DKITTI Cars HardAP Hard11.46CaDDN
3DKITTI Cyclist EasyAP Easy7CaDDN
3D Object DetectionKITTI Cars EasyAP Easy19.17CaDDN
3D Object DetectionKITTI Cyclist ModerateAP Medium3.41CaDDN
3D Object DetectionKITTI Cyclist HardAP Hard3.3CaDDN
3D Object DetectionKITTI Cars ModerateAP Medium13.41CaDDN
3D Object DetectionKITTI Pedestrian EasyAP Easy12.87CaDDN
3D Object DetectionKITTI Pedestrian HardAP Hard6.76CaDDN
3D Object DetectionKITTI Pedestrian ModerateAP Medium8.14CaDDN
3D Object DetectionKITTI Cars HardAP Hard11.46CaDDN
3D Object DetectionKITTI Cyclist EasyAP Easy7CaDDN
2D ClassificationKITTI Cars EasyAP Easy19.17CaDDN
2D ClassificationKITTI Cyclist ModerateAP Medium3.41CaDDN
2D ClassificationKITTI Cyclist HardAP Hard3.3CaDDN
2D ClassificationKITTI Cars ModerateAP Medium13.41CaDDN
2D ClassificationKITTI Pedestrian EasyAP Easy12.87CaDDN
2D ClassificationKITTI Pedestrian HardAP Hard6.76CaDDN
2D ClassificationKITTI Pedestrian ModerateAP Medium8.14CaDDN
2D ClassificationKITTI Cars HardAP Hard11.46CaDDN
2D ClassificationKITTI Cyclist EasyAP Easy7CaDDN
2D Object DetectionKITTI Cars EasyAP Easy19.17CaDDN
2D Object DetectionKITTI Cyclist ModerateAP Medium3.41CaDDN
2D Object DetectionKITTI Cyclist HardAP Hard3.3CaDDN
2D Object DetectionKITTI Cars ModerateAP Medium13.41CaDDN
2D Object DetectionKITTI Pedestrian EasyAP Easy12.87CaDDN
2D Object DetectionKITTI Pedestrian HardAP Hard6.76CaDDN
2D Object DetectionKITTI Pedestrian ModerateAP Medium8.14CaDDN
2D Object DetectionKITTI Cars HardAP Hard11.46CaDDN
2D Object DetectionKITTI Cyclist EasyAP Easy7CaDDN
16kKITTI Cars EasyAP Easy19.17CaDDN
16kKITTI Cyclist ModerateAP Medium3.41CaDDN
16kKITTI Cyclist HardAP Hard3.3CaDDN
16kKITTI Cars ModerateAP Medium13.41CaDDN
16kKITTI Pedestrian EasyAP Easy12.87CaDDN
16kKITTI Pedestrian HardAP Hard6.76CaDDN
16kKITTI Pedestrian ModerateAP Medium8.14CaDDN
16kKITTI Cars HardAP Hard11.46CaDDN
16kKITTI Cyclist EasyAP Easy7CaDDN

Related Papers

$S^2M^2$: Scalable Stereo Matching Model for Reliable Depth Estimation2025-07-17$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation2025-07-16