TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/BEVDepth: Acquisition of Reliable Depth for Multi-view 3D ...

BEVDepth: Acquisition of Reliable Depth for Multi-view 3D Object Detection

Yinhao Li, Zheng Ge, Guanyi Yu, Jinrong Yang, Zengran Wang, Yukang Shi, Jianjian Sun, Zeming Li

2022-06-21Robust Camera Only 3D Object DetectionDepth Estimation3D Object DetectionObject Detection
PaperPDFCode(official)Code

Abstract

In this research, we propose a new 3D object detector with a trustworthy depth estimation, dubbed BEVDepth, for camera-based Bird's-Eye-View (BEV) 3D object detection. Our work is based on a key observation -- depth estimation in recent approaches is surprisingly inadequate given the fact that depth is essential to camera 3D detection. Our BEVDepth resolves this by leveraging explicit depth supervision. A camera-awareness depth estimation module is also introduced to facilitate the depth predicting capability. Besides, we design a novel Depth Refinement Module to counter the side effects carried by imprecise feature unprojection. Aided by customized Efficient Voxel Pooling and multi-frame mechanism, BEVDepth achieves the new state-of-the-art 60.9% NDS on the challenging nuScenes test set while maintaining high efficiency. For the first time, the NDS score of a camera model reaches 60%.

Results

TaskDatasetMetricValueModel
Object DetectionnuScenes Camera OnlyNDS60.9BEVDepth-pure
Object DetectionRope3DAP@0.742.56BEVDepth
Object DetectionDAIR-V2X-IAP|R40(easy)75.7BEVDepth
Object DetectionDAIR-V2X-IAP|R40(hard)63.7BEVDepth
Object DetectionDAIR-V2X-IAP|R40(moderate)63.6BEVDepth
3DnuScenes Camera OnlyNDS60.9BEVDepth-pure
3DRope3DAP@0.742.56BEVDepth
3DDAIR-V2X-IAP|R40(easy)75.7BEVDepth
3DDAIR-V2X-IAP|R40(hard)63.7BEVDepth
3DDAIR-V2X-IAP|R40(moderate)63.6BEVDepth
3D Object DetectionnuScenes Camera OnlyNDS60.9BEVDepth-pure
3D Object DetectionRope3DAP@0.742.56BEVDepth
3D Object DetectionDAIR-V2X-IAP|R40(easy)75.7BEVDepth
3D Object DetectionDAIR-V2X-IAP|R40(hard)63.7BEVDepth
3D Object DetectionDAIR-V2X-IAP|R40(moderate)63.6BEVDepth
2D ClassificationnuScenes Camera OnlyNDS60.9BEVDepth-pure
2D ClassificationRope3DAP@0.742.56BEVDepth
2D ClassificationDAIR-V2X-IAP|R40(easy)75.7BEVDepth
2D ClassificationDAIR-V2X-IAP|R40(hard)63.7BEVDepth
2D ClassificationDAIR-V2X-IAP|R40(moderate)63.6BEVDepth
2D Object DetectionnuScenes Camera OnlyNDS60.9BEVDepth-pure
2D Object DetectionRope3DAP@0.742.56BEVDepth
2D Object DetectionDAIR-V2X-IAP|R40(easy)75.7BEVDepth
2D Object DetectionDAIR-V2X-IAP|R40(hard)63.7BEVDepth
2D Object DetectionDAIR-V2X-IAP|R40(moderate)63.6BEVDepth
16knuScenes Camera OnlyNDS60.9BEVDepth-pure
16kRope3DAP@0.742.56BEVDepth
16kDAIR-V2X-IAP|R40(easy)75.7BEVDepth
16kDAIR-V2X-IAP|R40(hard)63.7BEVDepth
16kDAIR-V2X-IAP|R40(moderate)63.6BEVDepth

Related Papers

$S^2M^2$: Scalable Stereo Matching Model for Reliable Depth Estimation2025-07-17$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation2025-07-16Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16