TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/BEVHeight: A Robust Framework for Vision-based Roadside 3D...

BEVHeight: A Robust Framework for Vision-based Roadside 3D Object Detection

Lei Yang, Kaicheng Yu, Tao Tang, Jun Li, Kun Yuan, Li Wang, Xinyu Zhang, Peng Chen

2023-03-15CVPR 2023 1Autonomous Drivingobject-detection3D Object DetectionObject Detection
PaperPDFCode(official)

Abstract

While most recent autonomous driving system focuses on developing perception methods on ego-vehicle sensors, people tend to overlook an alternative approach to leverage intelligent roadside cameras to extend the perception ability beyond the visual range. We discover that the state-of-the-art vision-centric bird's eye view detection methods have inferior performances on roadside cameras. This is because these methods mainly focus on recovering the depth regarding the camera center, where the depth difference between the car and the ground quickly shrinks while the distance increases. In this paper, we propose a simple yet effective approach, dubbed BEVHeight, to address this issue. In essence, instead of predicting the pixel-wise depth, we regress the height to the ground to achieve a distance-agnostic formulation to ease the optimization process of camera-only perception methods. On popular 3D detection benchmarks of roadside cameras, our method surpasses all previous vision-centric methods by a significant margin. The code is available at {\url{https://github.com/ADLab-AutoDrive/BEVHeight}}.

Results

TaskDatasetMetricValueModel
Object DetectionRope3DAP@0.745.73BEVHeight
Object DetectionDAIR-V2X-IAP|R40(easy)77.8BEVHeight
Object DetectionDAIR-V2X-IAP|R40(hard)65.9BEVHeight
Object DetectionDAIR-V2X-IAP|R40(moderate)65.8BEVHeight
3DRope3DAP@0.745.73BEVHeight
3DDAIR-V2X-IAP|R40(easy)77.8BEVHeight
3DDAIR-V2X-IAP|R40(hard)65.9BEVHeight
3DDAIR-V2X-IAP|R40(moderate)65.8BEVHeight
3D Object DetectionRope3DAP@0.745.73BEVHeight
3D Object DetectionDAIR-V2X-IAP|R40(easy)77.8BEVHeight
3D Object DetectionDAIR-V2X-IAP|R40(hard)65.9BEVHeight
3D Object DetectionDAIR-V2X-IAP|R40(moderate)65.8BEVHeight
2D ClassificationRope3DAP@0.745.73BEVHeight
2D ClassificationDAIR-V2X-IAP|R40(easy)77.8BEVHeight
2D ClassificationDAIR-V2X-IAP|R40(hard)65.9BEVHeight
2D ClassificationDAIR-V2X-IAP|R40(moderate)65.8BEVHeight
2D Object DetectionRope3DAP@0.745.73BEVHeight
2D Object DetectionDAIR-V2X-IAP|R40(easy)77.8BEVHeight
2D Object DetectionDAIR-V2X-IAP|R40(hard)65.9BEVHeight
2D Object DetectionDAIR-V2X-IAP|R40(moderate)65.8BEVHeight
16kRope3DAP@0.745.73BEVHeight
16kDAIR-V2X-IAP|R40(easy)77.8BEVHeight
16kDAIR-V2X-IAP|R40(hard)65.9BEVHeight
16kDAIR-V2X-IAP|R40(moderate)65.8BEVHeight

Related Papers

GEMINUS: Dual-aware Global and Scene-Adaptive Mixture-of-Experts for End-to-End Autonomous Driving2025-07-19AGENTS-LLM: Augmentative GENeration of Challenging Traffic Scenarios with an Agentic LLM Framework2025-07-18World Model-Based End-to-End Scene Generation for Accident Anticipation in Autonomous Driving2025-07-17Orbis: Overcoming Challenges of Long-Horizon Prediction in Driving World Models2025-07-17Channel-wise Motion Features for Efficient Motion Segmentation2025-07-17LaViPlan : Language-Guided Visual Path Planning with RLVR2025-07-17A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17