
Ground-aware Monocular 3D Object Detection for Autonomous Driving

Yuxuan Liu, Yuan Yixuan, Ming Liu

2021-02-01
Tasks: Monocular 3D Object Detection, Depth Prediction, Autonomous Driving, Pose Estimation, 6D Pose Estimation using RGB, Depth Estimation, object-detection, 3D Object Detection, Object Detection

Abstract

Estimating the 3D position and orientation of objects in the environment with a single RGB camera is a critical and challenging task for low-cost urban autonomous driving and mobile robots. Most existing algorithms are based on geometric constraints in 2D-3D correspondence, an approach that stems from generic 6D object pose estimation. We first identify how the ground plane provides additional clues for depth reasoning in 3D detection in driving scenes. Based on this observation, we then improve the processing of 3D anchors and introduce a novel neural network module to fully utilize such application-specific priors in the framework of deep learning. Finally, we introduce an efficient neural network embedded with the proposed module for 3D object detection. We further verify the power of the proposed module with a neural network designed for monocular depth prediction. The two proposed networks achieve state-of-the-art performance on the KITTI 3D object detection and depth prediction benchmarks, respectively. The code will be published at https://www.github.com/Owen-Liuyuxuan/visualDet3D
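To make the ground-plane clue concrete, here is a minimal sketch of the standard pinhole relation between an image row and the depth of a point lying on the ground. It is not taken from the paper's code. It assumes a flat road, a camera whose optical axis is parallel to the ground, and a known camera height. The function name `ground_depth` and the numeric values in the usage lines are illustrative assumptions.

```python
import math


def ground_depth(v: float, fy: float, cy: float, cam_height: float) -> float:
    """Depth in metres of a ground-plane point that projects to image row v.

    Assumes a pinhole camera with a level optical axis above flat ground:
    a ground point at depth z lies cam_height below the camera, so it
    projects to row v = cy + fy * cam_height / z. Solving for z gives the
    expression returned below.
    """
    dv = v - cy
    if dv <= 0:
        # Rows at or above the horizon never intersect the ground plane.
        return math.inf
    return fy * cam_height / dv


# Illustrative values only (hypothetical camera, not a real calibration).
fy, cy, cam_height = 700.0, 200.0, 1.5
near = ground_depth(305.0, fy, cy, cam_height)  # row well below the horizon
far = ground_depth(221.0, fy, cy, cam_height)   # row close to the horizon
```

Under these assumptions, the row where an object touches the ground already constrains its depth, and rows nearer the horizon map to larger depths.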

Results

Task                 Dataset              Metric     Value  Model
Object Detection     KITTI Cars Moderate  AP Medium  13.17  GAC
Object Detection     KITTI Cars Hard      AP Hard    9.94   GAC
3D                   KITTI Cars Moderate  AP Medium  13.17  GAC
3D                   KITTI Cars Hard      AP Hard    9.94   GAC
3D Object Detection  KITTI Cars Moderate  AP Medium  13.17  GAC
3D Object Detection  KITTI Cars Hard      AP Hard    9.94   GAC
2D Classification    KITTI Cars Moderate  AP Medium  13.17  GAC
2D Classification    KITTI Cars Hard      AP Hard    9.94   GAC
2D Object Detection  KITTI Cars Moderate  AP Medium  13.17  GAC
2D Object Detection  KITTI Cars Hard      AP Hard    9.94   GAC
16k                  KITTI Cars Moderate  AP Medium  13.17  GAC
16k                  KITTI Cars Hard      AP Hard    9.94   GAC

Related Papers

GEMINUS: Dual-aware Global and Scene-Adaptive Mixture-of-Experts for End-to-End Autonomous Driving (2025-07-19)
AGENTS-LLM: Augmentative GENeration of Challenging Traffic Scenarios with an Agentic LLM Framework (2025-07-18)
World Model-Based End-to-End Scene Generation for Accident Anticipation in Autonomous Driving (2025-07-17)
Orbis: Overcoming Challenges of Long-Horizon Prediction in Driving World Models (2025-07-17)
Channel-wise Motion Features for Efficient Motion Segmentation (2025-07-17)
LaViPlan : Language-Guided Visual Path Planning with RLVR (2025-07-17)
$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning (2025-07-17)
Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark (2025-07-17)