TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Stereo R-CNN based 3D Object Detection for Autonomous Driv...

Stereo R-CNN based 3D Object Detection for Autonomous Driving

Peiliang Li, Xiaozhi Chen, Shaojie Shen

2019-02-26CVPR 2019 6Region Proposal3D Object Detection From Stereo ImagesAutonomous Drivingobject-detection3D Object DetectionObject Detection
PaperPDFCodeCode(official)CodeCode

Abstract

We propose a 3D object detection method for autonomous driving by fully exploiting the sparse and dense, semantic and geometry information in stereo imagery. Our method, called Stereo R-CNN, extends Faster R-CNN for stereo inputs to simultaneously detect and associate object in left and right images. We add extra branches after stereo Region Proposal Network (RPN) to predict sparse keypoints, viewpoints, and object dimensions, which are combined with 2D left-right boxes to calculate a coarse 3D object bounding box. We then recover the accurate 3D bounding box by a region-based photometric alignment using left and right RoIs. Our method does not require depth input and 3D position supervision, however, outperforms all existing fully supervised image-based methods. Experiments on the challenging KITTI dataset show that our method outperforms the state-of-the-art stereo-based method by around 30% AP on both 3D detection and 3D localization tasks. Code has been released at https://github.com/HKUST-Aerial-Robotics/Stereo-RCNN.

Results

TaskDatasetMetricValueModel
Object DetectionKITTI Cars ModerateAP7530.23Stereo R-CNN
3DKITTI Cars ModerateAP7530.23Stereo R-CNN
3D Object DetectionKITTI Cars ModerateAP7530.23Stereo R-CNN
2D ClassificationKITTI Cars ModerateAP7530.23Stereo R-CNN
2D Object DetectionKITTI Cars ModerateAP7530.23Stereo R-CNN
16kKITTI Cars ModerateAP7530.23Stereo R-CNN

Related Papers

GEMINUS: Dual-aware Global and Scene-Adaptive Mixture-of-Experts for End-to-End Autonomous Driving2025-07-19AGENTS-LLM: Augmentative GENeration of Challenging Traffic Scenarios with an Agentic LLM Framework2025-07-18World Model-Based End-to-End Scene Generation for Accident Anticipation in Autonomous Driving2025-07-17Orbis: Overcoming Challenges of Long-Horizon Prediction in Driving World Models2025-07-17Channel-wise Motion Features for Efficient Motion Segmentation2025-07-17LaViPlan : Language-Guided Visual Path Planning with RLVR2025-07-17A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17