TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Object-Centric Stereo Matching for 3D Object Detection

Object-Centric Stereo Matching for 3D Object Detection

Alex D. Pon, Jason Ku, Chengyao Li, Steven L. Waslander

2019-09-17Stereo MatchingStereo Matching Hand3D Object Detection From Stereo ImagesAutonomous DrivingDisparity Estimationobject-detection3D Object DetectionObject Detection
PaperPDF

Abstract

Safe autonomous driving requires reliable 3D object detection-determining the 6 DoF pose and dimensions of objects of interest. Using stereo cameras to solve this task is a cost-effective alternative to the widely used LiDAR sensor. The current state-of-the-art for stereo 3D object detection takes the existing PSMNet stereo matching network, with no modifications, and converts the estimated disparities into a 3D point cloud, and feeds this point cloud into a LiDAR-based 3D object detector. The issue with existing stereo matching networks is that they are designed for disparity estimation, not 3D object detection; the shape and accuracy of object point clouds are not the focus. Stereo matching networks commonly suffer from inaccurate depth estimates at object boundaries, which we define as streaking, because background and foreground points are jointly estimated. Existing networks also penalize disparity instead of the estimated position of object point clouds in their loss functions. We propose a novel 2D box association and object-centric stereo matching method that only estimates the disparities of the objects of interest to address these two issues. Our method achieves state-of-the-art results on the KITTI 3D and BEV benchmarks.

Results

TaskDatasetMetricValueModel
Object DetectionKITTI Cars ModerateAP7537.6OC-Stereo
Object DetectionKITTI Cyclists ModerateAP5016.63OC-Stereo
3DKITTI Cars ModerateAP7537.6OC-Stereo
3DKITTI Cyclists ModerateAP5016.63OC-Stereo
3D Object DetectionKITTI Cars ModerateAP7537.6OC-Stereo
3D Object DetectionKITTI Cyclists ModerateAP5016.63OC-Stereo
2D ClassificationKITTI Cars ModerateAP7537.6OC-Stereo
2D ClassificationKITTI Cyclists ModerateAP5016.63OC-Stereo
2D Object DetectionKITTI Cars ModerateAP7537.6OC-Stereo
2D Object DetectionKITTI Cyclists ModerateAP5016.63OC-Stereo
16kKITTI Cars ModerateAP7537.6OC-Stereo
16kKITTI Cyclists ModerateAP5016.63OC-Stereo

Related Papers

GEMINUS: Dual-aware Global and Scene-Adaptive Mixture-of-Experts for End-to-End Autonomous Driving2025-07-19AGENTS-LLM: Augmentative GENeration of Challenging Traffic Scenarios with an Agentic LLM Framework2025-07-18$S^2M^2$: Scalable Stereo Matching Model for Reliable Depth Estimation2025-07-17World Model-Based End-to-End Scene Generation for Accident Anticipation in Autonomous Driving2025-07-17Orbis: Overcoming Challenges of Long-Horizon Prediction in Driving World Models2025-07-17Channel-wise Motion Features for Efficient Motion Segmentation2025-07-17LaViPlan : Language-Guided Visual Path Planning with RLVR2025-07-17A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17