Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

YOLOStereo3D: A Step Back to 2D for Efficient Stereo 3D Detection

Yuxuan Liu, Lujia Wang, Ming Liu

2021-03-17
Tasks: Stereo Matching, 3D Object Detection From Stereo Images, Disparity Estimation, object-detection, 3D Object Detection, Object Detection

Abstract

Object detection in 3D with stereo cameras is an important problem in computer vision, and is particularly crucial in low-cost autonomous mobile robots without LiDARs. Nowadays, most of the best-performing frameworks for stereo 3D object detection are based on dense depth reconstruction from disparity estimation, making them extremely computationally expensive. To enable real-world deployments of vision detection with binocular images, we take a step back to gain insights from 2D image-based detection frameworks and enhance them with stereo features. We incorporate knowledge and the inference structure from real-time one-stage 2D/3D object detectors and introduce a lightweight stereo matching module. Our proposed framework, YOLOStereo3D, is trained on a single GPU and runs at more than ten fps. It demonstrates performance comparable to state-of-the-art stereo 3D detection frameworks without using LiDAR data. The code will be published at https://github.com/Owen-Liuyuxuan/visualDet3D.
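
The abstract contrasts YOLOStereo3D with pipelines that reconstruct dense depth from estimated disparity. As a minimal sketch of the relationship those pipelines rely on (this is not code from the paper or from the visualDet3D repository), depth for a rectified stereo pair follows Z = f * B / d, where f is the focal length in pixels, B the baseline in metres, and d the disparity in pixels. The function name and the camera constants below are illustrative assumptions, chosen to be roughly KITTI-like.

```python
def disparity_to_depth(disparity_px, focal_px, baseline_m):
    """Depth in metres for a rectified stereo pair: Z = f * B / d."""
    if disparity_px <= 0:
        # Non-positive disparity has no finite depth; treat it as "at infinity".
        return float("inf")
    return focal_px * baseline_m / disparity_px


# Assumed, illustrative camera parameters (approximately KITTI-like).
FOCAL_PX = 721.5    # focal length in pixels
BASELINE_M = 0.54   # distance between the two cameras in metres

if __name__ == "__main__":
    for d in (64.0, 16.0, 4.0):
        z = disparity_to_depth(d, FOCAL_PX, BASELINE_M)
        print(f"disparity {d:5.1f} px -> depth {z:6.2f} m")
```

Because depth is inversely proportional to disparity, a fixed disparity error costs far more depth accuracy for distant objects than for near ones, which is one reason dense depth reconstruction tends to need accurate, and therefore expensive, matching.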

Results

Task                 Dataset                     Metric  Value  Model
Object Detection     KITTI Cars Moderate         AP75    41.25  YoLoStereo3D
Object Detection     KITTI Pedestrians Moderate  AP50    19.75  YoLoStereo3D
3D                   KITTI Cars Moderate         AP75    41.25  YoLoStereo3D
3D                   KITTI Pedestrians Moderate  AP50    19.75  YoLoStereo3D
3D Object Detection  KITTI Cars Moderate         AP75    41.25  YoLoStereo3D
3D Object Detection  KITTI Pedestrians Moderate  AP50    19.75  YoLoStereo3D
2D Classification    KITTI Cars Moderate         AP75    41.25  YoLoStereo3D
2D Classification    KITTI Pedestrians Moderate  AP50    19.75  YoLoStereo3D
2D Object Detection  KITTI Cars Moderate         AP75    41.25  YoLoStereo3D
2D Object Detection  KITTI Pedestrians Moderate  AP50    19.75  YoLoStereo3D
16k                  KITTI Cars Moderate         AP75    41.25  YoLoStereo3D
16k                  KITTI Pedestrians Moderate  AP50    19.75  YoLoStereo3D

Related Papers

$S^2M^2$: Scalable Stereo Matching Model for Reliable Depth Estimation (2025-07-17)
A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains (2025-07-17)
RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images (2025-07-17)
Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection (2025-07-17)
Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis (2025-07-17)
Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios (2025-07-16)
Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping (2025-07-15)
ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge (2025-07-08)