TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/DSGN: Deep Stereo Geometry Network for 3D Object Detection

DSGN: Deep Stereo Geometry Network for 3D Object Detection

Yilun Chen, Shu Liu, Xiaoyong Shen, Jiaya Jia

2020-01-10CVPR 2020 63D Object Detection From Stereo ImagesVehicle Pose Estimationobject-detection3D Object DetectionObject Detection
PaperPDFCode(official)

Abstract

Most state-of-the-art 3D object detectors heavily rely on LiDAR sensors because there is a large performance gap between image-based and LiDAR-based methods. It is caused by the way to form representation for the prediction in 3D scenarios. Our method, called Deep Stereo Geometry Network (DSGN), significantly reduces this gap by detecting 3D objects on a differentiable volumetric representation -- 3D geometric volume, which effectively encodes 3D geometric structure for 3D regular space. With this representation, we learn depth information and semantic cues simultaneously. For the first time, we provide a simple and effective one-stage stereo-based 3D detection pipeline that jointly estimates the depth and detects 3D objects in an end-to-end learning manner. Our approach outperforms previous stereo-based 3D detectors (about 10 higher in terms of AP) and even achieves comparable performance with several LiDAR-based methods on the KITTI 3D object detection leaderboard. Our code is publicly available at https://github.com/chenyilun95/DSGN.

Results

TaskDatasetMetricValueModel
Pose EstimationKITTI Cars HardAverage Orientation Similarity78.27DSGN (Stereo)
Object DetectionKITTI Cars ModerateAP7552.18DSGN
Object DetectionKITTI Cyclists ModerateAP5018.17DSGN
Object DetectionKITTI Pedestrians ModerateAP5015.55DSGN
3DKITTI Cars ModerateAP7552.18DSGN
3DKITTI Cyclists ModerateAP5018.17DSGN
3DKITTI Pedestrians ModerateAP5015.55DSGN
3DKITTI Cars HardAverage Orientation Similarity78.27DSGN (Stereo)
3D Object DetectionKITTI Cars ModerateAP7552.18DSGN
3D Object DetectionKITTI Cyclists ModerateAP5018.17DSGN
3D Object DetectionKITTI Pedestrians ModerateAP5015.55DSGN
2D ClassificationKITTI Cars ModerateAP7552.18DSGN
2D ClassificationKITTI Cyclists ModerateAP5018.17DSGN
2D ClassificationKITTI Pedestrians ModerateAP5015.55DSGN
2D Object DetectionKITTI Cars ModerateAP7552.18DSGN
2D Object DetectionKITTI Cyclists ModerateAP5018.17DSGN
2D Object DetectionKITTI Pedestrians ModerateAP5015.55DSGN
1 Image, 2*2 StitchiKITTI Cars HardAverage Orientation Similarity78.27DSGN (Stereo)
16kKITTI Cars ModerateAP7552.18DSGN
16kKITTI Cyclists ModerateAP5018.17DSGN
16kKITTI Pedestrians ModerateAP5015.55DSGN

Related Papers

A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17Decoupled PROB: Decoupled Query Initialization Tasks and Objectness-Class Learning for Open World Object Detection2025-07-17Dual LiDAR-Based Traffic Movement Count Estimation at a Signalized Intersection: Deployment, Data Collection, and Preliminary Analysis2025-07-17Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15ECORE: Energy-Conscious Optimized Routing for Deep Learning Models at the Edge2025-07-08Beyond One Shot, Beyond One Perspective: Cross-View and Long-Horizon Distillation for Better LiDAR Representations2025-07-07