TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/LIGA-Stereo: Learning LiDAR Geometry Aware Representations...

LIGA-Stereo: Learning LiDAR Geometry Aware Representations for Stereo-based 3D Detector

Xiaoyang Guo, Shaoshuai Shi, Xiaogang Wang, Hongsheng Li

2021-08-18ICCV 2021 10Stereo Matching3D geometry3D Object Detection From Stereo Images
PaperPDFCode(official)

Abstract

Stereo-based 3D detection aims at detecting 3D object bounding boxes from stereo images using intermediate depth maps or implicit 3D geometry representations, which provides a low-cost solution for 3D perception. However, its performance is still inferior compared with LiDAR-based detection algorithms. To detect and localize accurate 3D bounding boxes, LiDAR-based models can encode accurate object boundaries and surface normal directions from LiDAR point clouds. However, the detection results of stereo-based detectors are easily affected by the erroneous depth features due to the limitation of stereo matching. To solve the problem, we propose LIGA-Stereo (LiDAR Geometry Aware Stereo Detector) to learn stereo-based 3D detectors under the guidance of high-level geometry-aware representations of LiDAR-based detection models. In addition, we found existing voxel-based stereo detectors failed to learn semantic features effectively from indirect 3D supervisions. We attach an auxiliary 2D detection head to provide direct 2D semantic supervisions. Experiment results show that the above two strategies improved the geometric and semantic representation capabilities. Compared with the state-of-the-art stereo detector, our method has improved the 3D detection performance of cars, pedestrians, cyclists by 10.44%, 5.69%, 5.97% mAP respectively on the official KITTI benchmark. The gap between stereo-based and LiDAR-based 3D detectors is further narrowed.

Results

TaskDatasetMetricValueModel
Object DetectionKITTI Cars ModerateAP7564.66LIGA-Stereo
Object DetectionKITTI Cyclists ModerateAP5036.86LIGA-Stereo
Object DetectionKITTI Pedestrians ModerateAP5030LIGA-Stereo
3DKITTI Cars ModerateAP7564.66LIGA-Stereo
3DKITTI Cyclists ModerateAP5036.86LIGA-Stereo
3DKITTI Pedestrians ModerateAP5030LIGA-Stereo
3D Object DetectionKITTI Cars ModerateAP7564.66LIGA-Stereo
3D Object DetectionKITTI Cyclists ModerateAP5036.86LIGA-Stereo
3D Object DetectionKITTI Pedestrians ModerateAP5030LIGA-Stereo
2D ClassificationKITTI Cars ModerateAP7564.66LIGA-Stereo
2D ClassificationKITTI Cyclists ModerateAP5036.86LIGA-Stereo
2D ClassificationKITTI Pedestrians ModerateAP5030LIGA-Stereo
2D Object DetectionKITTI Cars ModerateAP7564.66LIGA-Stereo
2D Object DetectionKITTI Cyclists ModerateAP5036.86LIGA-Stereo
2D Object DetectionKITTI Pedestrians ModerateAP5030LIGA-Stereo
16kKITTI Cars ModerateAP7564.66LIGA-Stereo
16kKITTI Cyclists ModerateAP5036.86LIGA-Stereo
16kKITTI Pedestrians ModerateAP5030LIGA-Stereo

Related Papers

$S^2M^2$: Scalable Stereo Matching Model for Reliable Depth Estimation2025-07-17Robust 3D-Masked Part-level Editing in 3D Gaussian Splatting with Regularized Score Distillation Sampling2025-07-15TRAN-D: 2D Gaussian Splatting-based Sparse-view Transparent Object Depth Reconstruction via Physics Simulation for Scene Update2025-07-15Feed-Forward SceneDINO for Unsupervised Semantic Scene Completion2025-07-08DreamGrasp: Zero-Shot 3D Multi-Object Reconstruction from Partial-View Images for Robotic Manipulation2025-07-08Learning Robust Stereo Matching in the Wild with Selective Mixture-of-Experts2025-07-07RobuSTereo: Robust Zero-Shot Stereo Matching under Adverse Weather2025-07-02RoboScape: Physics-informed Embodied World Model2025-06-29