TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Wasserstein Distances for Stereo Disparity Estimation

Wasserstein Distances for Stereo Disparity Estimation

Divyansh Garg, Yan Wang, Bharath Hariharan, Mark Campbell, Kilian Q. Weinberger, Wei-Lun Chao

2020-07-06NeurIPS 2020 12Stereo Depth EstimationStereo Disparity Estimation3D Object Detection From Stereo ImagesAutonomous DrivingDisparity EstimationDepth Estimationobject-detection3D Object DetectionObject Detection
PaperPDFCode(official)

Abstract

Existing approaches to depth or disparity estimation output a distribution over a set of pre-defined discrete values. This leads to inaccurate results when the true depth or disparity does not match any of these values. The fact that this distribution is usually learned indirectly through a regression loss causes further problems in ambiguous regions around object boundaries. We address these issues using a new neural network architecture that is capable of outputting arbitrary depth values, and a new loss function that is derived from the Wasserstein distance between the true and the predicted distributions. We validate our approach on a variety of tasks, including stereo disparity and depth estimation, and the downstream 3D object detection. Our approach drastically reduces the error in ambiguous regions, especially around object boundaries that greatly affect the localization of objects in 3D, achieving the state-of-the-art in 3D object detection for autonomous driving. Our code will be available at https://github.com/Div99/W-Stereo-Disp.

Results

TaskDatasetMetricValueModel
Depth EstimationKITTI2015three pixel error1.92CDN-GANet Deep
Object DetectionKITTI Cars ModerateAP7554.2CDN-DSGN
3DKITTI Cars ModerateAP7554.2CDN-DSGN
3DKITTI2015three pixel error1.92CDN-GANet Deep
3D Object DetectionKITTI Cars ModerateAP7554.2CDN-DSGN
2D ClassificationKITTI Cars ModerateAP7554.2CDN-DSGN
2D Object DetectionKITTI Cars ModerateAP7554.2CDN-DSGN
Stereo Disparity EstimationScene FlowEPE0.7CDN-GANet Deep
Stereo Disparity EstimationScene Flowone pixel error7.7CDN-GANet Deep
Stereo Disparity EstimationScene Flowthree pixel error2.98CDN-GANet Deep
16kKITTI Cars ModerateAP7554.2CDN-DSGN
Stereo Depth EstimationKITTI2015three pixel error1.92CDN-GANet Deep

Related Papers

GEMINUS: Dual-aware Global and Scene-Adaptive Mixture-of-Experts for End-to-End Autonomous Driving2025-07-19AGENTS-LLM: Augmentative GENeration of Challenging Traffic Scenarios with an Agentic LLM Framework2025-07-18World Model-Based End-to-End Scene Generation for Accident Anticipation in Autonomous Driving2025-07-17Orbis: Overcoming Challenges of Long-Horizon Prediction in Driving World Models2025-07-17Channel-wise Motion Features for Efficient Motion Segmentation2025-07-17LaViPlan : Language-Guided Visual Path Planning with RLVR2025-07-17$S^2M^2$: Scalable Stereo Matching Model for Reliable Depth Estimation2025-07-17$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17