TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Boosting Omnidirectional Stereo Matching with a Pre-traine...

Boosting Omnidirectional Stereo Matching with a Pre-trained Depth Foundation Model

Jannik Endres, Oliver Hahn, Charles Corbière, Simone Schaub-Meyer, Stefan Roth, Alexandre Alahi

2025-03-30Omnnidirectional Stereo Depth EstimationStereo MatchingStereo Depth EstimationScene UnderstandingDepth EstimationMonocular Depth Estimation
PaperPDFCode(official)

Abstract

Omnidirectional depth perception is essential for mobile robotics applications that require scene understanding across a full 360{\deg} field of view. Camera-based setups offer a cost-effective option by using stereo depth estimation to generate dense, high-resolution depth maps without relying on expensive active sensing. However, existing omnidirectional stereo matching approaches achieve only limited depth accuracy across diverse environments, depth ranges, and lighting conditions, due to the scarcity of real-world data. We present DFI-OmniStereo, a novel omnidirectional stereo matching method that leverages a large-scale pre-trained foundation model for relative monocular depth estimation within an iterative optimization-based stereo matching architecture. We introduce a dedicated two-stage training strategy to utilize the relative monocular depth features for our omnidirectional stereo matching before scale-invariant fine-tuning. DFI-OmniStereo achieves state-of-the-art results on the real-world Helvipad dataset, reducing disparity MAE by approximately 16% compared to the previous best omnidirectional stereo method.

Results

TaskDatasetMetricValueModel
Depth EstimationHelvipadDepth-LRCE0.397DFI-OmniStereo
Depth EstimationHelvipadDepth-MAE1.463DFI-OmniStereo
Depth EstimationHelvipadDepth-MARE0.108DFI-OmniStereo
Depth EstimationHelvipadDepth-RMSE3.767DFI-OmniStereo
Depth EstimationHelvipadDisp-LRCE0.058DFI-OmniStereo
Depth EstimationHelvipadDisp-MAE0.158DFI-OmniStereo
Depth EstimationHelvipadDisp-MARE0.12DFI-OmniStereo
Depth EstimationHelvipadDisp-RMSE0.338DFI-OmniStereo
3DHelvipadDepth-LRCE0.397DFI-OmniStereo
3DHelvipadDepth-MAE1.463DFI-OmniStereo
3DHelvipadDepth-MARE0.108DFI-OmniStereo
3DHelvipadDepth-RMSE3.767DFI-OmniStereo
3DHelvipadDisp-LRCE0.058DFI-OmniStereo
3DHelvipadDisp-MAE0.158DFI-OmniStereo
3DHelvipadDisp-MARE0.12DFI-OmniStereo
3DHelvipadDisp-RMSE0.338DFI-OmniStereo
Stereo Depth EstimationHelvipadDepth-LRCE0.397DFI-OmniStereo
Stereo Depth EstimationHelvipadDepth-MAE1.463DFI-OmniStereo
Stereo Depth EstimationHelvipadDepth-MARE0.108DFI-OmniStereo
Stereo Depth EstimationHelvipadDepth-RMSE3.767DFI-OmniStereo
Stereo Depth EstimationHelvipadDisp-LRCE0.058DFI-OmniStereo
Stereo Depth EstimationHelvipadDisp-MAE0.158DFI-OmniStereo
Stereo Depth EstimationHelvipadDisp-MARE0.12DFI-OmniStereo
Stereo Depth EstimationHelvipadDisp-RMSE0.338DFI-OmniStereo

Related Papers

$S^2M^2$: Scalable Stereo Matching Model for Reliable Depth Estimation2025-07-17Advancing Complex Wide-Area Scene Understanding with Hierarchical Coresets Selection2025-07-17Argus: Leveraging Multiview Images for Improved 3-D Scene Understanding With Large Language Models2025-07-17City-VLM: Towards Multidomain Perception Scene Understanding via Multimodal Incomplete Learning2025-07-17$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation2025-07-16Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16Learning to Tune Like an Expert: Interpretable and Scene-Aware Navigation via MLLM Reasoning and CVAE-Based Adaptation2025-07-15