Boosting Omnidirectional Stereo Matching with a Pre-trained Depth Foundation Model

Jannik Endres, Oliver Hahn, Charles Corbière, Simone Schaub-Meyer, Stefan Roth, Alexandre Alahi

2025-03-30Omnnidirectional Stereo Depth Estimation Stereo Matching Stereo Depth Estimation Scene Understanding Depth Estimation Monocular Depth Estimation

Paper PDF Code(official)

Abstract

Omnidirectional depth perception is essential for mobile robotics applications that require scene understanding across a full 360{\deg} field of view. Camera-based setups offer a cost-effective option by using stereo depth estimation to generate dense, high-resolution depth maps without relying on expensive active sensing. However, existing omnidirectional stereo matching approaches achieve only limited depth accuracy across diverse environments, depth ranges, and lighting conditions, due to the scarcity of real-world data. We present DFI-OmniStereo, a novel omnidirectional stereo matching method that leverages a large-scale pre-trained foundation model for relative monocular depth estimation within an iterative optimization-based stereo matching architecture. We introduce a dedicated two-stage training strategy to utilize the relative monocular depth features for our omnidirectional stereo matching before scale-invariant fine-tuning. DFI-OmniStereo achieves state-of-the-art results on the real-world Helvipad dataset, reducing disparity MAE by approximately 16% compared to the previous best omnidirectional stereo method.

Results

Task	Dataset	Metric	Value	Model
Depth Estimation	Helvipad	Depth-LRCE	0.397	DFI-OmniStereo
Depth Estimation	Helvipad	Depth-MAE	1.463	DFI-OmniStereo
Depth Estimation	Helvipad	Depth-MARE	0.108	DFI-OmniStereo
Depth Estimation	Helvipad	Depth-RMSE	3.767	DFI-OmniStereo
Depth Estimation	Helvipad	Disp-LRCE	0.058	DFI-OmniStereo
Depth Estimation	Helvipad	Disp-MAE	0.158	DFI-OmniStereo
Depth Estimation	Helvipad	Disp-MARE	0.12	DFI-OmniStereo
Depth Estimation	Helvipad	Disp-RMSE	0.338	DFI-OmniStereo
3D	Helvipad	Depth-LRCE	0.397	DFI-OmniStereo
3D	Helvipad	Depth-MAE	1.463	DFI-OmniStereo
3D	Helvipad	Depth-MARE	0.108	DFI-OmniStereo
3D	Helvipad	Depth-RMSE	3.767	DFI-OmniStereo
3D	Helvipad	Disp-LRCE	0.058	DFI-OmniStereo
3D	Helvipad	Disp-MAE	0.158	DFI-OmniStereo
3D	Helvipad	Disp-MARE	0.12	DFI-OmniStereo
3D	Helvipad	Disp-RMSE	0.338	DFI-OmniStereo
Stereo Depth Estimation	Helvipad	Depth-LRCE	0.397	DFI-OmniStereo
Stereo Depth Estimation	Helvipad	Depth-MAE	1.463	DFI-OmniStereo
Stereo Depth Estimation	Helvipad	Depth-MARE	0.108	DFI-OmniStereo
Stereo Depth Estimation	Helvipad	Depth-RMSE	3.767	DFI-OmniStereo
Stereo Depth Estimation	Helvipad	Disp-LRCE	0.058	DFI-OmniStereo
Stereo Depth Estimation	Helvipad	Disp-MAE	0.158	DFI-OmniStereo
Stereo Depth Estimation	Helvipad	Disp-MARE	0.12	DFI-OmniStereo
Stereo Depth Estimation	Helvipad	Disp-RMSE	0.338	DFI-OmniStereo

Boosting Omnidirectional Stereo Matching with a Pre-trained Depth Foundation Model

Abstract

Results

Related Papers

Boosting Omnidirectional Stereo Matching with a Pre-trained Depth Foundation Model

Abstract

Results

Related Papers