UniDepth: Universal Monocular Metric Depth Estimation

Luigi Piccinelli, Yung-Hsu Yang, Christos Sakaridis, Mattia Segu, Siyuan Li, Luc van Gool, Fisher Yu

2024-03-27CVPR 2024 1Depth Estimation Monocular Depth Estimation

Abstract

Accurate monocular metric depth estimation (MMDE) is crucial to solving downstream tasks in 3D perception and modeling. However, the remarkable accuracy of recent MMDE methods is confined to their training domains. These methods fail to generalize to unseen domains even in the presence of moderate domain gaps, which hinders their practical applicability. We propose a new model, UniDepth, capable of reconstructing metric 3D scenes from solely single images across domains. Departing from the existing MMDE methods, UniDepth directly predicts metric 3D points from the input image at inference time without any additional information, striving for a universal and flexible MMDE solution. In particular, UniDepth implements a self-promptable camera module predicting dense camera representation to condition depth features. Our model exploits a pseudo-spherical output representation, which disentangles camera and depth representations. In addition, we propose a geometric invariance loss that promotes the invariance of camera-prompted depth features. Thorough evaluations on ten datasets in a zero-shot regime consistently demonstrate the superior performance of UniDepth, even when compared with methods directly trained on the testing domains. Code and models are available at: https://github.com/lpiccinelli-eth/unidepth

Results

Task	Dataset	Metric	Value	Model
Depth Estimation	NYU-Depth V2	Delta < 1.25	0.984	UniDepth (Zero-shot)
Depth Estimation	NYU-Depth V2	Delta < 1.25^2	0.997	UniDepth (Zero-shot)
Depth Estimation	NYU-Depth V2	Delta < 1.25^3	0.999	UniDepth (Zero-shot)
Depth Estimation	NYU-Depth V2	RMSE	0.201	UniDepth (Zero-shot)
Depth Estimation	NYU-Depth V2	absolute relative error	0.058	UniDepth (Zero-shot)
Depth Estimation	NYU-Depth V2	log 10	0.024	UniDepth (Zero-shot)
Depth Estimation	KITTI Eigen split	Delta < 1.25	0.986	UniDepth (Zero-shot)
Depth Estimation	KITTI Eigen split	Delta < 1.25^2	0.998	UniDepth (Zero-shot)
Depth Estimation	KITTI Eigen split	Delta < 1.25^3	0.999	UniDepth (Zero-shot)
Depth Estimation	KITTI Eigen split	RMSE	1.75	UniDepth (Zero-shot)
Depth Estimation	KITTI Eigen split	RMSE log	0.064	UniDepth (Zero-shot)
Depth Estimation	KITTI Eigen split	absolute relative error	0.042	UniDepth (Zero-shot)
3D	NYU-Depth V2	Delta < 1.25	0.984	UniDepth (Zero-shot)
3D	NYU-Depth V2	Delta < 1.25^2	0.997	UniDepth (Zero-shot)
3D	NYU-Depth V2	Delta < 1.25^3	0.999	UniDepth (Zero-shot)
3D	NYU-Depth V2	RMSE	0.201	UniDepth (Zero-shot)
3D	NYU-Depth V2	absolute relative error	0.058	UniDepth (Zero-shot)
3D	NYU-Depth V2	log 10	0.024	UniDepth (Zero-shot)
3D	KITTI Eigen split	Delta < 1.25	0.986	UniDepth (Zero-shot)
3D	KITTI Eigen split	Delta < 1.25^2	0.998	UniDepth (Zero-shot)
3D	KITTI Eigen split	Delta < 1.25^3	0.999	UniDepth (Zero-shot)
3D	KITTI Eigen split	RMSE	1.75	UniDepth (Zero-shot)
3D	KITTI Eigen split	RMSE log	0.064	UniDepth (Zero-shot)
3D	KITTI Eigen split	absolute relative error	0.042	UniDepth (Zero-shot)

UniDepth: Universal Monocular Metric Depth Estimation

Abstract

Results

Related Papers

UniDepth: Universal Monocular Metric Depth Estimation

Abstract

Results

Related Papers