Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


UniDepthV2: Universal Monocular Metric Depth Estimation Made Simpler

Luigi Piccinelli, Christos Sakaridis, Yung-Hsu Yang, Mattia Segu, Siyuan Li, Wim Abbeloos, Luc van Gool

2025-02-27 · Depth Estimation · Monocular Depth Estimation
Paper · PDF · Code (official)

Abstract

Accurate monocular metric depth estimation (MMDE) is crucial to solving downstream tasks in 3D perception and modeling. However, the remarkable accuracy of recent MMDE methods is confined to their training domains. These methods fail to generalize to unseen domains even in the presence of moderate domain gaps, which hinders their practical applicability. We propose a new model, UniDepthV2, capable of reconstructing metric 3D scenes from solely single images across domains. Departing from the existing MMDE paradigm, UniDepthV2 directly predicts metric 3D points from the input image at inference time without any additional information, striving for a universal and flexible MMDE solution. In particular, UniDepthV2 implements a self-promptable camera module predicting a dense camera representation to condition depth features. Our model exploits a pseudo-spherical output representation, which disentangles the camera and depth representations. In addition, we propose a geometric invariance loss that promotes the invariance of camera-prompted depth features. UniDepthV2 improves its predecessor UniDepth model via a new edge-guided loss which enhances the localization and sharpness of edges in the metric depth outputs, a revisited, simplified and more efficient architectural design, and an additional uncertainty-level output which enables downstream tasks requiring confidence. Thorough evaluations on ten depth datasets in a zero-shot regime consistently demonstrate the superior performance and generalization of UniDepthV2. Code and models are available at https://github.com/lpiccinelli-eth/UniDepth
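The abstract describes recovering a metric 3D scene from a single image: the model predicts both a dense camera representation and metric depth, from which 3D points follow by back-projection. As a minimal, self-contained sketch of that last step (standard pinhole back-projection with numpy, not the paper's pseudo-spherical representation; `backproject`, its arguments, and the toy inputs are illustrative assumptions, not the UniDepth API):

```python
import numpy as np

def backproject(depth, K):
    """Lift a metric depth map to a 3D point cloud in camera coordinates.

    depth: (H, W) array of metric depths along the optical axis.
    K:     (3, 3) pinhole intrinsics matrix.
    Returns an (H, W, 3) array of 3D points.
    """
    H, W = depth.shape
    u, v = np.meshgrid(np.arange(W), np.arange(H))
    # Homogeneous pixel coordinates, shape (3, H*W).
    pix = np.stack([u, v, np.ones_like(u)], axis=-1).reshape(-1, 3).T
    # Unit-z viewing rays: K^{-1} [u, v, 1]^T.
    rays = np.linalg.inv(K) @ pix.astype(float)
    # Scale each ray by its metric depth (rays have z = 1).
    pts = rays * depth.reshape(1, -1)
    return pts.T.reshape(H, W, 3)

# Toy example: identity intrinsics, unit depth everywhere.
points = backproject(np.ones((2, 2)), np.eye(3))
```

UniDepthV2's contribution is that both inputs to this step, the depth map and the camera, are predicted from the image alone; the disentangled output representation means either can be swapped for ground truth when available.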

Results

Task | Dataset | Metric | Value | Model
Depth Estimation | NYU-Depth V2 | Delta < 1.25 | 0.988 | UniDepthV2 (FT, metric)
Depth Estimation | NYU-Depth V2 | Delta < 1.25^2 | 0.998 | UniDepthV2 (FT, metric)
Depth Estimation | NYU-Depth V2 | Delta < 1.25^3 | 1 | UniDepthV2 (FT, metric)
Depth Estimation | NYU-Depth V2 | RMSE | 0.18 | UniDepthV2 (FT, metric)
Depth Estimation | NYU-Depth V2 | absolute relative error | 0.046 | UniDepthV2 (FT, metric)
Depth Estimation | NYU-Depth V2 | log 10 | 0.02 | UniDepthV2 (FT, metric)
Depth Estimation | KITTI Eigen split | Delta < 1.25 | 0.989 | UniDepthV2 (FT, metric)
Depth Estimation | KITTI Eigen split | Delta < 1.25^2 | 0.998 | UniDepthV2 (FT, metric)
Depth Estimation | KITTI Eigen split | Delta < 1.25^3 | 0.999 | UniDepthV2 (FT, metric)
Depth Estimation | KITTI Eigen split | RMSE | 1.71 | UniDepthV2 (FT, metric)
Depth Estimation | KITTI Eigen split | RMSE log | 0.061 | UniDepthV2 (FT, metric)
Depth Estimation | KITTI Eigen split | absolute relative error | 0.037 | UniDepthV2 (FT, metric)
3D | NYU-Depth V2 | Delta < 1.25 | 0.988 | UniDepthV2 (FT, metric)
3D | NYU-Depth V2 | Delta < 1.25^2 | 0.998 | UniDepthV2 (FT, metric)
3D | NYU-Depth V2 | Delta < 1.25^3 | 1 | UniDepthV2 (FT, metric)
3D | NYU-Depth V2 | RMSE | 0.18 | UniDepthV2 (FT, metric)
3D | NYU-Depth V2 | absolute relative error | 0.046 | UniDepthV2 (FT, metric)
3D | NYU-Depth V2 | log 10 | 0.02 | UniDepthV2 (FT, metric)
3D | KITTI Eigen split | Delta < 1.25 | 0.989 | UniDepthV2 (FT, metric)
3D | KITTI Eigen split | Delta < 1.25^2 | 0.998 | UniDepthV2 (FT, metric)
3D | KITTI Eigen split | Delta < 1.25^3 | 0.999 | UniDepthV2 (FT, metric)
3D | KITTI Eigen split | RMSE | 1.71 | UniDepthV2 (FT, metric)
3D | KITTI Eigen split | RMSE log | 0.061 | UniDepthV2 (FT, metric)
3D | KITTI Eigen split | absolute relative error | 0.037 | UniDepthV2 (FT, metric)
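The metrics above are the standard monocular depth evaluation suite: accuracy under threshold (the fraction of pixels where max(gt/pred, pred/gt) falls below 1.25, 1.25², 1.25³), absolute relative error, RMSE, RMSE in log space, and mean log10 error. A minimal numpy sketch of how they are conventionally computed over valid (positive-depth) pixels; the function name and structure here are illustrative, not taken from the UniDepth evaluation code:

```python
import numpy as np

def depth_metrics(gt, pred):
    """Standard monocular depth metrics over matched arrays of positive depths."""
    gt = np.asarray(gt, dtype=float)
    pred = np.asarray(pred, dtype=float)
    # Threshold accuracy: ratio of pixels within a factor of 1.25^k of ground truth.
    ratio = np.maximum(gt / pred, pred / gt)
    d1 = float((ratio < 1.25).mean())
    d2 = float((ratio < 1.25 ** 2).mean())
    d3 = float((ratio < 1.25 ** 3).mean())
    abs_rel = float((np.abs(gt - pred) / gt).mean())          # absolute relative error
    rmse = float(np.sqrt(((gt - pred) ** 2).mean()))          # RMSE (metric units)
    rmse_log = float(np.sqrt(((np.log(gt) - np.log(pred)) ** 2).mean()))  # RMSE log
    log10 = float(np.abs(np.log10(gt) - np.log10(pred)).mean())           # log 10
    return {"d1": d1, "d2": d2, "d3": d3, "abs_rel": abs_rel,
            "rmse": rmse, "rmse_log": rmse_log, "log10": log10}

# A perfect prediction scores d1 = 1 and zero error on every metric.
scores = depth_metrics([1.0, 2.0, 4.0], [1.0, 2.0, 4.0])
```

Note that RMSE is in metric units (meters on NYU and KITTI), which is why the KITTI value (1.71) is an order of magnitude larger than the indoor NYU value (0.18) despite comparable threshold accuracies.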

Related Papers

$S^2M^2$: Scalable Stereo Matching Model for Reliable Depth Estimation (2025-07-17)
$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning (2025-07-17)
Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation (2025-07-16)
Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios (2025-07-16)
MonoMVSNet: Monocular Priors Guided Multi-View Stereo Network (2025-07-15)
Towards Depth Foundation Model: Recent Trends in Vision-Based Depth Estimation (2025-07-15)
Cameras as Relative Positional Encoding (2025-07-14)
ByDeWay: Boost Your multimodal LLM with DEpth prompting in a Training-Free Way (2025-07-11)