Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


Distill Any Depth: Distillation Creates a Stronger Monocular Depth Estimator

Xiankang He, Dongyan Guo, Hongji Li, Ruibo Li, Ying Cui, Chi Zhang

2025-02-26 · Scene Understanding · Depth Estimation · Monocular Depth Estimation

Paper · PDF · Code (official)

Abstract

Recent advances in zero-shot monocular depth estimation (MDE) have significantly improved generalization by unifying depth distributions through normalized depth representations and by leveraging large-scale unlabeled data via pseudo-label distillation. However, existing methods that rely on global depth normalization treat all depth values equally, which can amplify noise in pseudo-labels and reduce distillation effectiveness. In this paper, we present a systematic analysis of depth normalization strategies in the context of pseudo-label distillation. Our study shows that, under recent distillation paradigms (e.g., shared-context distillation), normalization is not always necessary, as omitting it can help mitigate the impact of noisy supervision. Furthermore, rather than focusing solely on how depth information is represented, we propose Cross-Context Distillation, which integrates both global and local depth cues to enhance pseudo-label quality. We also introduce an assistant-guided distillation strategy that incorporates complementary depth priors from a diffusion-based teacher model, enhancing supervision diversity and robustness. Extensive experiments on benchmark datasets demonstrate that our approach significantly outperforms state-of-the-art methods, both quantitatively and qualitatively.
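
To make the normalization discussion concrete, here is a minimal PyTorch sketch (not the authors' code) of two ingredients the abstract describes: an affine-invariant global normalization applied before the distillation loss, and a cross-context scheme in which the same image region is supervised by pseudo-labels predicted from both the full image and a local crop. The function names, the L1 loss choice, and the equal weighting of the two cross-context terms are illustrative assumptions; the official code gives the actual formulation.

import torch
import torch.nn.functional as F

def global_norm(d, eps=1e-6):
    # Affine-invariant global normalization of a depth map d of shape
    # (B, 1, H, W): subtract the per-image median, divide by the mean
    # absolute deviation. Because the statistics are computed over the
    # whole map, noisy pseudo-label regions shift them everywhere,
    # which is the failure mode the paper analyzes.
    b = d.shape[0]
    t = d.flatten(1).median(dim=1).values.view(b, 1, 1, 1)
    s = (d - t).abs().flatten(1).mean(dim=1).view(b, 1, 1, 1)
    return (d - t) / (s + eps)

def distill_loss(student_depth, pseudo_depth, normalize=True):
    # Pseudo-label distillation as a simple L1 loss, optionally after
    # global normalization of both maps.
    if normalize:
        student_depth = global_norm(student_depth)
        pseudo_depth = global_norm(pseudo_depth)
    return F.l1_loss(student_depth, pseudo_depth)

def cross_context_loss(student, teacher, image, crop):
    # Cross-Context Distillation, simplified: the student's prediction
    # on a region is supervised both by the teacher's prediction on the
    # full image (global context) and by its prediction on the crop
    # alone (local context). crop = (y0, x0, h, w); in practice the
    # crop would typically be resized to the network's input resolution.
    y0, x0, h, w = crop
    s_region = student(image)[:, :, y0:y0 + h, x0:x0 + w]
    with torch.no_grad():
        t_global = teacher(image)[:, :, y0:y0 + h, x0:x0 + w]
        t_local = teacher(image[:, :, y0:y0 + h, x0:x0 + w])
    return distill_loss(s_region, t_global) + distill_loss(s_region, t_local)

Passing normalize=False to distill_loss corresponds to the un-normalized variant that the paper finds can better tolerate noisy supervision under shared-context distillation.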

Results

Task             | Dataset      | Metric                  | Value | Model
Depth Estimation | ScanNetV2    | Delta < 1.25            | 0.98  | Distill Any Depth
Depth Estimation | ScanNetV2    | absolute relative error | 0.042 | Distill Any Depth
Depth Estimation | NYU-Depth V2 | Delta < 1.25            | 0.981 | Distill Any Depth
Depth Estimation | NYU-Depth V2 | absolute relative error | 0.043 | Distill Any Depth
Depth Estimation | ETH3D        | Delta < 1.25            | 0.981 | Distill Any Depth
Depth Estimation | ETH3D        | absolute relative error | 0.054 | Distill Any Depth
3D               | ScanNetV2    | Delta < 1.25            | 0.98  | Distill Any Depth
3D               | ScanNetV2    | absolute relative error | 0.042 | Distill Any Depth
3D               | NYU-Depth V2 | Delta < 1.25            | 0.981 | Distill Any Depth
3D               | NYU-Depth V2 | absolute relative error | 0.043 | Distill Any Depth
3D               | ETH3D        | Delta < 1.25            | 0.981 | Distill Any Depth
3D               | ETH3D        | absolute relative error | 0.054 | Distill Any Depth
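
The two metrics in the table have standard definitions in the monocular depth literature, sketched below in PyTorch. The least-squares scale-and-shift alignment reflects the usual zero-shot evaluation protocol for affine-invariant predictions; the paper's exact protocol may differ in detail, and the helper name depth_metrics is ours.

import torch

def depth_metrics(pred, gt):
    # Standard depth-estimation metrics, as reported in the table:
    #   AbsRel       = mean(|pred - gt| / gt)
    #   Delta < 1.25 = fraction of pixels with max(pred/gt, gt/pred) < 1.25
    valid = gt > 0                      # score only pixels with ground truth
    p, g = pred[valid], gt[valid]
    # Least-squares scale/shift alignment: solve min ||a*p + b - g||^2,
    # the common protocol for affine-invariant zero-shot predictions.
    A = torch.stack([p, torch.ones_like(p)], dim=1)
    a, b = torch.linalg.lstsq(A, g.unsqueeze(1)).solution.squeeze(1)
    p = (a * p + b).clamp(min=1e-6)     # keep depths positive for the ratios
    abs_rel = ((p - g).abs() / g).mean().item()
    delta1 = (torch.maximum(p / g, g / p) < 1.25).float().mean().item()
    return abs_rel, delta1

Higher Delta < 1.25 and lower AbsRel are better: for example, 0.981 / 0.043 on NYU-Depth V2 means 98.1% of pixels fall within a factor-1.25 ratio of the ground truth, with a 4.3% mean relative error.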

Related Papers

Advancing Complex Wide-Area Scene Understanding with Hierarchical Coresets Selection (2025-07-17)
Argus: Leveraging Multiview Images for Improved 3-D Scene Understanding With Large Language Models (2025-07-17)
City-VLM: Towards Multidomain Perception Scene Understanding via Multimodal Incomplete Learning (2025-07-17)
$S^2M^2$: Scalable Stereo Matching Model for Reliable Depth Estimation (2025-07-17)
$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning (2025-07-17)
Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation (2025-07-16)
Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios (2025-07-16)
Learning to Tune Like an Expert: Interpretable and Scene-Aware Navigation via MLLM Reasoning and CVAE-Based Adaptation (2025-07-15)