TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/URCDC-Depth: Uncertainty Rectified Cross-Distillation with...

URCDC-Depth: Uncertainty Rectified Cross-Distillation with CutFlip for Monocular Depth Estimation

Shuwei Shao, Zhongcai Pei, Weihai Chen, Ran Li, Zhong Liu, Zhengguo Li

2023-02-16Data AugmentationDepth EstimationMonocular Depth Estimation
PaperPDFCode(official)

Abstract

This work aims to estimate a high-quality depth map from a single RGB image. Due to the lack of depth clues, making full use of the long-range correlation and the local information is critical for accurate depth estimation. Towards this end, we introduce an uncertainty rectified cross-distillation between Transformer and convolutional neural network (CNN) to learn a unified depth estimator. Specifically, we use the depth estimates from the Transformer branch and the CNN branch as pseudo labels to teach each other. Meanwhile, we model the pixel-wise depth uncertainty to rectify the loss weights of noisy pseudo labels. To avoid the large capacity gap induced by the strong Transformer branch deteriorating the cross-distillation, we transfer the feature maps from Transformer to CNN and design coupling units to assist the weak CNN branch to leverage the transferred features. Furthermore, we propose a surprisingly simple yet highly effective data augmentation technique CutFlip, which enforces the model to exploit more valuable clues apart from the vertical image position for depth inference. Extensive experiments demonstrate that our model, termed~\textbf{URCDC-Depth}, exceeds previous state-of-the-art methods on the KITTI, NYU-Depth-v2 and SUN RGB-D datasets, even with no additional computational burden at inference time. The source code is publicly available at \url{https://github.com/ShuweiShao/URCDC-Depth}.

Results

TaskDatasetMetricValueModel
Depth EstimationNYU-Depth V2Delta < 1.250.933URCDC-Depth
Depth EstimationNYU-Depth V2Delta < 1.25^20.992URCDC-Depth
Depth EstimationNYU-Depth V2Delta < 1.25^30.998URCDC-Depth
Depth EstimationNYU-Depth V2RMSE0.316URCDC-Depth
Depth EstimationNYU-Depth V2absolute relative error0.088URCDC-Depth
Depth EstimationNYU-Depth V2log 100.038URCDC-Depth
Depth EstimationKITTI Eigen splitDelta < 1.250.977URCDC-Depth
Depth EstimationKITTI Eigen splitDelta < 1.25^20.997URCDC-Depth
Depth EstimationKITTI Eigen splitDelta < 1.25^30.999URCDC-Depth
Depth EstimationKITTI Eigen splitRMSE2.032URCDC-Depth
Depth EstimationKITTI Eigen splitRMSE log0.076URCDC-Depth
Depth EstimationKITTI Eigen splitSq Rel0.142URCDC-Depth
Depth EstimationKITTI Eigen splitabsolute relative error0.05URCDC-Depth
3DNYU-Depth V2Delta < 1.250.933URCDC-Depth
3DNYU-Depth V2Delta < 1.25^20.992URCDC-Depth
3DNYU-Depth V2Delta < 1.25^30.998URCDC-Depth
3DNYU-Depth V2RMSE0.316URCDC-Depth
3DNYU-Depth V2absolute relative error0.088URCDC-Depth
3DNYU-Depth V2log 100.038URCDC-Depth
3DKITTI Eigen splitDelta < 1.250.977URCDC-Depth
3DKITTI Eigen splitDelta < 1.25^20.997URCDC-Depth
3DKITTI Eigen splitDelta < 1.25^30.999URCDC-Depth
3DKITTI Eigen splitRMSE2.032URCDC-Depth
3DKITTI Eigen splitRMSE log0.076URCDC-Depth
3DKITTI Eigen splitSq Rel0.142URCDC-Depth
3DKITTI Eigen splitabsolute relative error0.05URCDC-Depth

Related Papers

Overview of the TalentCLEF 2025: Skill and Job Title Intelligence for Human Capital Management2025-07-17Pixel Perfect MegaMed: A Megapixel-Scale Vision-Language Foundation Model for Generating High Resolution Medical Images2025-07-17$S^2M^2$: Scalable Stereo Matching Model for Reliable Depth Estimation2025-07-17$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Similarity-Guided Diffusion for Contrastive Sequential Recommendation2025-07-16Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation2025-07-16Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16Data Augmentation in Time Series Forecasting through Inverted Framework2025-07-15