TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/CutDepth:Edge-aware Data Augmentation in Depth Estimation

CutDepth:Edge-aware Data Augmentation in Depth Estimation

Yasunori Ishii, Takayoshi Yamashita

2021-07-16Data AugmentationDepth EstimationMonocular Depth Estimation
PaperPDFCode

Abstract

It is difficult to collect data on a large scale in a monocular depth estimation because the task requires the simultaneous acquisition of RGB images and depths. Data augmentation is thus important to this task. However, there has been little research on data augmentation for tasks such as monocular depth estimation, where the transformation is performed pixel by pixel. In this paper, we propose a data augmentation method, called CutDepth. In CutDepth, part of the depth is pasted onto an input image during training. The method extends variations data without destroying edge features. Experiments objectively and subjectively show that the proposed method outperforms conventional methods of data augmentation. The estimation accuracy is improved with CutDepth even though there are few training data at long distances.

Results

TaskDatasetMetricValueModel
Depth EstimationNYU-Depth V2Delta < 1.250.899CutDepth
Depth EstimationNYU-Depth V2Delta < 1.25^20.985CutDepth
Depth EstimationNYU-Depth V2Delta < 1.25^30.997CutDepth
Depth EstimationNYU-Depth V2RMSE0.375CutDepth
Depth EstimationNYU-Depth V2absolute relative error0.104CutDepth
Depth EstimationNYU-Depth V2log 100.044CutDepth
3DNYU-Depth V2Delta < 1.250.899CutDepth
3DNYU-Depth V2Delta < 1.25^20.985CutDepth
3DNYU-Depth V2Delta < 1.25^30.997CutDepth
3DNYU-Depth V2RMSE0.375CutDepth
3DNYU-Depth V2absolute relative error0.104CutDepth
3DNYU-Depth V2log 100.044CutDepth

Related Papers

Overview of the TalentCLEF 2025: Skill and Job Title Intelligence for Human Capital Management2025-07-17Pixel Perfect MegaMed: A Megapixel-Scale Vision-Language Foundation Model for Generating High Resolution Medical Images2025-07-17$S^2M^2$: Scalable Stereo Matching Model for Reliable Depth Estimation2025-07-17$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Similarity-Guided Diffusion for Contrastive Sequential Recommendation2025-07-16Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation2025-07-16Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16Data Augmentation in Time Series Forecasting through Inverted Framework2025-07-15