TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Excavating the Potential Capacity of Self-Supervised Monoc...

Excavating the Potential Capacity of Self-Supervised Monocular Depth Estimation

Rui Peng, Ronggang Wang, Yawen Lai, Luyang Tang, Yangang Cai

2021-09-26ICCV 2021 10Data AugmentationSemantic SegmentationDepth EstimationMonocular Depth Estimation
PaperPDFCode(official)

Abstract

Self-supervised methods play an increasingly important role in monocular depth estimation due to their great potential and low annotation cost. To close the gap with supervised methods, recent works take advantage of extra constraints, e.g., semantic segmentation. However, these methods will inevitably increase the burden on the model. In this paper, we show theoretical and empirical evidence that the potential capacity of self-supervised monocular depth estimation can be excavated without increasing this cost. In particular, we propose (1) a novel data augmentation approach called data grafting, which forces the model to explore more cues to infer depth besides the vertical image position, (2) an exploratory self-distillation loss, which is supervised by the self-distillation label generated by our new post-processing method - selective post-processing, and (3) the full-scale network, designed to endow the encoder with the specialization of depth estimation task and enhance the representational power of the model. Extensive experiments show that our contributions can bring significant performance improvement to the baseline with even less computational overhead, and our model, named EPCDepth, surpasses the previous state-of-the-art methods even those supervised by additional constraints.

Results

TaskDatasetMetricValueModel
Depth EstimationKITTI Eigen split unsupervisedDelta < 1.250.901EPCDepth(S+1024x320)
Depth EstimationKITTI Eigen split unsupervisedDelta < 1.25^20.966EPCDepth(S+1024x320)
Depth EstimationKITTI Eigen split unsupervisedDelta < 1.25^30.983EPCDepth(S+1024x320)
Depth EstimationKITTI Eigen split unsupervisedRMSE4.207EPCDepth(S+1024x320)
Depth EstimationKITTI Eigen split unsupervisedRMSE log0.176EPCDepth(S+1024x320)
Depth EstimationKITTI Eigen split unsupervisedSq Rel0.646EPCDepth(S+1024x320)
Depth EstimationKITTI Eigen split unsupervisedabsolute relative error0.091EPCDepth(S+1024x320)
Depth EstimationKITTI Eigen split unsupervisedDelta < 1.250.888EPCDepth(S+640x192)
Depth EstimationKITTI Eigen split unsupervisedDelta < 1.25^20.963EPCDepth(S+640x192)
Depth EstimationKITTI Eigen split unsupervisedDelta < 1.25^30.982EPCDepth(S+640x192)
Depth EstimationKITTI Eigen split unsupervisedRMSE4.49EPCDepth(S+640x192)
Depth EstimationKITTI Eigen split unsupervisedRMSE log0.183EPCDepth(S+640x192)
Depth EstimationKITTI Eigen split unsupervisedSq Rel0.754EPCDepth(S+640x192)
Depth EstimationKITTI Eigen split unsupervisedabsolute relative error0.099EPCDepth(S+640x192)
3DKITTI Eigen split unsupervisedDelta < 1.250.901EPCDepth(S+1024x320)
3DKITTI Eigen split unsupervisedDelta < 1.25^20.966EPCDepth(S+1024x320)
3DKITTI Eigen split unsupervisedDelta < 1.25^30.983EPCDepth(S+1024x320)
3DKITTI Eigen split unsupervisedRMSE4.207EPCDepth(S+1024x320)
3DKITTI Eigen split unsupervisedRMSE log0.176EPCDepth(S+1024x320)
3DKITTI Eigen split unsupervisedSq Rel0.646EPCDepth(S+1024x320)
3DKITTI Eigen split unsupervisedabsolute relative error0.091EPCDepth(S+1024x320)
3DKITTI Eigen split unsupervisedDelta < 1.250.888EPCDepth(S+640x192)
3DKITTI Eigen split unsupervisedDelta < 1.25^20.963EPCDepth(S+640x192)
3DKITTI Eigen split unsupervisedDelta < 1.25^30.982EPCDepth(S+640x192)
3DKITTI Eigen split unsupervisedRMSE4.49EPCDepth(S+640x192)
3DKITTI Eigen split unsupervisedRMSE log0.183EPCDepth(S+640x192)
3DKITTI Eigen split unsupervisedSq Rel0.754EPCDepth(S+640x192)
3DKITTI Eigen split unsupervisedabsolute relative error0.099EPCDepth(S+640x192)

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Overview of the TalentCLEF 2025: Skill and Job Title Intelligence for Human Capital Management2025-07-17Pixel Perfect MegaMed: A Megapixel-Scale Vision-Language Foundation Model for Generating High Resolution Medical Images2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17$S^2M^2$: Scalable Stereo Matching Model for Reliable Depth Estimation2025-07-17