TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/DCPI-Depth: Explicitly Infusing Dense Correspondence Prior...

DCPI-Depth: Explicitly Infusing Dense Correspondence Prior to Unsupervised Monocular Depth Estimation

Mengtan Zhang, Yi Feng, Qijun Chen, Rui Fan

2024-05-27Optical Flow EstimationUnsupervised Monocular Depth EstimationDepth EstimationMonocular Depth Estimation
PaperPDF

Abstract

There has been a recent surge of interest in learning to perceive depth from monocular videos in an unsupervised fashion. A key challenge in this field is achieving robust and accurate depth estimation in challenging scenarios, particularly in regions with weak textures or where dynamic objects are present. This study makes three major contributions by delving deeply into dense correspondence priors to provide existing frameworks with explicit geometric constraints. The first novelty is a contextual-geometric depth consistency loss, which employs depth maps triangulated from dense correspondences based on estimated ego-motion to guide the learning of depth perception from contextual information, since explicitly triangulated depth maps capture accurate relative distances among pixels. The second novelty arises from the observation that there exists an explicit, deducible relationship between optical flow divergence and depth gradient. A differential property correlation loss is, therefore, designed to refine depth estimation with a specific emphasis on local variations. The third novelty is a bidirectional stream co-adjustment strategy that enhances the interaction between rigid and optical flows, encouraging the former towards more accurate correspondence and making the latter more adaptable across various scenarios under the static scene hypotheses. DCPI-Depth, a framework that incorporates all these innovative components and couples two bidirectional and collaborative streams, achieves state-of-the-art performance and generalizability across multiple public datasets, outperforming all existing prior arts. Specifically, it demonstrates accurate depth estimation in texture-less and dynamic regions, and shows more reasonable smoothness. Our source code will be publicly available at mias.group/DCPI-Depth upon publication.

Results

TaskDatasetMetricValueModel
Depth EstimationKITTI Eigen split unsupervisedDelta < 1.250.914DCPI-Depth (M+1024x320)
Depth EstimationKITTI Eigen split unsupervisedDelta < 1.25^20.969DCPI-Depth (M+1024x320)
Depth EstimationKITTI Eigen split unsupervisedDelta < 1.25^30.985DCPI-Depth (M+1024x320)
Depth EstimationKITTI Eigen split unsupervisedRMSE4.113DCPI-Depth (M+1024x320)
Depth EstimationKITTI Eigen split unsupervisedRMSE log0.167DCPI-Depth (M+1024x320)
Depth EstimationKITTI Eigen split unsupervisedSq Rel0.655DCPI-Depth (M+1024x320)
Depth EstimationKITTI Eigen split unsupervisedabsolute relative error0.09DCPI-Depth (M+1024x320)
Depth EstimationKITTI Eigen split unsupervisedDelta < 1.250.902DCPI-Depth (M+640x192)
Depth EstimationKITTI Eigen split unsupervisedDelta < 1.25^20.967DCPI-Depth (M+640x192)
Depth EstimationKITTI Eigen split unsupervisedDelta < 1.25^30.985DCPI-Depth (M+640x192)
Depth EstimationKITTI Eigen split unsupervisedRMSE4.274DCPI-Depth (M+640x192)
Depth EstimationKITTI Eigen split unsupervisedRMSE log0.17DCPI-Depth (M+640x192)
Depth EstimationKITTI Eigen split unsupervisedSq Rel0.662DCPI-Depth (M+640x192)
Depth EstimationKITTI Eigen split unsupervisedabsolute relative error0.095DCPI-Depth (M+640x192)
Depth EstimationKITTI Eigen split unsupervisedRMSE4.496DCPI-Depth (M+832x256+SC-V3)
Depth EstimationKITTI Eigen split unsupervisedSq Rel0.679DCPI-Depth (M+832x256+SC-V3)
Depth EstimationKITTI Eigen split unsupervisedabsolute relative error0.109DCPI-Depth (M+832x256+SC-V3)
3DKITTI Eigen split unsupervisedDelta < 1.250.914DCPI-Depth (M+1024x320)
3DKITTI Eigen split unsupervisedDelta < 1.25^20.969DCPI-Depth (M+1024x320)
3DKITTI Eigen split unsupervisedDelta < 1.25^30.985DCPI-Depth (M+1024x320)
3DKITTI Eigen split unsupervisedRMSE4.113DCPI-Depth (M+1024x320)
3DKITTI Eigen split unsupervisedRMSE log0.167DCPI-Depth (M+1024x320)
3DKITTI Eigen split unsupervisedSq Rel0.655DCPI-Depth (M+1024x320)
3DKITTI Eigen split unsupervisedabsolute relative error0.09DCPI-Depth (M+1024x320)
3DKITTI Eigen split unsupervisedDelta < 1.250.902DCPI-Depth (M+640x192)
3DKITTI Eigen split unsupervisedDelta < 1.25^20.967DCPI-Depth (M+640x192)
3DKITTI Eigen split unsupervisedDelta < 1.25^30.985DCPI-Depth (M+640x192)
3DKITTI Eigen split unsupervisedRMSE4.274DCPI-Depth (M+640x192)
3DKITTI Eigen split unsupervisedRMSE log0.17DCPI-Depth (M+640x192)
3DKITTI Eigen split unsupervisedSq Rel0.662DCPI-Depth (M+640x192)
3DKITTI Eigen split unsupervisedabsolute relative error0.095DCPI-Depth (M+640x192)
3DKITTI Eigen split unsupervisedRMSE4.496DCPI-Depth (M+832x256+SC-V3)
3DKITTI Eigen split unsupervisedSq Rel0.679DCPI-Depth (M+832x256+SC-V3)
3DKITTI Eigen split unsupervisedabsolute relative error0.109DCPI-Depth (M+832x256+SC-V3)

Related Papers

Channel-wise Motion Features for Efficient Motion Segmentation2025-07-17$S^2M^2$: Scalable Stereo Matching Model for Reliable Depth Estimation2025-07-17$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation2025-07-16Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16MonoMVSNet: Monocular Priors Guided Multi-View Stereo Network2025-07-15Towards Depth Foundation Model: Recent Trends in Vision-Based Depth Estimation2025-07-15Cameras as Relative Positional Encoding2025-07-14