TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Disentangling Object Motion and Occlusion for Unsupervised...

Disentangling Object Motion and Occlusion for Unsupervised Multi-frame Monocular Depth

Ziyue Feng, Liang Yang, Longlong Jing, HaiYan Wang, YingLi Tian, Bing Li

2022-03-29DisentanglementUnsupervised Monocular Depth EstimationDepth PredictionPredictionMotion DisentanglementDepth EstimationMonocular Depth Estimation
PaperPDFCode(official)

Abstract

Conventional self-supervised monocular depth prediction methods are based on a static environment assumption, which leads to accuracy degradation in dynamic scenes due to the mismatch and occlusion problems introduced by object motions. Existing dynamic-object-focused methods only partially solved the mismatch problem at the training loss level. In this paper, we accordingly propose a novel multi-frame monocular depth prediction method to solve these problems at both the prediction and supervision loss levels. Our method, called DynamicDepth, is a new framework trained via a self-supervised cycle consistent learning scheme. A Dynamic Object Motion Disentanglement (DOMD) module is proposed to disentangle object motions to solve the mismatch problem. Moreover, novel occlusion-aware Cost Volume and Re-projection Loss are designed to alleviate the occlusion effects of object motions. Extensive analyses and experiments on the Cityscapes and KITTI datasets show that our method significantly outperforms the state-of-the-art monocular depth prediction methods, especially in the areas of dynamic objects. Code is available at https://github.com/AutoAILab/DynamicDepth

Results

TaskDatasetMetricValueModel
Depth EstimationKITTI Eigen split unsupervisedDelta < 1.250.897DynamicDepth (M+640x192)
Depth EstimationKITTI Eigen split unsupervisedDelta < 1.25^20.964DynamicDepth (M+640x192)
Depth EstimationKITTI Eigen split unsupervisedDelta < 1.25^30.984DynamicDepth (M+640x192)
Depth EstimationKITTI Eigen split unsupervisedRMSE4.458DynamicDepth (M+640x192)
Depth EstimationKITTI Eigen split unsupervisedRMSE log0.175DynamicDepth (M+640x192)
Depth EstimationKITTI Eigen split unsupervisedSq Rel0.72DynamicDepth (M+640x192)
Depth EstimationKITTI Eigen split unsupervisedabsolute relative error0.096DynamicDepth (M+640x192)
3DKITTI Eigen split unsupervisedDelta < 1.250.897DynamicDepth (M+640x192)
3DKITTI Eigen split unsupervisedDelta < 1.25^20.964DynamicDepth (M+640x192)
3DKITTI Eigen split unsupervisedDelta < 1.25^30.984DynamicDepth (M+640x192)
3DKITTI Eigen split unsupervisedRMSE4.458DynamicDepth (M+640x192)
3DKITTI Eigen split unsupervisedRMSE log0.175DynamicDepth (M+640x192)
3DKITTI Eigen split unsupervisedSq Rel0.72DynamicDepth (M+640x192)
3DKITTI Eigen split unsupervisedabsolute relative error0.096DynamicDepth (M+640x192)

Related Papers

Multi-Strategy Improved Snake Optimizer Accelerated CNN-LSTM-Attention-Adaboost for Trajectory Prediction2025-07-21CSD-VAR: Content-Style Decomposition in Visual Autoregressive Models2025-07-18$S^2M^2$: Scalable Stereo Matching Model for Reliable Depth Estimation2025-07-17$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation2025-07-16Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16MonoMVSNet: Monocular Priors Guided Multi-View Stereo Network2025-07-15Generative Click-through Rate Prediction with Applications to Search Advertising2025-07-15