TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Manydepth2: Motion-Aware Self-Supervised Multi-Frame Monoc...

Manydepth2: Motion-Aware Self-Supervised Multi-Frame Monocular Depth Estimation in Dynamic Scenes

Kaichen Zhou, Jia-Wang Bian, Jian-Qing Zheng, JiaXing Zhong, Qian Xie, Niki Trigoni, Andrew Markham

2023-12-23Optical Flow EstimationCamera Pose EstimationDepth EstimationMonocular Depth Estimation
PaperPDFCode

Abstract

Despite advancements in self-supervised monocular depth estimation, challenges persist in dynamic scenarios due to the dependence on assumptions about a static world. In this paper, we present Manydepth2, to achieve precise depth estimation for both dynamic objects and static backgrounds, all while maintaining computational efficiency. To tackle the challenges posed by dynamic content, we incorporate optical flow and coarse monocular depth to create a pseudo-static reference frame. This frame is then utilized to build a motion-aware cost volume in collaboration with the vanilla target frame. Furthermore, to improve the accuracy and robustness of the network architecture, we propose an attention-based depth network that effectively integrates information from feature maps at different resolutions by incorporating both channel and non-local attention mechanisms. Compared to methods with similar computational costs, Manydepth2 achieves a significant reduction of approximately five percent in root-mean-square error for self-supervised monocular depth estimation on the KITTI-2015 dataset. The code could be found at https://github.com/kaichen-z/Manydepth2.

Results

TaskDatasetMetricValueModel
Depth EstimationKITTI Eigen splitDelta < 1.250.909Manydepth2
Depth EstimationKITTI Eigen splitDelta < 1.25^20.968Manydepth2
Depth EstimationKITTI Eigen splitDelta < 1.25^30.984Manydepth2
Depth EstimationKITTI Eigen splitRMSE4.232Manydepth2
Depth EstimationKITTI Eigen splitRMSE log0.649Manydepth2
Depth EstimationKITTI Eigen splitSq Rel0.17Manydepth2
Depth EstimationKITTI Eigen splitabsolute relative error0.091Manydepth2
Depth EstimationCityscapesAbsolute relative error (AbsRel)0.097Manydepth2
Depth EstimationCityscapesRMSE5.827Manydepth2
Depth EstimationCityscapesRMSE log0.154Manydepth2
Depth EstimationCityscapesSquare relative error (SqRel)0.792Manydepth2
Depth EstimationKITTI Eigen split unsupervisedDelta < 1.250.909Manydepth2(M+640x192)
Depth EstimationKITTI Eigen split unsupervisedDelta < 1.25^20.968Manydepth2(M+640x192)
Depth EstimationKITTI Eigen split unsupervisedDelta < 1.25^30.984Manydepth2(M+640x192)
Depth EstimationKITTI Eigen split unsupervisedRMSE4.232Manydepth2(M+640x192)
Depth EstimationKITTI Eigen split unsupervisedRMSE log0.17Manydepth2(M+640x192)
Depth EstimationKITTI Eigen split unsupervisedSq Rel0.649Manydepth2(M+640x192)
Depth EstimationKITTI Eigen split unsupervisedabsolute relative error0.091Manydepth2(M+640x192)
Depth EstimationKITTI Eigen split unsupervisedDelta < 1.250.909Manydepth2-NF(M+640x192)
Depth EstimationKITTI Eigen split unsupervisedDelta < 1.25^20.968Manydepth2-NF(M+640x192)
Depth EstimationKITTI Eigen split unsupervisedDelta < 1.25^30.985Manydepth2-NF(M+640x192)
Depth EstimationKITTI Eigen split unsupervisedRMSE4.246Manydepth2-NF(M+640x192)
Depth EstimationKITTI Eigen split unsupervisedRMSE log0.17Manydepth2-NF(M+640x192)
Depth EstimationKITTI Eigen split unsupervisedSq Rel0.676Manydepth2-NF(M+640x192)
Depth EstimationKITTI Eigen split unsupervisedabsolute relative error0.094Manydepth2-NF(M+640x192)
3DKITTI Eigen splitDelta < 1.250.909Manydepth2
3DKITTI Eigen splitDelta < 1.25^20.968Manydepth2
3DKITTI Eigen splitDelta < 1.25^30.984Manydepth2
3DKITTI Eigen splitRMSE4.232Manydepth2
3DKITTI Eigen splitRMSE log0.649Manydepth2
3DKITTI Eigen splitSq Rel0.17Manydepth2
3DKITTI Eigen splitabsolute relative error0.091Manydepth2
3DCityscapesAbsolute relative error (AbsRel)0.097Manydepth2
3DCityscapesRMSE5.827Manydepth2
3DCityscapesRMSE log0.154Manydepth2
3DCityscapesSquare relative error (SqRel)0.792Manydepth2
3DKITTI Eigen split unsupervisedDelta < 1.250.909Manydepth2(M+640x192)
3DKITTI Eigen split unsupervisedDelta < 1.25^20.968Manydepth2(M+640x192)
3DKITTI Eigen split unsupervisedDelta < 1.25^30.984Manydepth2(M+640x192)
3DKITTI Eigen split unsupervisedRMSE4.232Manydepth2(M+640x192)
3DKITTI Eigen split unsupervisedRMSE log0.17Manydepth2(M+640x192)
3DKITTI Eigen split unsupervisedSq Rel0.649Manydepth2(M+640x192)
3DKITTI Eigen split unsupervisedabsolute relative error0.091Manydepth2(M+640x192)
3DKITTI Eigen split unsupervisedDelta < 1.250.909Manydepth2-NF(M+640x192)
3DKITTI Eigen split unsupervisedDelta < 1.25^20.968Manydepth2-NF(M+640x192)
3DKITTI Eigen split unsupervisedDelta < 1.25^30.985Manydepth2-NF(M+640x192)
3DKITTI Eigen split unsupervisedRMSE4.246Manydepth2-NF(M+640x192)
3DKITTI Eigen split unsupervisedRMSE log0.17Manydepth2-NF(M+640x192)
3DKITTI Eigen split unsupervisedSq Rel0.676Manydepth2-NF(M+640x192)
3DKITTI Eigen split unsupervisedabsolute relative error0.094Manydepth2-NF(M+640x192)
Camera Pose EstimationKITTI Odometry BenchmarkAverage Rotational Error er[%]2.205Manydepth2
Camera Pose EstimationKITTI Odometry BenchmarkAverage Translational Error et[%]7.15Manydepth2

Related Papers

Channel-wise Motion Features for Efficient Motion Segmentation2025-07-17$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17$S^2M^2$: Scalable Stereo Matching Model for Reliable Depth Estimation2025-07-17SpatialTrackerV2: 3D Point Tracking Made Easy2025-07-16SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation2025-07-16BRUM: Robust 3D Vehicle Reconstruction from 360 Sparse Images2025-07-16Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation2025-07-16Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16