ProDepth: Boosting Self-Supervised Multi-Frame Monocular Depth with Probabilistic Fusion

Sungmin Woo, Wonjoon Lee, Woo Jin Kim, Dogyoon Lee, Sangyoun Lee

2024-07-12Unsupervised Monocular Depth Estimation Depth Prediction Depth Estimation Monocular Depth Estimation

Abstract

Self-supervised multi-frame monocular depth estimation relies on the geometric consistency between successive frames under the assumption of a static scene. However, the presence of moving objects in dynamic scenes introduces inevitable inconsistencies, causing misaligned multi-frame feature matching and misleading self-supervision during training. In this paper, we propose a novel framework called ProDepth, which effectively addresses the mismatch problem caused by dynamic objects using a probabilistic approach. We initially deduce the uncertainty associated with static scene assumption by adopting an auxiliary decoder. This decoder analyzes inconsistencies embedded in the cost volume, inferring the probability of areas being dynamic. We then directly rectify the erroneous cost volume for dynamic areas through a Probabilistic Cost Volume Modulation (PCVM) module. Specifically, we derive probability distributions of depth candidates from both single-frame and multi-frame cues, modulating the cost volume by adaptively fusing those distributions based on the inferred uncertainty. Additionally, we present a self-supervision loss reweighting strategy that not only masks out incorrect supervision with high uncertainty but also mitigates the risks in remaining possible dynamic areas in accordance with the probability. Our proposed method excels over state-of-the-art approaches in all metrics on both Cityscapes and KITTI datasets, and demonstrates superior generalization ability on the Waymo Open dataset.

Results

Task	Dataset	Metric	Value	Model
Depth Estimation	KITTI Eigen split unsupervised	Delta < 1.25	0.918	ProDepth
Depth Estimation	KITTI Eigen split unsupervised	Delta < 1.25^2	0.969	ProDepth
Depth Estimation	KITTI Eigen split unsupervised	Delta < 1.25^3	0.984	ProDepth
Depth Estimation	KITTI Eigen split unsupervised	RMSE	4.139	ProDepth
Depth Estimation	KITTI Eigen split unsupervised	RMSE log	0.166	ProDepth
Depth Estimation	KITTI Eigen split unsupervised	Sq Rel	0.629	ProDepth
Depth Estimation	KITTI Eigen split unsupervised	absolute relative error	0.086	ProDepth
Depth Estimation	KITTI Eigen split unsupervised	Delta < 1.25	0.902	ProDepth(M+640x192)
Depth Estimation	KITTI Eigen split unsupervised	Delta < 1.25^2	0.967	ProDepth(M+640x192)
Depth Estimation	KITTI Eigen split unsupervised	Delta < 1.25^3	0.985	ProDepth(M+640x192)
Depth Estimation	KITTI Eigen split unsupervised	RMSE	4.345	ProDepth(M+640x192)
Depth Estimation	KITTI Eigen split unsupervised	RMSE log	0.172	ProDepth(M+640x192)
Depth Estimation	KITTI Eigen split unsupervised	Sq Rel	0.693	ProDepth(M+640x192)
Depth Estimation	KITTI Eigen split unsupervised	absolute relative error	0.095	ProDepth(M+640x192)
3D	KITTI Eigen split unsupervised	Delta < 1.25	0.918	ProDepth
3D	KITTI Eigen split unsupervised	Delta < 1.25^2	0.969	ProDepth
3D	KITTI Eigen split unsupervised	Delta < 1.25^3	0.984	ProDepth
3D	KITTI Eigen split unsupervised	RMSE	4.139	ProDepth
3D	KITTI Eigen split unsupervised	RMSE log	0.166	ProDepth
3D	KITTI Eigen split unsupervised	Sq Rel	0.629	ProDepth
3D	KITTI Eigen split unsupervised	absolute relative error	0.086	ProDepth
3D	KITTI Eigen split unsupervised	Delta < 1.25	0.902	ProDepth(M+640x192)
3D	KITTI Eigen split unsupervised	Delta < 1.25^2	0.967	ProDepth(M+640x192)
3D	KITTI Eigen split unsupervised	Delta < 1.25^3	0.985	ProDepth(M+640x192)
3D	KITTI Eigen split unsupervised	RMSE	4.345	ProDepth(M+640x192)
3D	KITTI Eigen split unsupervised	RMSE log	0.172	ProDepth(M+640x192)
3D	KITTI Eigen split unsupervised	Sq Rel	0.693	ProDepth(M+640x192)
3D	KITTI Eigen split unsupervised	absolute relative error	0.095	ProDepth(M+640x192)

ProDepth: Boosting Self-Supervised Multi-Frame Monocular Depth with Probabilistic Fusion

Abstract

Results

Related Papers

ProDepth: Boosting Self-Supervised Multi-Frame Monocular Depth with Probabilistic Fusion

Abstract

Results

Related Papers