Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


Unsupervised Monocular Depth Estimation with Left-Right Consistency

Clément Godard, Oisin Mac Aodha, Gabriel J. Brostow

2016-09-13 · CVPR 2017
Tasks: Image Reconstruction, Unsupervised Monocular Depth Estimation, Depth Prediction, Depth Estimation, Monocular Depth Estimation
Links: Paper · PDF · Code (official and community implementations)

Abstract

Learning based methods have shown very promising results for the task of depth estimation in single images. However, most existing approaches treat depth prediction as a supervised regression problem and as a result, require vast quantities of corresponding ground truth depth data for training. Just recording quality depth data in a range of environments is a challenging problem. In this paper, we innovate beyond existing approaches, replacing the use of explicit depth data during training with easier-to-obtain binocular stereo footage. We propose a novel training objective that enables our convolutional neural network to learn to perform single image depth estimation, despite the absence of ground truth depth data. Exploiting epipolar geometry constraints, we generate disparity images by training our network with an image reconstruction loss. We show that solving for image reconstruction alone results in poor quality depth images. To overcome this problem, we propose a novel training loss that enforces consistency between the disparities produced relative to both the left and right images, leading to improved performance and robustness compared to existing approaches. Our method produces state of the art results for monocular depth estimation on the KITTI driving dataset, even outperforming supervised methods that have been trained with ground truth depth.
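The abstract's key contribution — a training loss that enforces consistency between the disparity maps predicted for the left and right views — can be sketched in a few lines. The snippet below is a minimal NumPy illustration under assumed conventions: it uses nearest-neighbour sampling where the paper uses a bilinear sampler, takes a `sign` parameter because the projection direction depends on the stereo rig convention, and the function names (`sample_with_disparity`, `lr_consistency_loss`) are hypothetical, not the authors' code.

```python
import numpy as np

def sample_with_disparity(disp_src, disp_to_sample, sign=-1):
    """Sample `disp_to_sample` at x-coordinates shifted by `disp_src`.

    Nearest-neighbour sampling for simplicity; the paper uses a
    differentiable bilinear sampler. `sign` encodes the (assumed)
    stereo convention for which way disparities shift pixels.
    """
    h, w = disp_src.shape
    xs = np.arange(w)[None, :] + sign * disp_src   # shifted x-coordinates
    xs = np.clip(np.round(xs).astype(int), 0, w - 1)
    rows = np.arange(h)[:, None]
    return disp_to_sample[rows, xs]

def lr_consistency_loss(disp_left, disp_right):
    """L1 penalty between the left disparity map and the right
    disparity map projected into the left view."""
    projected = sample_with_disparity(disp_left, disp_right, sign=-1)
    return float(np.mean(np.abs(disp_left - projected)))
```

In the full training objective this term is combined with an image reconstruction loss and a disparity smoothness loss, computed for both views; the sketch above covers only the left-view consistency term.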

Results

Task             | Dataset                        | Metric                  | Value  | Model
Depth Estimation | Mid-Air Dataset                | Abs Rel                 | 0.3136 | Monodepth
Depth Estimation | Mid-Air Dataset                | RMSE                    | 13.595 | Monodepth
Depth Estimation | Mid-Air Dataset                | RMSE log                | 0.438  | Monodepth
Depth Estimation | Mid-Air Dataset                | Sq Rel                  | 8.7127 | Monodepth
Depth Estimation | KITTI Eigen split unsupervised | absolute relative error | 0.133  | Monodepth S
3D               | Mid-Air Dataset                | Abs Rel                 | 0.3136 | Monodepth
3D               | Mid-Air Dataset                | RMSE                    | 13.595 | Monodepth
3D               | Mid-Air Dataset                | RMSE log                | 0.438  | Monodepth
3D               | Mid-Air Dataset                | Sq Rel                  | 8.7127 | Monodepth
3D               | KITTI Eigen split unsupervised | absolute relative error | 0.133  | Monodepth S

Related Papers

$S^2M^2$: Scalable Stereo Matching Model for Reliable Depth Estimation (2025-07-17)
$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning (2025-07-17)
Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation (2025-07-16)
Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios (2025-07-16)
The model is the message: Lightweight convolutional autoencoders applied to noisy imaging data for planetary science and astrobiology (2025-07-15)
3D Magnetic Inverse Routine for Single-Segment Magnetic Field Images (2025-07-15)
MonoMVSNet: Monocular Priors Guided Multi-View Stereo Network (2025-07-15)
Towards Depth Foundation Model: Recent Trends in Vision-Based Depth Estimation (2025-07-15)