Learning Monocular Depth in Dynamic Scenes via Instance-Aware Projection Consistency

Seokju Lee, Sunghoon Im, Stephen Lin, In So Kweon

2021-02-04

Optical Flow Estimation, Motion Estimation, Unsupervised Monocular Depth Estimation, Semantic Segmentation, Instance Segmentation, Video Instance Segmentation, Monocular Depth Estimation


Abstract

We present an end-to-end joint training framework that explicitly models 6-DoF motion of multiple dynamic objects, ego-motion and depth in a monocular camera setup without supervision. Our technical contributions are three-fold. First, we highlight the fundamental difference between inverse and forward projection while modeling the individual motion of each rigid object, and propose a geometrically correct projection pipeline using a neural forward projection module. Second, we design a unified instance-aware photometric and geometric consistency loss that holistically imposes self-supervisory signals for every background and object region. Lastly, we introduce a general-purpose auto-annotation scheme using any off-the-shelf instance segmentation and optical flow models to produce video instance segmentation maps that will be utilized as input to our training pipeline. These proposed elements are validated in a detailed ablation study. Through extensive experiments conducted on the KITTI and Cityscapes datasets, our framework is shown to outperform the state-of-the-art depth and motion estimation methods. Our code, dataset, and models are available at https://github.com/SeokjuLee/Insta-DM.
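
The projection consistency described in the abstract builds on the rigid reprojection of a pixel with known depth into a second view. The snippet below is a minimal sketch of that textbook pinhole-camera step only. It is not the authors' neural forward projection module, and the function name `reproject_pixel`, the intrinsics `fx, fy, cx, cy`, and the argument layout are hypothetical choices made for illustration.

```python
from typing import Sequence, Tuple


def reproject_pixel(
    u: float, v: float, depth: float,
    fx: float, fy: float, cx: float, cy: float,
    rotation: Sequence[Sequence[float]],
    translation: Sequence[float],
) -> Tuple[float, float, float]:
    """Map pixel (u, v) with the given depth into a second view.

    Assumes a pinhole camera with no lens distortion; `rotation` is a 3x3
    matrix and `translation` a 3-vector describing the rigid motion
    (hypothetical argument layout, not taken from the paper).
    """
    if depth <= 0:
        raise ValueError("depth must be strictly positive")
    # Back-project the pixel to a 3D point in the source camera frame.
    point = ((u - cx) / fx * depth, (v - cy) / fy * depth, depth)
    # Apply the rigid 6-DoF motion: X' = R X + t.
    moved = [
        sum(rotation[i][j] * point[j] for j in range(3)) + translation[i]
        for i in range(3)
    ]
    if moved[2] <= 0:
        raise ValueError("point lies behind the target camera")
    # Project the moved point back onto the target image plane.
    u_new = fx * moved[0] / moved[2] + cx
    v_new = fy * moved[1] / moved[2] + cy
    return u_new, v_new, moved[2]
```

For a scene with independently moving objects, the same step would be applied with a separate rotation and translation per object region, which is where instance masks become relevant.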

Results

Task	Dataset	Metric	Value	Model
Depth Estimation	Cityscapes	Absolute relative error (AbsRel)	0.111	Lee et al.
Depth Estimation	Cityscapes	RMSE	6.437	Lee et al.
Depth Estimation	Cityscapes	RMSE log	0.182	Lee et al.
Depth Estimation	Cityscapes	Square relative error (SqRel)	1.158	Lee et al.
3D	Cityscapes	Absolute relative error (AbsRel)	0.111	Lee et al.
3D	Cityscapes	RMSE	6.437	Lee et al.
3D	Cityscapes	RMSE log	0.182	Lee et al.
3D	Cityscapes	Square relative error (SqRel)	1.158	Lee et al.
