MonoScene: Monocular 3D Semantic Scene Completion

Anh-Quan Cao, Raoul de Charette

2021-12-01 · CVPR 2022
Tasks: 3D Scene Reconstruction, 3D Reconstruction, Single-View 3D Reconstruction, 3D Semantic Scene Completion from a single RGB image, 3D Semantic Scene Completion

Abstract

MonoScene proposes a 3D Semantic Scene Completion (SSC) framework in which the dense geometry and semantics of a scene are inferred from a single monocular RGB image. Unlike the SSC literature, which relies on 2.5D or 3D input, we solve the complex problem of 2D-to-3D scene reconstruction while jointly inferring its semantics. Our framework relies on successive 2D and 3D UNets bridged by a novel 2D-3D feature projection inspired by optics, and introduces a 3D context relation prior to enforce spatio-semantic consistency. Along with these architectural contributions, we introduce novel global scene and local frustum losses. Experiments show that we outperform the literature on all metrics and datasets while hallucinating plausible scenery even beyond the camera field of view. Our code and trained models are available at https://github.com/cv-rits/MonoScene.
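
To make the idea of bridging 2D and 3D features more concrete, the following is a minimal NumPy sketch of one way to lift image features into a voxel grid: each voxel center is projected into the image with a pinhole camera model and takes the 2D feature at the pixel it lands on. This is an illustration under stated assumptions, not the authors' implementation. The function name `lift_features`, the nearest-pixel sampling, the single feature scale, and the zero fill for voxels outside the field of view are all hypothetical choices made for this example.

```python
import numpy as np

def lift_features(feat_2d, voxel_centers, K):
    """Hypothetical helper: assign each voxel the 2D feature it projects onto.

    feat_2d:       (H, W, C) image feature map.
    voxel_centers: (N, 3) voxel centers in camera coordinates
                   (x right, y down, z forward).
    K:             (3, 3) pinhole intrinsics matrix.
    Returns (N, C) lifted features and an (N,) boolean in-view mask.
    """
    H, W, C = feat_2d.shape
    N = voxel_centers.shape[0]

    z = voxel_centers[:, 2]
    in_front = z > 1e-6
    # Avoid dividing by zero or negative depth; those voxels are masked out below.
    safe_z = np.where(in_front, z, 1.0)

    # Pinhole projection: [u*z, v*z, z]^T = K @ [x, y, z]^T
    uvw = voxel_centers @ K.T
    u = np.floor(uvw[:, 0] / safe_z).astype(int)
    v = np.floor(uvw[:, 1] / safe_z).astype(int)

    in_view = in_front & (u >= 0) & (u < W) & (v >= 0) & (v < H)

    # Assumption: voxels outside the field of view receive a zero feature.
    lifted = np.zeros((N, C), dtype=feat_2d.dtype)
    lifted[in_view] = feat_2d[v[in_view], u[in_view]]
    return lifted, in_view
```

In this sketch, all voxels along the same camera ray receive the same 2D feature, so separating occupied from empty space along the ray is left to the 3D network that consumes the lifted volume.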

Results

Task	Dataset	Metric	Value (%)	Model
Reconstruction	KITTI-360	mIoU	12.31	MonoScene
Reconstruction	NYUv2	mIoU	26.94	MonoScene
Reconstruction	SemanticKITTI	mIoU	11.08	MonoScene
3D Reconstruction	NYUv2	mIoU	26.94	MonoScene (RGB input only)
3D Reconstruction	SemanticKITTI	mIoU	11.08	MonoScene (RGB input only)
3D Reconstruction	KITTI-360	mIoU	12.31	MonoScene
3D Reconstruction	NYUv2	mIoU	26.94	MonoScene
3D Reconstruction	SemanticKITTI	mIoU	11.08	MonoScene
3D	NYUv2	mIoU	26.94	MonoScene (RGB input only)
3D	SemanticKITTI	mIoU	11.08	MonoScene (RGB input only)
3D	KITTI-360	mIoU	12.31	MonoScene
3D	NYUv2	mIoU	26.94	MonoScene
3D	SemanticKITTI	mIoU	11.08	MonoScene
3D Semantic Scene Completion	NYUv2	mIoU	26.94	MonoScene (RGB input only)
3D Semantic Scene Completion	SemanticKITTI	mIoU	11.08	MonoScene (RGB input only)
3D Semantic Scene Completion	KITTI-360	mIoU	12.31	MonoScene
3D Semantic Scene Completion	NYUv2	mIoU	26.94	MonoScene
3D Semantic Scene Completion	SemanticKITTI	mIoU	11.08	MonoScene
3D Scene Reconstruction	KITTI-360	mIoU	12.31	MonoScene
3D Scene Reconstruction	NYUv2	mIoU	26.94	MonoScene
3D Scene Reconstruction	SemanticKITTI	mIoU	11.08	MonoScene
Single-View 3D Reconstruction	KITTI-360	mIoU	12.31	MonoScene
Single-View 3D Reconstruction	NYUv2	mIoU	26.94	MonoScene
Single-View 3D Reconstruction	SemanticKITTI	mIoU	11.08	MonoScene
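
For readers unfamiliar with the metric reported in the table, here is a minimal sketch of how a mean intersection-over-union (mIoU) score can be computed over voxel labels. It is not taken from the MonoScene code base. The function name `mean_iou`, the convention that class 0 means "empty" and is excluded from the average, and the `ignore_label` value of 255 are assumptions made for this example; the exact evaluation protocol of each benchmark may differ.

```python
import numpy as np

def mean_iou(pred, target, num_classes, ignore_label=255):
    """Hypothetical helper: mean IoU over semantic classes 1..num_classes-1.

    pred, target: integer label arrays of identical shape (e.g. a voxel grid).
    Voxels whose target equals ignore_label are excluded from scoring.
    Class 0 is assumed to mean "empty" and is not averaged.
    """
    valid = target != ignore_label
    pred, target = pred[valid], target[valid]

    ious = []
    for c in range(1, num_classes):
        intersection = np.sum((pred == c) & (target == c))
        union = np.sum((pred == c) | (target == c))
        if union > 0:  # skip classes absent from both prediction and target
            ious.append(intersection / union)
    return float(np.mean(ious)) if ious else float("nan")
```

The function returns a fraction in [0, 1]; multiplying by 100 gives a percentage on the same scale as the values in the table.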
