Semantic Scene Completion from a Single Depth Image

Shuran Song, Fisher Yu, Andy Zeng, Angel X. Chang, Manolis Savva, Thomas Funkhouser

2016-11-28CVPR 2017 73D Semantic Scene Completion

Abstract

This paper focuses on semantic scene completion, a task for producing a complete 3D voxel representation of volumetric occupancy and semantic labels for a scene from a single-view depth map observation. Previous work has considered scene completion and semantic labeling of depth maps separately. However, we observe that these two problems are tightly intertwined. To leverage the coupled nature of these two tasks, we introduce the semantic scene completion network (SSCNet), an end-to-end 3D convolutional network that takes a single depth image as input and simultaneously outputs occupancy and semantic labels for all voxels in the camera view frustum. Our network uses a dilation-based 3D context module to efficiently expand the receptive field and enable 3D context learning. To train our network, we construct SUNCG - a manually created large-scale dataset of synthetic 3D scenes with dense volumetric annotations. Our experiments demonstrate that the joint model outperforms methods addressing each task in isolation and outperforms alternative approaches on the semantic scene completion task.

Results

Task	Dataset	Metric	Value	Model
3D Reconstruction	NYUv2	mIoU	30.5	SSCNet (SUNCG pretraining)
3D Reconstruction	NYUv2	mIoU	24.7	SSCNet
3D Reconstruction	SemanticKITTI	mIoU	16.1	SSCNet (reported in LMSCNet)
3D Reconstruction	SemanticKITTI	mIoU	16.1	SSCNet-full (reported in LMSCNet)
3D Reconstruction	KITTI-360	mIoU	16.95	SSCNet
3D	NYUv2	mIoU	30.5	SSCNet (SUNCG pretraining)
3D	NYUv2	mIoU	24.7	SSCNet
3D	SemanticKITTI	mIoU	16.1	SSCNet (reported in LMSCNet)
3D	SemanticKITTI	mIoU	16.1	SSCNet-full (reported in LMSCNet)
3D	KITTI-360	mIoU	16.95	SSCNet
3D Semantic Scene Completion	NYUv2	mIoU	30.5	SSCNet (SUNCG pretraining)
3D Semantic Scene Completion	NYUv2	mIoU	24.7	SSCNet
3D Semantic Scene Completion	SemanticKITTI	mIoU	16.1	SSCNet (reported in LMSCNet)
3D Semantic Scene Completion	SemanticKITTI	mIoU	16.1	SSCNet-full (reported in LMSCNet)
3D Semantic Scene Completion	KITTI-360	mIoU	16.95	SSCNet

Semantic Scene Completion from a Single Depth Image

Abstract

Results

Related Papers

Semantic Scene Completion from a Single Depth Image

Abstract

Results

Related Papers