3D-SIS: 3D Semantic Instance Segmentation of RGB-D Scans

Ji Hou, Angela Dai, Matthias Nießner

2018-12-17CVPR 2019 63D Instance Segmentation Segmentation Semantic Segmentation 3D Reconstruction Instance Segmentation object-detection 3D Object Detection Object Detection 3D Semantic Instance Segmentation

Paper PDF Code(official)

Abstract

We introduce 3D-SIS, a novel neural network architecture for 3D semantic instance segmentation in commodity RGB-D scans. The core idea of our method is to jointly learn from both geometric and color signal, thus enabling accurate instance predictions. Rather than operate solely on 2D frames, we observe that most computer vision applications have multi-view RGB-D input available, which we leverage to construct an approach for 3D instance segmentation that effectively fuses together these multi-modal inputs. Our network leverages high-resolution RGB input by associating 2D images with the volumetric grid based on the pose alignment of the 3D reconstruction. For each image, we first extract 2D features for each pixel with a series of 2D convolutions; we then backproject the resulting feature vector to the associated voxel in the 3D grid. This combination of 2D and 3D feature learning allows significantly higher accuracy object detection and instance segmentation than state-of-the-art alternatives. We show results on both synthetic and real-world public benchmarks, achieving an improvement in mAP of over 13 on real-world data.

Results

Task	Dataset	Metric	Value	Model
Object Detection	ScanNetV2	mAP@0.25	40.2	3D-SIS
Object Detection	ScanNetV2	mAP@0.5	22.5	3D-SIS
3D	ScanNetV2	mAP@0.25	40.2	3D-SIS
3D	ScanNetV2	mAP@0.5	22.5	3D-SIS
Instance Segmentation	ScanNet(v2)	mAP @ 50	38.2	3D-SIS
Instance Segmentation	ScanNetV2	mAP@0.50	38.2	3D-SIS
3D Object Detection	ScanNetV2	mAP@0.25	40.2	3D-SIS
3D Object Detection	ScanNetV2	mAP@0.5	22.5	3D-SIS
2D Classification	ScanNetV2	mAP@0.25	40.2	3D-SIS
2D Classification	ScanNetV2	mAP@0.5	22.5	3D-SIS
2D Object Detection	ScanNetV2	mAP@0.25	40.2	3D-SIS
2D Object Detection	ScanNetV2	mAP@0.5	22.5	3D-SIS
16k	ScanNetV2	mAP@0.25	40.2	3D-SIS
16k	ScanNetV2	mAP@0.5	22.5	3D-SIS
3D Instance Segmentation	ScanNet(v2)	mAP @ 50	38.2	3D-SIS

3D-SIS: 3D Semantic Instance Segmentation of RGB-D Scans

Abstract

Results

Related Papers

3D-SIS: 3D Semantic Instance Segmentation of RGB-D Scans

Abstract

Results

Related Papers