TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/3D-SIS: 3D Semantic Instance Segmentation of RGB-D Scans

3D-SIS: 3D Semantic Instance Segmentation of RGB-D Scans

Ji Hou, Angela Dai, Matthias Nießner

2018-12-17CVPR 2019 63D Instance SegmentationSegmentationSemantic Segmentation3D ReconstructionInstance Segmentationobject-detection3D Object DetectionObject Detection3D Semantic Instance Segmentation
PaperPDFCode(official)

Abstract

We introduce 3D-SIS, a novel neural network architecture for 3D semantic instance segmentation in commodity RGB-D scans. The core idea of our method is to jointly learn from both geometric and color signal, thus enabling accurate instance predictions. Rather than operate solely on 2D frames, we observe that most computer vision applications have multi-view RGB-D input available, which we leverage to construct an approach for 3D instance segmentation that effectively fuses together these multi-modal inputs. Our network leverages high-resolution RGB input by associating 2D images with the volumetric grid based on the pose alignment of the 3D reconstruction. For each image, we first extract 2D features for each pixel with a series of 2D convolutions; we then backproject the resulting feature vector to the associated voxel in the 3D grid. This combination of 2D and 3D feature learning allows significantly higher accuracy object detection and instance segmentation than state-of-the-art alternatives. We show results on both synthetic and real-world public benchmarks, achieving an improvement in mAP of over 13 on real-world data.

Results

TaskDatasetMetricValueModel
Object DetectionScanNetV2mAP@0.2540.23D-SIS
Object DetectionScanNetV2mAP@0.522.53D-SIS
3DScanNetV2mAP@0.2540.23D-SIS
3DScanNetV2mAP@0.522.53D-SIS
Instance SegmentationScanNet(v2)mAP @ 5038.23D-SIS
Instance SegmentationScanNetV2mAP@0.5038.23D-SIS
3D Object DetectionScanNetV2mAP@0.2540.23D-SIS
3D Object DetectionScanNetV2mAP@0.522.53D-SIS
2D ClassificationScanNetV2mAP@0.2540.23D-SIS
2D ClassificationScanNetV2mAP@0.522.53D-SIS
2D Object DetectionScanNetV2mAP@0.2540.23D-SIS
2D Object DetectionScanNetV2mAP@0.522.53D-SIS
16kScanNetV2mAP@0.2540.23D-SIS
16kScanNetV2mAP@0.522.53D-SIS
3D Instance SegmentationScanNet(v2)mAP @ 5038.23D-SIS

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation2025-07-17Unleashing Vision Foundation Models for Coronary Artery Segmentation: Parallel ViT-CNN Encoding and Variational Fusion2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17