TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Instance Segmentation in 3D Scenes using Semantic Superpoi...

Instance Segmentation in 3D Scenes using Semantic Superpoint Tree Networks

Zhihao Liang, Zhihao LI, Songcen Xu, Mingkui Tan, Kui Jia

2021-08-17ICCV 2021 103D Instance SegmentationScene UnderstandingSemantic SegmentationInstance Segmentation
PaperPDFCode(official)

Abstract

Instance segmentation in 3D scenes is fundamental in many applications of scene understanding. It is yet challenging due to the compound factors of data irregularity and uncertainty in the numbers of instances. State-of-the-art methods largely rely on a general pipeline that first learns point-wise features discriminative at semantic and instance levels, followed by a separate step of point grouping for proposing object instances. While promising, they have the shortcomings that (1) the second step is not supervised by the main objective of instance segmentation, and (2) their point-wise feature learning and grouping are less effective to deal with data irregularities, possibly resulting in fragmented segmentations. To address these issues, we propose in this work an end-to-end solution of Semantic Superpoint Tree Network (SSTNet) for proposing object instances from scene points. Key in SSTNet is an intermediate, semantic superpoint tree (SST), which is constructed based on the learned semantic features of superpoints, and which will be traversed and split at intermediate tree nodes for proposals of object instances. We also design in SSTNet a refinement module, termed CliqueNet, to prune superpoints that may be wrongly grouped into instance proposals. Experiments on the benchmarks of ScanNet and S3DIS show the efficacy of our proposed method. At the time of submission, SSTNet ranks top on the ScanNet (V2) leaderboard, with 2% higher of mAP than the second best method. The source code in PyTorch is available at https://github.com/Gorilla-Lab-SCUT/SSTNet.

Results

TaskDatasetMetricValueModel
Instance SegmentationS3DISAP@5067.8SSTNet
Instance SegmentationS3DISmAP54.1SSTNet
Instance SegmentationS3DISmPrec73.5SSTNet
Instance SegmentationS3DISmRec73.4SSTNet
Instance SegmentationScanNet(v2)mAP50.6SSTNet
Instance SegmentationScanNet(v2)mAP @ 5069.8SSTNet
3D Instance SegmentationS3DISAP@5067.8SSTNet
3D Instance SegmentationS3DISmAP54.1SSTNet
3D Instance SegmentationS3DISmPrec73.5SSTNet
3D Instance SegmentationS3DISmRec73.4SSTNet
3D Instance SegmentationScanNet(v2)mAP50.6SSTNet
3D Instance SegmentationScanNet(v2)mAP @ 5069.8SSTNet

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Advancing Complex Wide-Area Scene Understanding with Hierarchical Coresets Selection2025-07-17Argus: Leveraging Multiview Images for Improved 3-D Scene Understanding With Large Language Models2025-07-17City-VLM: Towards Multidomain Perception Scene Understanding via Multimodal Incomplete Learning2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17