TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/OneFormer3D: One Transformer for Unified Point Cloud Segme...

OneFormer3D: One Transformer for Unified Point Cloud Segmentation

Maxim Kolodiazhnyi, Anna Vorontsova, Anton Konushin, Danila Rukhovich

2023-11-24CVPR 2024 1Panoptic Segmentation3D Instance SegmentationSegmentationSemantic SegmentationPoint Cloud Segmentation3D Semantic Segmentation3D Object Detection
PaperPDFCode

Abstract

Semantic, instance, and panoptic segmentation of 3D point clouds have been addressed using task-specific models of distinct design. Thereby, the similarity of all segmentation tasks and the implicit relationship between them have not been utilized effectively. This paper presents a unified, simple, and effective model addressing all these tasks jointly. The model, named OneFormer3D, performs instance and semantic segmentation consistently, using a group of learnable kernels, where each kernel is responsible for generating a mask for either an instance or a semantic category. These kernels are trained with a transformer-based decoder with unified instance and semantic queries passed as an input. Such a design enables training a model end-to-end in a single run, so that it achieves top performance on all three segmentation tasks simultaneously. Specifically, our OneFormer3D ranks 1st and sets a new state-of-the-art (+2.1 mAP50) in the ScanNet test leaderboard. We also demonstrate the state-of-the-art results in semantic, instance, and panoptic segmentation of ScanNet (+21 PQ), ScanNet200 (+3.8 mAP50), and S3DIS (+0.8 mIoU) datasets.

Results

TaskDatasetMetricValueModel
Semantic SegmentationScanNetval mIoU76.6OneFormer3D
Semantic SegmentationScanNetPQ71.2OneFormer3D
Semantic SegmentationScanNetPQ_st86.1OneFormer3D
Semantic SegmentationScanNetPQ_th69.6OneFormer3D
Semantic SegmentationScanNetV2PQ71.2OneFormer3D
Semantic SegmentationScanNet200val mIoU30.1OneFormer3D
Semantic SegmentationS3DISmIoU (6-Fold)75OneFormer3D
Semantic SegmentationS3DISmIoU (Area-5)72.4OneFormer3D
Object DetectionScanNetV2mAP@0.2576.9OneFormer3D
Object DetectionScanNetV2mAP@0.565.3OneFormer3D
3DScanNetV2mAP@0.2576.9OneFormer3D
3DScanNetV2mAP@0.565.3OneFormer3D
Instance SegmentationS3DISAP@5075.8OneFormer3D
Instance SegmentationS3DISmAP63OneFormer3D
Instance SegmentationS3DISmPrec82.3OneFormer3D
Instance SegmentationS3DISmRec74.1OneFormer3D
Instance SegmentationScanNet(v2)mAP56.6OneFromer3D
Instance SegmentationScanNet(v2)mAP @ 5080.1OneFromer3D
Instance SegmentationScanNet(v2)mAP@2589.6OneFromer3D
3D Semantic SegmentationScanNet200val mIoU30.1OneFormer3D
3D Semantic SegmentationS3DISmIoU (6-Fold)75OneFormer3D
3D Semantic SegmentationS3DISmIoU (Area-5)72.4OneFormer3D
3D Object DetectionScanNetV2mAP@0.2576.9OneFormer3D
3D Object DetectionScanNetV2mAP@0.565.3OneFormer3D
2D ClassificationScanNetV2mAP@0.2576.9OneFormer3D
2D ClassificationScanNetV2mAP@0.565.3OneFormer3D
2D Object DetectionScanNetV2mAP@0.2576.9OneFormer3D
2D Object DetectionScanNetV2mAP@0.565.3OneFormer3D
10-shot image generationScanNetval mIoU76.6OneFormer3D
10-shot image generationScanNetPQ71.2OneFormer3D
10-shot image generationScanNetPQ_st86.1OneFormer3D
10-shot image generationScanNetPQ_th69.6OneFormer3D
10-shot image generationScanNetV2PQ71.2OneFormer3D
10-shot image generationScanNet200val mIoU30.1OneFormer3D
10-shot image generationS3DISmIoU (6-Fold)75OneFormer3D
10-shot image generationS3DISmIoU (Area-5)72.4OneFormer3D
Panoptic SegmentationScanNetPQ71.2OneFormer3D
Panoptic SegmentationScanNetPQ_st86.1OneFormer3D
Panoptic SegmentationScanNetPQ_th69.6OneFormer3D
Panoptic SegmentationScanNetV2PQ71.2OneFormer3D
16kScanNetV2mAP@0.2576.9OneFormer3D
16kScanNetV2mAP@0.565.3OneFormer3D
3D Instance SegmentationS3DISAP@5075.8OneFormer3D
3D Instance SegmentationS3DISmAP63OneFormer3D
3D Instance SegmentationS3DISmPrec82.3OneFormer3D
3D Instance SegmentationS3DISmRec74.1OneFormer3D
3D Instance SegmentationScanNet(v2)mAP56.6OneFromer3D
3D Instance SegmentationScanNet(v2)mAP @ 5080.1OneFromer3D
3D Instance SegmentationScanNet(v2)mAP@2589.6OneFromer3D

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation2025-07-17Unleashing Vision Foundation Models for Coronary Artery Segmentation: Parallel ViT-CNN Encoding and Variational Fusion2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17