TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Scene-Centric Unsupervised Panoptic Segmentation

Scene-Centric Unsupervised Panoptic Segmentation

Oliver Hahn, Christoph Reich, Nikita Araslanov, Daniel Cremers, Christian Rupprecht, Stefan Roth

2025-04-02CVPR 2025 1Panoptic SegmentationUnsupervised Semantic SegmentationScene UnderstandingSegmentationSemantic SegmentationUnsupervised Panoptic SegmentationInstance SegmentationUnsupervised Object Detection
PaperPDFCode(official)

Abstract

Unsupervised panoptic segmentation aims to partition an image into semantically meaningful regions and distinct object instances without training on manually annotated data. In contrast to prior work on unsupervised panoptic scene understanding, we eliminate the need for object-centric training data, enabling the unsupervised understanding of complex scenes. To that end, we present the first unsupervised panoptic method that directly trains on scene-centric imagery. In particular, we propose an approach to obtain high-resolution panoptic pseudo labels on complex scene-centric data, combining visual representations, depth, and motion cues. Utilizing both pseudo-label training and a panoptic self-training strategy yields a novel approach that accurately predicts panoptic segmentation of complex scenes without requiring any human annotations. Our approach significantly improves panoptic quality, e.g., surpassing the recent state of the art in unsupervised panoptic segmentation on Cityscapes by 9.4% points in PQ.

Results

TaskDatasetMetricValueModel
Semantic SegmentationCityscapes testAccuracy83.2CUPS
Semantic SegmentationCityscapes testmIoU26.8CUPS
Unsupervised Semantic SegmentationCityscapes testAccuracy83.2CUPS
Unsupervised Semantic SegmentationCityscapes testmIoU26.8CUPS
10-shot image generationCityscapes testAccuracy83.2CUPS
10-shot image generationCityscapes testmIoU26.8CUPS
Unsupervised Panoptic SegmentationWaymo Open DatasetPQ27.3CUPS (54 pseudo-classes)
Unsupervised Panoptic SegmentationWaymo Open DatasetPQ27.2CUPS (40 pseudo-classes)
Unsupervised Panoptic SegmentationWaymo Open DatasetPQ26.4CUPS (27 pseudo-classes)
Unsupervised Panoptic SegmentationCityscapesPQ30.6CUPS (54 pseudo-classes)
Unsupervised Panoptic SegmentationCityscapesPQ30.3CUPS (40 pseudo-classes)
Unsupervised Panoptic SegmentationCityscapesPQ27.8CUPS (27 pseudo-classes)
Unsupervised Panoptic SegmentationMUSES: MUlti-SEnsor Semantic perception datasetPQ28.2CUPS (40 pseudo-classes)
Unsupervised Panoptic SegmentationMUSES: MUlti-SEnsor Semantic perception datasetPQ24.4CUPS (27 pseudo-classes)
Unsupervised Panoptic SegmentationMUSES: MUlti-SEnsor Semantic perception datasetPQ22.8CUPS (54 pseudo-classes)
Unsupervised Panoptic SegmentationBDD100K valPQ21.9CUPS (40 pseudo-classes)
Unsupervised Panoptic SegmentationBDD100K valPQ21.8CUPS (54 pseudo-classes)
Unsupervised Panoptic SegmentationBDD100K valPQ19.9CUPS (27 pseudo-classes)
Unsupervised Panoptic SegmentationKITTIPQ28.5CUPS (54 pseudo-classes)
Unsupervised Panoptic SegmentationKITTIPQ28.1CUPS (40 pseudo-classes)
Unsupervised Panoptic SegmentationKITTIPQ25.5CUPS (27 pseudo-classes)
2D Panoptic SegmentationWaymo Open DatasetPQ27.3CUPS (54 pseudo-classes)
2D Panoptic SegmentationWaymo Open DatasetPQ27.2CUPS (40 pseudo-classes)
2D Panoptic SegmentationWaymo Open DatasetPQ26.4CUPS (27 pseudo-classes)
2D Panoptic SegmentationCityscapesPQ30.6CUPS (54 pseudo-classes)
2D Panoptic SegmentationCityscapesPQ30.3CUPS (40 pseudo-classes)
2D Panoptic SegmentationCityscapesPQ27.8CUPS (27 pseudo-classes)
2D Panoptic SegmentationMUSES: MUlti-SEnsor Semantic perception datasetPQ28.2CUPS (40 pseudo-classes)
2D Panoptic SegmentationMUSES: MUlti-SEnsor Semantic perception datasetPQ24.4CUPS (27 pseudo-classes)
2D Panoptic SegmentationMUSES: MUlti-SEnsor Semantic perception datasetPQ22.8CUPS (54 pseudo-classes)
2D Panoptic SegmentationBDD100K valPQ21.9CUPS (40 pseudo-classes)
2D Panoptic SegmentationBDD100K valPQ21.8CUPS (54 pseudo-classes)
2D Panoptic SegmentationBDD100K valPQ19.9CUPS (27 pseudo-classes)
2D Panoptic SegmentationKITTIPQ28.5CUPS (54 pseudo-classes)
2D Panoptic SegmentationKITTIPQ28.1CUPS (40 pseudo-classes)
2D Panoptic SegmentationKITTIPQ25.5CUPS (27 pseudo-classes)

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Advancing Complex Wide-Area Scene Understanding with Hierarchical Coresets Selection2025-07-17Argus: Leveraging Multiview Images for Improved 3-D Scene Understanding With Large Language Models2025-07-17City-VLM: Towards Multidomain Perception Scene Understanding via Multimodal Incomplete Learning2025-07-17Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation2025-07-17Unleashing Vision Foundation Models for Coronary Artery Segmentation: Parallel ViT-CNN Encoding and Variational Fusion2025-07-17