TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/CMT-DeepLab: Clustering Mask Transformers for Panoptic Seg...

CMT-DeepLab: Clustering Mask Transformers for Panoptic Segmentation

Qihang Yu, Huiyu Wang, Dahun Kim, Siyuan Qiao, Maxwell Collins, Yukun Zhu, Hartwig Adam, Alan Yuille, Liang-Chieh Chen

2022-06-17CVPR 2022 1Panoptic SegmentationSegmentationClustering
PaperPDFCodeCode

Abstract

We propose Clustering Mask Transformer (CMT-DeepLab), a transformer-based framework for panoptic segmentation designed around clustering. It rethinks the existing transformer architectures used in segmentation and detection; CMT-DeepLab considers the object queries as cluster centers, which fill the role of grouping the pixels when applied to segmentation. The clustering is computed with an alternating procedure, by first assigning pixels to the clusters by their feature affinity, and then updating the cluster centers and pixel features. Together, these operations comprise the Clustering Mask Transformer (CMT) layer, which produces cross-attention that is denser and more consistent with the final segmentation task. CMT-DeepLab improves the performance over prior art significantly by 4.4% PQ, achieving a new state-of-the-art of 55.7% PQ on the COCO test-dev set.

Results

TaskDatasetMetricValueModel
Semantic SegmentationCityscapes valPQ64.6CMT-DeepLab (MaX-S, single-scale, IN-1K)
Semantic SegmentationCityscapes valmIoU81.4CMT-DeepLab (MaX-S, single-scale, IN-1K)
Semantic SegmentationCOCO test-devPQ55.7CMT-DeepLab (single-scale)
Semantic SegmentationCOCO test-devPQst46.8CMT-DeepLab (single-scale)
Semantic SegmentationCOCO test-devPQth61.6CMT-DeepLab (single-scale)
Semantic SegmentationCOCO minivalPQ55.3CMT-DeepLab (single-scale)
Semantic SegmentationCOCO minivalPQst46.6CMT-DeepLab (single-scale)
Semantic SegmentationCOCO minivalPQth61CMT-DeepLab (single-scale)
10-shot image generationCityscapes valPQ64.6CMT-DeepLab (MaX-S, single-scale, IN-1K)
10-shot image generationCityscapes valmIoU81.4CMT-DeepLab (MaX-S, single-scale, IN-1K)
10-shot image generationCOCO test-devPQ55.7CMT-DeepLab (single-scale)
10-shot image generationCOCO test-devPQst46.8CMT-DeepLab (single-scale)
10-shot image generationCOCO test-devPQth61.6CMT-DeepLab (single-scale)
10-shot image generationCOCO minivalPQ55.3CMT-DeepLab (single-scale)
10-shot image generationCOCO minivalPQst46.6CMT-DeepLab (single-scale)
10-shot image generationCOCO minivalPQth61CMT-DeepLab (single-scale)
Panoptic SegmentationCityscapes valPQ64.6CMT-DeepLab (MaX-S, single-scale, IN-1K)
Panoptic SegmentationCityscapes valmIoU81.4CMT-DeepLab (MaX-S, single-scale, IN-1K)
Panoptic SegmentationCOCO test-devPQ55.7CMT-DeepLab (single-scale)
Panoptic SegmentationCOCO test-devPQst46.8CMT-DeepLab (single-scale)
Panoptic SegmentationCOCO test-devPQth61.6CMT-DeepLab (single-scale)
Panoptic SegmentationCOCO minivalPQ55.3CMT-DeepLab (single-scale)
Panoptic SegmentationCOCO minivalPQst46.6CMT-DeepLab (single-scale)
Panoptic SegmentationCOCO minivalPQth61CMT-DeepLab (single-scale)

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Tri-Learn Graph Fusion Network for Attributed Graph Clustering2025-07-18Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation2025-07-17Unleashing Vision Foundation Models for Coronary Artery Segmentation: Parallel ViT-CNN Encoding and Variational Fusion2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17