TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Panoptic SegFormer: Delving Deeper into Panoptic Segmentat...

Panoptic SegFormer: Delving Deeper into Panoptic Segmentation with Transformers

Zhiqi Li, Wenhai Wang, Enze Xie, Zhiding Yu, Anima Anandkumar, Jose M. Alvarez, Ping Luo, Tong Lu

2021-09-08CVPR 2022 1Panoptic SegmentationSegmentationInstance Segmentation
PaperPDFCode(official)CodeCode

Abstract

Panoptic segmentation involves a combination of joint semantic segmentation and instance segmentation, where image contents are divided into two types: things and stuff. We present Panoptic SegFormer, a general framework for panoptic segmentation with transformers. It contains three innovative components: an efficient deeply-supervised mask decoder, a query decoupling strategy, and an improved post-processing method. We also use Deformable DETR to efficiently process multi-scale features, which is a fast and efficient version of DETR. Specifically, we supervise the attention modules in the mask decoder in a layer-wise manner. This deep supervision strategy lets the attention modules quickly focus on meaningful semantic regions. It improves performance and reduces the number of required training epochs by half compared to Deformable DETR. Our query decoupling strategy decouples the responsibilities of the query set and avoids mutual interference between things and stuff. In addition, our post-processing strategy improves performance without additional costs by jointly considering classification and segmentation qualities to resolve conflicting mask overlaps. Our approach increases the accuracy 6.2\% PQ over the baseline DETR model. Panoptic SegFormer achieves state-of-the-art results on COCO test-dev with 56.2\% PQ. It also shows stronger zero-shot robustness over existing methods. The code is released at \url{https://github.com/zhiqi-li/Panoptic-SegFormer}.

Results

TaskDatasetMetricValueModel
Semantic SegmentationCOCO test-devPQ56.2Panoptic SegFormer (Swin-L)
Semantic SegmentationCOCO test-devPQst47Panoptic SegFormer (Swin-L)
Semantic SegmentationCOCO test-devPQth62.3Panoptic SegFormer (Swin-L)
Semantic SegmentationCOCO test-devPQ55.8Panoptic SegFormer (PVTv2-B5)
Semantic SegmentationCOCO test-devPQst46.5Panoptic SegFormer (PVTv2-B5)
Semantic SegmentationCOCO test-devPQth61.9Panoptic SegFormer (PVTv2-B5)
Semantic SegmentationCOCO test-devPQ50.9Panoptic SegFormer (ResNet-101)
Semantic SegmentationCOCO test-devPQst43Panoptic SegFormer (ResNet-101)
Semantic SegmentationCOCO test-devPQth56.2Panoptic SegFormer (ResNet-101)
Semantic SegmentationCOCO test-devPQ50.2Panoptic SegFormer (ResNet-50)
Semantic SegmentationCOCO test-devPQst42.4Panoptic SegFormer (ResNet-50)
Semantic SegmentationCOCO test-devPQth55.3Panoptic SegFormer (ResNet-50)
Semantic SegmentationCOCO minivalPQ55.8Panoptic SegFormer (single-scale)
Semantic SegmentationCOCO minivalPQst46.9Panoptic SegFormer (single-scale)
Semantic SegmentationCOCO minivalPQth61.7Panoptic SegFormer (single-scale)
Semantic SegmentationCOCO minivalPQ50.6Panoptic SegFormer (ResNet-101)
Semantic SegmentationCOCO minivalPQst43.2Panoptic SegFormer (ResNet-101)
Semantic SegmentationCOCO minivalPQth55.5Panoptic SegFormer (ResNet-101)
10-shot image generationCOCO test-devPQ56.2Panoptic SegFormer (Swin-L)
10-shot image generationCOCO test-devPQst47Panoptic SegFormer (Swin-L)
10-shot image generationCOCO test-devPQth62.3Panoptic SegFormer (Swin-L)
10-shot image generationCOCO test-devPQ55.8Panoptic SegFormer (PVTv2-B5)
10-shot image generationCOCO test-devPQst46.5Panoptic SegFormer (PVTv2-B5)
10-shot image generationCOCO test-devPQth61.9Panoptic SegFormer (PVTv2-B5)
10-shot image generationCOCO test-devPQ50.9Panoptic SegFormer (ResNet-101)
10-shot image generationCOCO test-devPQst43Panoptic SegFormer (ResNet-101)
10-shot image generationCOCO test-devPQth56.2Panoptic SegFormer (ResNet-101)
10-shot image generationCOCO test-devPQ50.2Panoptic SegFormer (ResNet-50)
10-shot image generationCOCO test-devPQst42.4Panoptic SegFormer (ResNet-50)
10-shot image generationCOCO test-devPQth55.3Panoptic SegFormer (ResNet-50)
10-shot image generationCOCO minivalPQ55.8Panoptic SegFormer (single-scale)
10-shot image generationCOCO minivalPQst46.9Panoptic SegFormer (single-scale)
10-shot image generationCOCO minivalPQth61.7Panoptic SegFormer (single-scale)
10-shot image generationCOCO minivalPQ50.6Panoptic SegFormer (ResNet-101)
10-shot image generationCOCO minivalPQst43.2Panoptic SegFormer (ResNet-101)
10-shot image generationCOCO minivalPQth55.5Panoptic SegFormer (ResNet-101)
Panoptic SegmentationCOCO test-devPQ56.2Panoptic SegFormer (Swin-L)
Panoptic SegmentationCOCO test-devPQst47Panoptic SegFormer (Swin-L)
Panoptic SegmentationCOCO test-devPQth62.3Panoptic SegFormer (Swin-L)
Panoptic SegmentationCOCO test-devPQ55.8Panoptic SegFormer (PVTv2-B5)
Panoptic SegmentationCOCO test-devPQst46.5Panoptic SegFormer (PVTv2-B5)
Panoptic SegmentationCOCO test-devPQth61.9Panoptic SegFormer (PVTv2-B5)
Panoptic SegmentationCOCO test-devPQ50.9Panoptic SegFormer (ResNet-101)
Panoptic SegmentationCOCO test-devPQst43Panoptic SegFormer (ResNet-101)
Panoptic SegmentationCOCO test-devPQth56.2Panoptic SegFormer (ResNet-101)
Panoptic SegmentationCOCO test-devPQ50.2Panoptic SegFormer (ResNet-50)
Panoptic SegmentationCOCO test-devPQst42.4Panoptic SegFormer (ResNet-50)
Panoptic SegmentationCOCO test-devPQth55.3Panoptic SegFormer (ResNet-50)
Panoptic SegmentationCOCO minivalPQ55.8Panoptic SegFormer (single-scale)
Panoptic SegmentationCOCO minivalPQst46.9Panoptic SegFormer (single-scale)
Panoptic SegmentationCOCO minivalPQth61.7Panoptic SegFormer (single-scale)
Panoptic SegmentationCOCO minivalPQ50.6Panoptic SegFormer (ResNet-101)
Panoptic SegmentationCOCO minivalPQst43.2Panoptic SegFormer (ResNet-101)
Panoptic SegmentationCOCO minivalPQth55.5Panoptic SegFormer (ResNet-101)

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation2025-07-17Unleashing Vision Foundation Models for Coronary Artery Segmentation: Parallel ViT-CNN Encoding and Variational Fusion2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17