Panoptic SegFormer: Delving Deeper into Panoptic Segmentation with Transformers

Zhiqi Li, Wenhai Wang, Enze Xie, Zhiding Yu, Anima Anandkumar, Jose M. Alvarez, Ping Luo, Tong Lu

2021-09-08CVPR 2022 1Panoptic Segmentation Segmentation Instance Segmentation

Abstract

Panoptic segmentation involves a combination of joint semantic segmentation and instance segmentation, where image contents are divided into two types: things and stuff. We present Panoptic SegFormer, a general framework for panoptic segmentation with transformers. It contains three innovative components: an efficient deeply-supervised mask decoder, a query decoupling strategy, and an improved post-processing method. We also use Deformable DETR to efficiently process multi-scale features, which is a fast and efficient version of DETR. Specifically, we supervise the attention modules in the mask decoder in a layer-wise manner. This deep supervision strategy lets the attention modules quickly focus on meaningful semantic regions. It improves performance and reduces the number of required training epochs by half compared to Deformable DETR. Our query decoupling strategy decouples the responsibilities of the query set and avoids mutual interference between things and stuff. In addition, our post-processing strategy improves performance without additional costs by jointly considering classification and segmentation qualities to resolve conflicting mask overlaps. Our approach increases the accuracy 6.2\% PQ over the baseline DETR model. Panoptic SegFormer achieves state-of-the-art results on COCO test-dev with 56.2\% PQ. It also shows stronger zero-shot robustness over existing methods. The code is released at \url{https://github.com/zhiqi-li/Panoptic-SegFormer}.

Results

Task	Dataset	Metric	Value	Model
Semantic Segmentation	COCO test-dev	PQ	56.2	Panoptic SegFormer (Swin-L)
Semantic Segmentation	COCO test-dev	PQst	47	Panoptic SegFormer (Swin-L)
Semantic Segmentation	COCO test-dev	PQth	62.3	Panoptic SegFormer (Swin-L)
Semantic Segmentation	COCO test-dev	PQ	55.8	Panoptic SegFormer (PVTv2-B5)
Semantic Segmentation	COCO test-dev	PQst	46.5	Panoptic SegFormer (PVTv2-B5)
Semantic Segmentation	COCO test-dev	PQth	61.9	Panoptic SegFormer (PVTv2-B5)
Semantic Segmentation	COCO test-dev	PQ	50.9	Panoptic SegFormer (ResNet-101)
Semantic Segmentation	COCO test-dev	PQst	43	Panoptic SegFormer (ResNet-101)
Semantic Segmentation	COCO test-dev	PQth	56.2	Panoptic SegFormer (ResNet-101)
Semantic Segmentation	COCO test-dev	PQ	50.2	Panoptic SegFormer (ResNet-50)
Semantic Segmentation	COCO test-dev	PQst	42.4	Panoptic SegFormer (ResNet-50)
Semantic Segmentation	COCO test-dev	PQth	55.3	Panoptic SegFormer (ResNet-50)
Semantic Segmentation	COCO minival	PQ	55.8	Panoptic SegFormer (single-scale)
Semantic Segmentation	COCO minival	PQst	46.9	Panoptic SegFormer (single-scale)
Semantic Segmentation	COCO minival	PQth	61.7	Panoptic SegFormer (single-scale)
Semantic Segmentation	COCO minival	PQ	50.6	Panoptic SegFormer (ResNet-101)
Semantic Segmentation	COCO minival	PQst	43.2	Panoptic SegFormer (ResNet-101)
Semantic Segmentation	COCO minival	PQth	55.5	Panoptic SegFormer (ResNet-101)
10-shot image generation	COCO test-dev	PQ	56.2	Panoptic SegFormer (Swin-L)
10-shot image generation	COCO test-dev	PQst	47	Panoptic SegFormer (Swin-L)
10-shot image generation	COCO test-dev	PQth	62.3	Panoptic SegFormer (Swin-L)
10-shot image generation	COCO test-dev	PQ	55.8	Panoptic SegFormer (PVTv2-B5)
10-shot image generation	COCO test-dev	PQst	46.5	Panoptic SegFormer (PVTv2-B5)
10-shot image generation	COCO test-dev	PQth	61.9	Panoptic SegFormer (PVTv2-B5)
10-shot image generation	COCO test-dev	PQ	50.9	Panoptic SegFormer (ResNet-101)
10-shot image generation	COCO test-dev	PQst	43	Panoptic SegFormer (ResNet-101)
10-shot image generation	COCO test-dev	PQth	56.2	Panoptic SegFormer (ResNet-101)
10-shot image generation	COCO test-dev	PQ	50.2	Panoptic SegFormer (ResNet-50)
10-shot image generation	COCO test-dev	PQst	42.4	Panoptic SegFormer (ResNet-50)
10-shot image generation	COCO test-dev	PQth	55.3	Panoptic SegFormer (ResNet-50)
10-shot image generation	COCO minival	PQ	55.8	Panoptic SegFormer (single-scale)
10-shot image generation	COCO minival	PQst	46.9	Panoptic SegFormer (single-scale)
10-shot image generation	COCO minival	PQth	61.7	Panoptic SegFormer (single-scale)
10-shot image generation	COCO minival	PQ	50.6	Panoptic SegFormer (ResNet-101)
10-shot image generation	COCO minival	PQst	43.2	Panoptic SegFormer (ResNet-101)
10-shot image generation	COCO minival	PQth	55.5	Panoptic SegFormer (ResNet-101)
Panoptic Segmentation	COCO test-dev	PQ	56.2	Panoptic SegFormer (Swin-L)
Panoptic Segmentation	COCO test-dev	PQst	47	Panoptic SegFormer (Swin-L)
Panoptic Segmentation	COCO test-dev	PQth	62.3	Panoptic SegFormer (Swin-L)
Panoptic Segmentation	COCO test-dev	PQ	55.8	Panoptic SegFormer (PVTv2-B5)
Panoptic Segmentation	COCO test-dev	PQst	46.5	Panoptic SegFormer (PVTv2-B5)
Panoptic Segmentation	COCO test-dev	PQth	61.9	Panoptic SegFormer (PVTv2-B5)
Panoptic Segmentation	COCO test-dev	PQ	50.9	Panoptic SegFormer (ResNet-101)
Panoptic Segmentation	COCO test-dev	PQst	43	Panoptic SegFormer (ResNet-101)
Panoptic Segmentation	COCO test-dev	PQth	56.2	Panoptic SegFormer (ResNet-101)
Panoptic Segmentation	COCO test-dev	PQ	50.2	Panoptic SegFormer (ResNet-50)
Panoptic Segmentation	COCO test-dev	PQst	42.4	Panoptic SegFormer (ResNet-50)
Panoptic Segmentation	COCO test-dev	PQth	55.3	Panoptic SegFormer (ResNet-50)
Panoptic Segmentation	COCO minival	PQ	55.8	Panoptic SegFormer (single-scale)
Panoptic Segmentation	COCO minival	PQst	46.9	Panoptic SegFormer (single-scale)
Panoptic Segmentation	COCO minival	PQth	61.7	Panoptic SegFormer (single-scale)
Panoptic Segmentation	COCO minival	PQ	50.6	Panoptic SegFormer (ResNet-101)
Panoptic Segmentation	COCO minival	PQst	43.2	Panoptic SegFormer (ResNet-101)
Panoptic Segmentation	COCO minival	PQth	55.5	Panoptic SegFormer (ResNet-101)

Abstract

Results

Task	Dataset	Metric	Value	Model
Semantic Segmentation	COCO test-dev	PQ	56.2	Panoptic SegFormer (Swin-L)
Semantic Segmentation	COCO test-dev	PQst	47	Panoptic SegFormer (Swin-L)
Semantic Segmentation	COCO test-dev	PQth	62.3	Panoptic SegFormer (Swin-L)
Semantic Segmentation	COCO test-dev	PQ	55.8	Panoptic SegFormer (PVTv2-B5)
Semantic Segmentation	COCO test-dev	PQst	46.5	Panoptic SegFormer (PVTv2-B5)
Semantic Segmentation	COCO test-dev	PQth	61.9	Panoptic SegFormer (PVTv2-B5)
Semantic Segmentation	COCO test-dev	PQ	50.9	Panoptic SegFormer (ResNet-101)
Semantic Segmentation	COCO test-dev	PQst	43	Panoptic SegFormer (ResNet-101)
Semantic Segmentation	COCO test-dev	PQth	56.2	Panoptic SegFormer (ResNet-101)
Semantic Segmentation	COCO test-dev	PQ	50.2	Panoptic SegFormer (ResNet-50)
Semantic Segmentation	COCO test-dev	PQst	42.4	Panoptic SegFormer (ResNet-50)
Semantic Segmentation	COCO test-dev	PQth	55.3	Panoptic SegFormer (ResNet-50)
Semantic Segmentation	COCO minival	PQ	55.8	Panoptic SegFormer (single-scale)
Semantic Segmentation	COCO minival	PQst	46.9	Panoptic SegFormer (single-scale)
Semantic Segmentation	COCO minival	PQth	61.7	Panoptic SegFormer (single-scale)
Semantic Segmentation	COCO minival	PQ	50.6	Panoptic SegFormer (ResNet-101)
Semantic Segmentation	COCO minival	PQst	43.2	Panoptic SegFormer (ResNet-101)
Semantic Segmentation	COCO minival	PQth	55.5	Panoptic SegFormer (ResNet-101)
10-shot image generation	COCO test-dev	PQ	56.2	Panoptic SegFormer (Swin-L)
10-shot image generation	COCO test-dev	PQst	47	Panoptic SegFormer (Swin-L)
10-shot image generation	COCO test-dev	PQth	62.3	Panoptic SegFormer (Swin-L)
10-shot image generation	COCO test-dev	PQ	55.8	Panoptic SegFormer (PVTv2-B5)
10-shot image generation	COCO test-dev	PQst	46.5	Panoptic SegFormer (PVTv2-B5)
10-shot image generation	COCO test-dev	PQth	61.9	Panoptic SegFormer (PVTv2-B5)
10-shot image generation	COCO test-dev	PQ	50.9	Panoptic SegFormer (ResNet-101)
10-shot image generation	COCO test-dev	PQst	43	Panoptic SegFormer (ResNet-101)
10-shot image generation	COCO test-dev	PQth	56.2	Panoptic SegFormer (ResNet-101)
10-shot image generation	COCO test-dev	PQ	50.2	Panoptic SegFormer (ResNet-50)
10-shot image generation	COCO test-dev	PQst	42.4	Panoptic SegFormer (ResNet-50)
10-shot image generation	COCO test-dev	PQth	55.3	Panoptic SegFormer (ResNet-50)
10-shot image generation	COCO minival	PQ	55.8	Panoptic SegFormer (single-scale)
10-shot image generation	COCO minival	PQst	46.9	Panoptic SegFormer (single-scale)
10-shot image generation	COCO minival	PQth	61.7	Panoptic SegFormer (single-scale)
10-shot image generation	COCO minival	PQ	50.6	Panoptic SegFormer (ResNet-101)
10-shot image generation	COCO minival	PQst	43.2	Panoptic SegFormer (ResNet-101)
10-shot image generation	COCO minival	PQth	55.5	Panoptic SegFormer (ResNet-101)
Panoptic Segmentation	COCO test-dev	PQ	56.2	Panoptic SegFormer (Swin-L)
Panoptic Segmentation	COCO test-dev	PQst	47	Panoptic SegFormer (Swin-L)
Panoptic Segmentation	COCO test-dev	PQth	62.3	Panoptic SegFormer (Swin-L)
Panoptic Segmentation	COCO test-dev	PQ	55.8	Panoptic SegFormer (PVTv2-B5)
Panoptic Segmentation	COCO test-dev	PQst	46.5	Panoptic SegFormer (PVTv2-B5)
Panoptic Segmentation	COCO test-dev	PQth	61.9	Panoptic SegFormer (PVTv2-B5)
Panoptic Segmentation	COCO test-dev	PQ	50.9	Panoptic SegFormer (ResNet-101)
Panoptic Segmentation	COCO test-dev	PQst	43	Panoptic SegFormer (ResNet-101)
Panoptic Segmentation	COCO test-dev	PQth	56.2	Panoptic SegFormer (ResNet-101)
Panoptic Segmentation	COCO test-dev	PQ	50.2	Panoptic SegFormer (ResNet-50)
Panoptic Segmentation	COCO test-dev	PQst	42.4	Panoptic SegFormer (ResNet-50)
Panoptic Segmentation	COCO test-dev	PQth	55.3	Panoptic SegFormer (ResNet-50)
Panoptic Segmentation	COCO minival	PQ	55.8	Panoptic SegFormer (single-scale)
Panoptic Segmentation	COCO minival	PQst	46.9	Panoptic SegFormer (single-scale)
Panoptic Segmentation	COCO minival	PQth	61.7	Panoptic SegFormer (single-scale)
Panoptic Segmentation	COCO minival	PQ	50.6	Panoptic SegFormer (ResNet-101)
Panoptic Segmentation	COCO minival	PQst	43.2	Panoptic SegFormer (ResNet-101)
Panoptic Segmentation	COCO minival	PQth	55.5	Panoptic SegFormer (ResNet-101)

Panoptic SegFormer: Delving Deeper into Panoptic Segmentation with Transformers

Abstract

Results

Related Papers

Panoptic SegFormer: Delving Deeper into Panoptic Segmentation with Transformers

Abstract

Results

Related Papers