TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Medical/Semantic Segmentation/Cityscapes val

Semantic Segmentation on Cityscapes val

Metric: PQ (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕PQ▼Extra DataPaperDate↕Code
1ViT-P (OneFormer, InternImage-H)70.8NoThe Missing Point in Vision Transformers for Uni...2025-05-26Code
2OneFormer (ConvNeXt-L, single-scale, 512x1024, Mapillary Vistas-pretrained)70.1YesOneFormer: One Transformer to Rule Universal Ima...2022-11-10Code
3Panoptic-DeepLab (SWideRNet [1, 1, 4.5], Mapillary Vistas, multi-scale)69.6YesScaling Wide Residual Networks for Panoptic Segm...2020-11-23-
4OneFormer (ConvNeXt-L, single-scale)68.51NoOneFormer: One Transformer to Rule Universal Ima...2022-11-10Code
5Axial-DeepLab-XL (Mapillary Vistas, multi-scale)68.5YesAxial-DeepLab: Stand-Alone Axial-Attention for P...2020-03-17Code
6Panoptic-DeepLab (SWideRNet [1, 1, 4.5], Mapillary Vistas, single-scale)68.5YesScaling Wide Residual Networks for Panoptic Segm...2020-11-23-
7OneFormer (ConvNeXt-XL, single-scale)68.4NoOneFormer: One Transformer to Rule Universal Ima...2022-11-10Code
8kMaX-DeepLab (single-scale)68.4NokMaX-DeepLab: k-means Mask Transformer2022-07-08Code
9AFF-Base (single-scale, point-based Mask2Former)67.7NoAutoFocusFormer: Image Segmentation off the Grid2023-04-24Code
10OneFormer (DiNAT-L, single-scale)67.6NoOneFormer: One Transformer to Rule Universal Ima...2022-11-10Code
11EfficientPS67.5YesEfficientPS: Efficient Panoptic Segmentation2020-04-05Code
12DiNAT-L (Mask2Former)67.2NoDilated Neighborhood Attention Transformer2022-09-29Code
13OneFormer (Swin-L, single-scale)67.2NoOneFormer: One Transformer to Rule Universal Ima...2022-11-10Code
14AFF-Small (single-scale, point-based Mask2Former)66.9NoAutoFocusFormer: Image Segmentation off the Grid2023-04-24Code
15Mask2Former (Swin-L)66.6NoMasked-attention Mask Transformer for Universal ...2021-12-02Code
16EfficientPS (Cityscapes-fine)64.9NoEfficientPS: Efficient Panoptic Segmentation2020-04-05Code
17CMT-DeepLab (MaX-S, single-scale, IN-1K)64.6NoCMT-DeepLab: Clustering Mask Transformers for Pa...2022-06-17Code
18Panoptic-DeepLab (X71)64.1YesPanoptic-DeepLab: A Simple, Strong, and Fast Bas...2019-11-22Code
19Mask2Former + Intra-Batch Supervision (ResNet-50)62.4NoIntra-Batch Supervision for Panoptic Segmentatio...2023-04-17Code
20COPS (ResNet-50)62.1NoCombinatorial Optimization for Panoptic Segmenta...2021-06-06Code
21AdaptIS (ResNeXt-101)62NoAdaptIS: Adaptive Instance Selection Network2019-09-17-
22UPSNet (ResNet-101, multiscale)61.8YesUPSNet: A Unified Panoptic Segmentation Network2019-01-12Code
23Panoptic FCN* (ResNet-FPN)61.4NoFully Convolutional Networks for Panoptic Segmen...2020-12-01Code
24MRCNN + PSPNet (ResNet-101)61.2YesPanoptic Segmentation2018-01-03Code
25AdaptIS (ResNet-101)60.6NoAdaptIS: Adaptive Instance Selection Network2019-09-17-
26UPSNet (ResNet-101)60.5YesUPSNet: A Unified Panoptic Segmentation Network2019-01-12Code
27TASCNet (ResNet-50, multi-scale)60.4YesLearning to Fuse Things and Stuff2018-12-04-
28UPSNet (ResNet-50)59.3NoUPSNet: A Unified Panoptic Segmentation Network2019-01-12Code
29TASCNet (ResNet-50)59.2YesLearning to Fuse Things and Stuff2018-12-04-
30AUNet (ResNet-101-FPN)59NoAttention-guided Unified Network for Panoptic Se...2018-12-10-
31AdaptIS (ResNet-50)59NoAdaptIS: Adaptive Instance Selection Network2019-09-17-
32Panoptic FPN (ResNet-101)58.1NoPanoptic Feature Pyramid Networks2019-01-08Code
33DeeperLab (Xception-71)56.5NoDeeperLab: Single-Shot Image Parser2019-02-13-
34Dynamically Instantiated Network (ResNet-101)53.8NoWeakly- and Semi-Supervised Panoptic Segmentation2018-08-10Code