Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

10-shot image generation on COCO minival

Metric: PQst (higher is better)

Results

| # | Model | PQst | Extra Data | Paper | Date | Code |
|---|-------|------|------------|-------|------|------|
| 1 | OneFormer (InternImage-H,single-scale) | 49.2 | No | OneFormer: One Transformer to Rule Universal Ima... | 2022-11-10 | Code |
| 2 | DiNAT-L (single-scale, Mask2Former) | 48.8 | No | Dilated Neighborhood Attention Transformer | 2022-09-29 | Code |
| 3 | kMaX-DeepLab (single-scale, pseudo-labels) | 48.8 | Yes | kMaX-DeepLab: k-means Mask Transformer | 2022-07-08 | Code |
| 4 | kMaX-DeepLab (single-scale, drop query with 256 queries) | 48.6 | No | kMaX-DeepLab: k-means Mask Transformer | 2022-07-08 | Code |
| 5 | kMaX-DeepLab (single-scale) | 48.6 | No | kMaX-DeepLab: k-means Mask Transformer | 2022-07-08 | Code |
| 6 | ViT-Adapter-L (single-scale, BEiTv2 pretrain, Mask2Former) | 48.4 | No | Vision Transformer Adapter for Dense Predictions | 2022-05-17 | Code |
| 7 | OneFormer (DiNAT-L, single-scale) | 48.4 | No | OneFormer: One Transformer to Rule Universal Ima... | 2022-11-10 | Code |
| 8 | Visual Attention Network (VAN-B6 + Mask2Former) | 48.2 | No | Visual Attention Network | 2022-02-20 | Code |
| 9 | Mask2Former (single-scale) | 48.1 | No | Masked-attention Mask Transformer for Universal ... | 2021-12-02 | Code |
| 10 | OneFormer (Swin-L, single-scale) | 48 | No | OneFormer: One Transformer to Rule Universal Ima... | 2022-11-10 | Code |
| 11 | Panoptic SegFormer (single-scale) | 46.9 | No | Panoptic SegFormer: Delving Deeper into Panoptic... | 2021-09-08 | Code |
| 12 | CMT-DeepLab (single-scale) | 46.6 | No | CMT-DeepLab: Clustering Mask Transformers for Pa... | 2022-06-17 | Code |
| 13 | MaskFormer (single-scale) | 44 | No | Per-Pixel Classification is Not All You Need for... | 2021-07-13 | Code |
| 14 | Panoptic SegFormer (ResNet-101) | 43.2 | No | Panoptic SegFormer: Delving Deeper into Panoptic... | 2021-09-08 | Code |
| 15 | MaX-DeepLab-L (single-scale) | 42.2 | No | MaX-DeepLab: End-to-End Panoptic Segmentation wi... | 2020-12-01 | Code |
| 16 | PanopticFPN+ResNeSt(single-scale) | 37 | No | ResNeSt: Split-Attention Networks | 2020-04-19 | Code |
| 17 | DETR-R101 (ResNet-101) | 37 | No | End-to-End Object Detection with Transformers | 2020-05-26 | Code |
| 18 | Axial-DeepLab-L(multi-scale) | 36.8 | No | Axial-DeepLab: Stand-Alone Axial-Attention for P... | 2020-03-17 | Code |
| 19 | Panoptic FCN* (ResNet-50-FPN) | 35.6 | No | Fully Convolutional Networks for Panoptic Segmen... | 2020-12-01 | Code |
| 20 | Axial-DeepLab-L (single-scale) | 35.6 | No | Axial-DeepLab: Stand-Alone Axial-Attention for P... | 2020-03-17 | Code |
| 21 | PanopticFPN++ | 33.6 | No | End-to-End Object Detection with Transformers | 2020-05-26 | Code |
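
Because PQst is a higher-is-better metric, the ordering of the results can be reproduced by sorting entries on PQst in descending order. The following is a minimal sketch of that ranking, using a handful of rows copied from the results above. The `Entry` type and `rank` function are hypothetical names introduced for illustration, and the tie-breaking rule (tied scores keep their input order) is an assumption; the page does not state how ties are resolved.

```python
from typing import List, NamedTuple


class Entry(NamedTuple):
    """One leaderboard row (hypothetical structure, for illustration only)."""
    model: str
    pq_st: float
    date: str


# A few rows copied from the results, deliberately given out of order.
rows = [
    Entry("Mask2Former (single-scale)", 48.1, "2021-12-02"),
    Entry("OneFormer (InternImage-H,single-scale)", 49.2, "2022-11-10"),
    Entry("DiNAT-L (single-scale, Mask2Former)", 48.8, "2022-09-29"),
    Entry("kMaX-DeepLab (single-scale, pseudo-labels)", 48.8, "2022-07-08"),
    Entry("MaskFormer (single-scale)", 44.0, "2021-07-13"),
]


def rank(entries: List[Entry]) -> List[Entry]:
    """Sort by PQst, highest first.

    sorted() is stable even with reverse=True, so entries with equal
    scores keep their input order (an assumed tie-break, not the site's).
    """
    return sorted(entries, key=lambda e: e.pq_st, reverse=True)


ranked = rank(rows)
for position, entry in enumerate(ranked, start=1):
    print(position, entry.model, entry.pq_st)
```

With this assumed tie-break, the two entries at 48.8 appear in the same relative order in which they were supplied.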