Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

10-shot image generation on COCO-Stuff-171

Metric: mIoU (higher is better)
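
For reference, mIoU (mean intersection-over-union) averages, over classes, the overlap between predicted and ground-truth pixels divided by their union. The sketch below is a minimal illustration only, not the evaluation code of any listed paper. The function name `mean_iou`, the flat label lists, and the choice to skip classes absent from both prediction and ground truth are assumptions; actual benchmark protocols may treat ignore labels and absent classes differently. It returns a fraction in [0, 1], while the leaderboard values appear to be percentages.

```python
def mean_iou(pred, target, num_classes):
    """Mean IoU over classes that occur in the prediction or the ground truth.

    pred, target: equal-length sequences of integer class ids (one per pixel).
    Classes absent from both are skipped (an assumption of this sketch).
    """
    intersection = [0] * num_classes
    union = [0] * num_classes
    for p, t in zip(pred, target):
        if p == t:
            # Pixel is correctly labelled: it counts once toward both
            # the intersection and the union of that class.
            intersection[p] += 1
            union[p] += 1
        else:
            # Mismatch: the pixel belongs to the union of the predicted
            # class and to the union of the true class, but to no intersection.
            union[p] += 1
            union[t] += 1
    ious = [i / u for i, u in zip(intersection, union) if u > 0]
    return sum(ious) / len(ious) if ious else 0.0


# Toy example with four pixels and two classes.
score = mean_iou([0, 0, 1, 1], [0, 1, 1, 1], num_classes=2)
print(f"mIoU = {100 * score:.1f}%")
```

Multiplying by 100 puts the toy score on the same scale as the table entries, under the assumption that those are percentages.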

Results

| # | Model | mIoU | Extra Data | Paper | Date | Code |
|---|-------|------|------------|-------|------|------|
| 1 | CorrCLIP | 34 | No | CorrCLIP: Reconstructing Correlations in CLIP wi... | 2024-11-15 | Code |
| 2 | TextRegion | 31.2 | No | TextRegion: Text-Aligned Region Tokens from Froz... | 2025-05-29 | Code |
| 3 | Trident | 28.6 | No | Harnessing Vision Foundation Models for High-Per... | 2024-11-14 | Code |
| 4 | ProxyCLIP | 26.8 | No | ProxyCLIP: Proxy Attention Improves CLIP for Ope... | 2024-08-09 | Code |
| 5 | TagAlign | 25.3 | No | TagAlign: Improving Vision-Language Alignment wi... | 2023-12-21 | Code |
| 6 | TTD (TCL) | 23.7 | No | TTD: Text-Tag Self-Distillation Enhancing Image-... | 2024-03-30 | Code |
| 7 | COSMOS ViT-B/16 | 23.2 | No | COSMOS: Cross-Modality Self-Distillation for Vis... | 2024-12-02 | Code |
| 8 | TCL | 22.4 | No | Learning to Generate Text-grounded Mask for Open... | 2022-12-01 | Code |
| 9 | TTD (MaskCLIP) | 19.4 | No | TTD: Text-Tag Self-Distillation Enhancing Image-... | 2024-03-30 | Code |
| 10 | MaskCLIP | 16.4 | No | Extract Free Dense Labels from CLIP | 2021-12-02 | Code |
| 11 | CAUSE-TR (ViT-S/8) | 15.2 | No | Causal Unsupervised Semantic Segmentation | 2023-10-11 | Code |
| 12 | ReCo | 14.8 | No | ReCo: Retrieve and Co-segment for Zero-shot Tran... | 2022-06-14 | Code |
| 13 | TransFGU (ViT-S/8) | 11.93 | Yes | TransFGU: A Top-down Approach to Fine-Grained Un... | 2021-12-02 | Code |
| 14 | GroupViT | 11.1 | No | GroupViT: Semantic Segmentation Emerges from Tex... | 2022-02-22 | Code |
| 15 | PiCIE (ResNet-50) | 5.6 | No | PiCIE: Unsupervised Semantic Segmentation using ... | 2021-03-30 | Code |
| 16 | IIC (ResNet-50) | 2.2 | No | Invariant Information Clustering for Unsupervise... | 2018-07-17 | Code |