TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Text-to-Image Generation/COCO (Common Objects in Context)

Text-to-Image Generation on COCO (Common Objects in Context)

Metric: Inception score (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Inception score▼Extra DataPaperDate↕Code
1FuseDream (k=10, 256)34.67NoFuseDream: Training-Free Text-to-Image Generatio...2021-12-02Code
2FuseDream (few-shot, k=5)34.26NoFuseDream: Training-Free Text-to-Image Generatio...2021-12-02Code
3FuseDream (k=5, 256)34.26NoFuseDream: Training-Free Text-to-Image Generatio...2021-12-02Code
4DM-GAN+CL33.34NoImproving Text-to-Image Synthesis Using Contrast...2021-07-06Code
5DM-GAN + VICTR32.37NoVICTR: Visual Information Captured Text Represen...2020-10-07Code
6Lafite32.34NoLAFITE: Towards Language-Free Training for Text-...2021-11-27Code
7DM-GAN (256 x 256)32.2NoNÜWA: Visual Synthesis Pre-training for Neural v...2021-11-24Code
8Swinv2-Imagen31.46YesSwinv2-Imagen: Hierarchical Vision Transformer D...2022-10-18-
9XMC-GAN (256 x 256)30.5NoNÜWA: Visual Synthesis Pre-training for Neural v...2021-11-24Code
10DM-GAN30.49NoDM-GAN: Dynamic Memory Generative Adversarial Ne...2019-04-02Code
11AttnGAN + VICTR28.18NoVICTR: Visual Information Captured Text Represen...2020-10-07Code
12OP-GAN27.88NoSemantic Object Accuracy for Generative Text-to-...2019-10-29Code
13NÜWA (256 x 256)27.2NoNÜWA: Visual Synthesis Pre-training for Neural v...2021-11-24Code
14Lafite (zero-shot)26.02NoLAFITE: Towards Language-Free Training for Text-...2021-11-27Code
15AttnGAN+CL25.7NoImproving Text-to-Image Synthesis Using Contrast...2021-07-06Code
16AttnGAN + OP24.76NoGenerating Multiple Objects at Spatially Distinc...2019-01-03Code
17AttnGAN (256 x 256)23.3NoNÜWA: Visual Synthesis Pre-training for Neural v...2021-11-24Code
18DF-GAN (256 x 256)18.7NoNÜWA: Visual Synthesis Pre-training for Neural v...2021-11-24Code
19CogView18.2YesCogView: Mastering Text-to-Image Generation via ...2021-05-26Code
20CogView (256 x 256)18.2NoNÜWA: Visual Synthesis Pre-training for Neural v...2021-11-24Code
21DALL-E (256 x 256)17.9NoNÜWA: Visual Synthesis Pre-training for Neural v...2021-11-24Code
22StackGAN + OP12.12NoGenerating Multiple Objects at Spatially Distinc...2019-01-03Code
23StackGAN + VICTR10.38NoVICTR: Visual Information Captured Text Represen...2020-10-07Code
24ChatPainter9.74NoChatPainter: Improving Text to Image Generation ...2018-02-22-
25StackGAN-v18.45NoStackGAN++: Realistic Image Synthesis with Stack...2017-10-19Code