Tasks
SotA
Datasets
Papers
Methods
Submit
About
SotA
/
Natural Language Processing
/
Text-to-Image Generation
/
CUB
Text-to-Image Generation on CUB
Metric: FID (lower is better)
Leaderboard
Dataset
Loading chart...
Results
Submit a result
Hide extra data
Export CSV
Sort:
FID (best first)
FID (worst first)
Date (newest first)
Date (oldest first)
Model name (A→Z)
#
Model
↕
FID
▲
Extra Data
Paper
Date
↕
Code
1
RAT-Diffusion
6.36
Yes
Data Extrapolation for Text-to-image Generation ...
2024-10-02
Code
2
TLDM
6.72
No
Truncated Diffusion Probabilistic Models and Dif...
2022-02-19
Code
3
Swinv2-Imagen
9.78
No
Swinv2-Imagen: Hierarchical Vision Transformer D...
2022-10-18
-
4
GALIP
10.08
No
GALIP: Generative Adversarial CLIPs for Text-to-...
2023-01-30
Code
5
RAT-GAN
10.21
No
Recurrent Affine Transformation for Text-to-imag...
2022-04-22
Code
6
VQ-Diffusion-F
10.32
Yes
Vector Quantized Diffusion Model for Text-to-Ima...
2021-11-29
Code
7
Lafite
10.48
Yes
LAFITE: Towards Language-Free Training for Text-...
2021-11-27
Code
8
VQ-Diffusion-B
11.94
Yes
Vector Quantized Diffusion Model for Text-to-Ima...
2021-11-29
Code
9
VQ-Diffusion-S
12.97
Yes
Vector Quantized Diffusion Model for Text-to-Ima...
2021-11-29
Code
10
DM-GAN+CL
14.38
No
Improving Text-to-Image Synthesis Using Contrast...
2021-07-06
Code
11
StackGAN-v2
15.3
No
StackGAN++: Realistic Image Synthesis with Stack...
2017-10-19
Code
12
AttnGAN+CL
16.34
No
Improving Text-to-Image Synthesis Using Contrast...
2021-07-06
Code
13
StackGAN-v1
51.89
No
StackGAN++: Realistic Image Synthesis with Stack...
2017-10-19
Code
14
GAWWN
67.22
Yes
Learning What and Where to Draw
2016-10-08
-
#1
RAT-Diffusion
SOTA
6.36
FID
· Extra Data
· 2024-10-02
Data Extrapolation for Text-to-image Generation on Small Datasets
Code
#2
TLDM
SOTA
6.72
FID
· 2022-02-19
Truncated Diffusion Probabilistic Models and Diffusion-based Adversarial Auto-Encoders
Code
#3
Swinv2-Imagen
9.78
FID
· 2022-10-18
Swinv2-Imagen: Hierarchical Vision Transformer Diffusion Models for Text-to-Image Generation
#4
GALIP
10.08
FID
· 2023-01-30
GALIP: Generative Adversarial CLIPs for Text-to-Image Synthesis
Code
#5
RAT-GAN
10.21
FID
· 2022-04-22
Recurrent Affine Transformation for Text-to-image Synthesis
Code
#6
VQ-Diffusion-F
SOTA
10.32
FID
· Extra Data
· 2021-11-29
Vector Quantized Diffusion Model for Text-to-Image Synthesis
Code
#7
Lafite
SOTA
10.48
FID
· Extra Data
· 2021-11-27
LAFITE: Towards Language-Free Training for Text-to-Image Generation
Code
#8
VQ-Diffusion-B
11.94
FID
· Extra Data
· 2021-11-29
Vector Quantized Diffusion Model for Text-to-Image Synthesis
Code
#9
VQ-Diffusion-S
12.97
FID
· Extra Data
· 2021-11-29
Vector Quantized Diffusion Model for Text-to-Image Synthesis
Code
#10
DM-GAN+CL
SOTA
14.38
FID
· 2021-07-06
Improving Text-to-Image Synthesis Using Contrastive Learning
Code
#11
StackGAN-v2
SOTA
15.3
FID
· 2017-10-19
StackGAN++: Realistic Image Synthesis with Stacked Generative Adversarial Networks
Code
#12
AttnGAN+CL
16.34
FID
· 2021-07-06
Improving Text-to-Image Synthesis Using Contrastive Learning
Code
#13
StackGAN-v1
SOTA
51.89
FID
· 2017-10-19
StackGAN++: Realistic Image Synthesis with Stacked Generative Adversarial Networks
Code
#14
GAWWN
SOTA
67.22
FID
· Extra Data
· 2016-10-08
Learning What and Where to Draw