Image Generation on ImageNet 256x256

Metric: FID (lower is better)

LeaderboardDataset

Loading chart...

Results

Sort:

#	Model↕	FID▲	Extra Data	Paper	Date↕	Code
1	SiT-XL/2 + UCGM-S (E2E-VAE + 40 sampling steps + CFG)	1.06	No	Unified Continuous Generative Models	2025-05-12	Code
2	UCGM-XL/2 (VA-VAE + 30 sampling steps, without guidance)	1.21	No	Unified Continuous Generative Models	2025-05-12	Code
3	UCGM-XL/2 (E2E-VAE + 40 sampling steps, without guidance)	1.21	No	Unified Continuous Generative Models	2025-05-12	Code
4	EDM2-L + DDO (SD-VAE, 25 steps, DPM-Solver-v3)	1.21	No	Direct Discriminative Optimization: Your Likelih...	2025-03-03	Code
5	LightningDiT + UCGM-S (VA-VAE + 50 sampling steps + CFG)	1.21	No	Unified Continuous Generative Models	2025-05-12	Code
6	xAR-H	1.24	No	Beyond Next-Token: Next-X Prediction for Autoreg...	2025-02-27	Code
7	SiT-XL/2 + REPA-E	1.26	No	REPA-E: Unlocking VAE for End-to-End Tuning with...	2025-04-14	Code
8	DDT-XL/2(22en6de 675M + guidance interval )	1.26	No	DDT: Decoupled Diffusion Transformer	2025-04-08	Code
9	xAR-L	1.28	No	Beyond Next-Token: Next-X Prediction for Autoreg...	2025-02-27	Code
10	FACM (2-step)	1.32	No	Flow-Anchored Consistency Models	2025-07-04	Code
11	GMem (with the guidance interval)	1.32	No	Generative Modeling with Explicit Memory	2024-12-11	Code
12	SiT-XL/2 + MG	1.34	No	Diffusion Models without Classifier-free Guidance	2025-02-17	Code
13	AliTok-XL, autoregressive, 662M	1.35	No	AliTok: Towards Sequence Modeling Alignment betw...	2025-06-05	Code
14	LightningDiT + VA-VAE (with the guidance interval)	1.35	No	Reconstruction vs. Generation: Taming Optimizati...	2025-01-02	Code
15	SiD2	1.38	No	Simpler Diffusion (SiD2): 1.5 FID on ImageNet512...	2024-10-25	-
16	SiT↓-XL/2+U-REPA (with the guidance interval)	1.41	No	U-REPA: Aligning Diffusion U-Nets to ViTs	2025-03-24	Code
17	AliTok-XL, autoregressive, 318M	1.42	No	AliTok: Towards Sequence Modeling Alignment betw...	2025-06-05	Code
18	SiT-XL/2 + REPA (with the guidance interval)	1.42	No	Representation Alignment for Generation: Trainin...	2024-10-09	Code
19	RAR-XXL, autoregressive	1.48	No	Randomized Autoregressive Visual Generation	2024-11-01	Code
20	RAR-XL, autoregressive	1.5	No	Randomized Autoregressive Visual Generation	2024-11-01	Code
21	MaskBit	1.52	No	MaskBit: Embedding-free Image Generation via Bit...	2024-09-24	Code
22	GMem (w/o guidance)	1.53	No	Generative Modeling with Explicit Memory	2024-12-11	Code
23	ELM	1.54	No	Elucidating the design space of language models ...	2024-10-21	Code
24	MAR-H, Diff Loss	1.55	No	Autoregressive Image Generation without Vector Q...	2024-06-17	Code
25	PaGoDA	1.56	No	PaGoDA: Progressive Growing of a One-Step Genera...	2024-05-23	Code
26	ViT-XL/2 with limited Interval Guidance	1.57	No	Efficient Diffusion Training via Min-SNR Weighti...	2023-03-16	Code
27	MDTv2	1.58	No	MDTv2: Masked Diffusion Transformer is a Strong ...	2023-03-25	Code
28	SiT-XL + SRA	1.58	No	No Other Representation Component Is Needed: Dif...	2025-05-05	Code
29	RobustTok-L	1.6	No	Robust Latent Matters: Boosting Image Generation...	2025-03-11	Code
30	DiMR-G/2R	1.63	No	Alleviating Distortion in Image Generation via M...	2024-06-13	Code
31	FlowAR	1.65	No	FlowAR: Scale-wise Autoregressive Image Generati...	2024-12-19	Code
32	FACM (1-step)	1.7	No	Flow-Anchored Consistency Models	2025-07-04	Code
33	DiT-XL/2 with CADS	1.7	No	CADS: Unleashing the Diversity of Diffusion Mode...	2023-10-26	-
34	DiMR-XL/2R	1.7	No	Alleviating Distortion in Image Generation via M...	2024-06-13	Code
35	RAR-L, autoregressive	1.7	No	Randomized Autoregressive Visual Generation	2024-11-01	Code
36	DiffiT	1.73	No	DiffiT: Diffusion Vision Transformers for Image ...	2023-12-04	Code
37	VAR (Visual Autoregressive)	1.73	No	Visual Autoregressive Modeling: Scalable Image G...	2024-04-03	Code
38	MAGVIT-v2	1.78	No	Language Model Beats Diffusion -- Tokenizer is K...	2023-10-09	Code
39	MAR-L, Diff Loss	1.78	No	Autoregressive Image Generation without Vector Q...	2024-06-17	Code
40	MDT	1.79	No	MDTv2: Masked Diffusion Transformer is a Strong ...	2023-03-25	Code
41	Discriminator Guidance	1.83	No	Refining Generative Process with Discriminator G...	2022-11-28	Code
42	DoD-XL	1.83	No	Diffusion Models Need Visual Priors for Image Ge...	2024-10-11	-
43	RobustTok-B	1.83	No	Robust Latent Matters: Boosting Image Generation...	2025-03-11	Code
44	ARPG-XXL	1.94	No	Autoregressive Image Generation with Randomized ...	2025-03-13	Code
45	RAR-B, autoregressive	1.95	No	Randomized Autoregressive Visual Generation	2024-11-01	Code
46	TiTok-S-128	1.97	No	An Image is Worth 32 Tokens for Reconstruction a...	2024-06-11	Code
47	PixelFlow	1.98	No	PixelFlow: Pixel-Space Generative Models with Flow	2025-04-10	Code
48	RDM	1.99	No	Relay Diffusion: Unifying diffusion process acro...	2023-09-04	Code
49	FasterDiT-XL/2	2.03	No	FasterDiT: Towards Faster Diffusion Transformers...	2024-10-14	Code
50	LEGO-XL	2.05	No	Learning Stackable and Skippable LEGO Bricks for...	2023-10-10	Code
51	ARPG-XL	2.1	No	Autoregressive Image Generation with Randomized ...	2025-03-13	Code
52	StyleSAN-XL	2.14	No	SAN: Inducing Metrizability of GAN with Discrimi...	2023-01-30	Code
53	LlamaGen	2.18	No	Autoregressive Model Beats Diffusion: Llama for ...	2024-06-10	Code
54	DiT-XL/2	2.27	No	Scalable Diffusion Models with Transformers	2022-12-19	Code
55	StyleGAN-XL	2.3	No	StyleGAN-XL: Scaling StyleGAN to Large Diverse D...	2022-02-01	Code
56	MAR-B, Diff Loss	2.31	No	Autoregressive Image Generation without Vector Q...	2024-06-17	Code
57	Open-MAGVIT2-XL	2.33	No	Open-MAGVIT2: An Open-Source Project Toward Demo...	2024-09-06	Code
58	ACDiT	2.37	No	ACDiT: Interpolating Autoregressive Conditional ...	2024-12-10	Code
59	ARPG-L	2.44	No	Autoregressive Image Generation with Randomized ...	2025-03-13	Code
60	TiTok-B-64	2.48	No	An Image is Worth 32 Tokens for Reconstruction a...	2024-06-11	Code
61	GIVT-Causal-L+A	2.59	No	GIVT: Generative Infinite-Vocabulary Transformers	2023-12-04	Code
62	Patch Diffusion	2.74	No	-	-	-
63	TiTok-B-32	2.77	No	An Image is Worth 32 Tokens for Reconstruction a...	2024-06-11	Code
64	DoD-B	2.79	No	Diffusion Models Need Visual Priors for Image Ge...	2024-10-11	-
65	Poly-INR	2.86	No	Polynomial Implicit Neural Representations For L...	2023-03-20	Code
66	MGVQ	3.02	No	MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tok...	2025-07-10	Code
67	ADM-G++ (FID)	3.18	No	Refining Generative Process with Discriminator G...	2022-11-28	Code
68	DiGIT-0.7B	3.39	No	Stabilize the Latent Space for Image Autoregress...	2024-10-16	Code
69	DiGIT	3.39	No	Stabilize the Latent Space for Image Autoregress...	2024-10-16	Code
70	Contextual RQ-Transformer	3.41	No	Draft-and-Revise: Effective Image Generation wit...	2022-06-09	-
71	GigaGAN	3.45	No	Scaling up GANs for Text-to-Image Synthesis	2023-03-09	Code
72	RCG-L (w/o guidance)	3.49	No	Return of Unconditional Generation: A Self-super...	2023-12-06	Code
73	BIGRoC-gt (Guided-Diffusion)	3.63	No	BIGRoC: Boosting Image Generation via a Robust C...	2021-08-08	Code
74	MAGVIT-v2 (w/o guidance)	3.65	No	Language Model Beats Diffusion -- Tokenizer is K...	2023-10-09	Code
75	BIGRoC-pl (Guided-Diffusion)	3.69	No	BIGRoC: Boosting Image Generation via a Robust C...	2021-08-08	Code
76	simple diffusion (U-Net)	3.71	No	Simple diffusion: End-to-end diffusion for high ...	2023-01-26	Code
77	simple diffusion (U-ViT, L)	3.75	No	Simple diffusion: End-to-end diffusion for high ...	2023-01-26	Code
78	RQ-Transformer	3.83	No	Autoregressive Image Generation using Residual Q...	2022-03-03	Code
79	ADM-G, ADM-U	3.94	No	Diffusion Models Beat GANs on Image Synthesis	2021-05-11	Code
80	ADM-G + EDS (ED-DPM, classifier_scale=0.75)	3.96	No	Entropy-driven Sampling and Training Scheme for ...	2022-06-23	Code
81	MaskGIT (a=0.05)	4.02	No	MaskGIT: Masked Generative Image Transformer	2022-02-08	Code
82	ADM-G + EDS + ECT (ED-DPM, classifier_scale=1.0)	4.09	No	Entropy-driven Sampling and Training Scheme for ...	2022-06-23	Code
83	ADM-G + EDS + ECT (ED-DPM, classifier_scale=1.0)	4.09	No	Entropy-driven Sampling and Training Scheme for ...	2022-06-23	Code
84	LDM	4.29	No	-	-	-
85	ADM-G++ (Recall)	4.45	No	Refining Generative Process with Discriminator G...	2022-11-28	Code
86	LFM	4.46	No	Flow Matching in Latent Space	2023-07-17	Code
87	RIN	4.51	No	Scalable Adaptive Computation for Iterative Gene...	2022-12-22	Code
88	ADM-G	4.59	No	Diffusion Models Beat GANs on Image Synthesis	2021-05-11	Code
89	ADM-G	4.59	No	Diffusion Models Beat GANs on Image Synthesis	2021-05-11	Code
90	CDM	4.88	No	Cascaded Diffusion Models for High Fidelity Imag...	2021-05-30	-
91	VQGAN+Transformer (k=600, p=1.0, a=0.05)	5.2	No	Taming Transformers for High-Resolution Image Sy...	2020-12-17	Code
92	MaskGIT	6.18	No	MaskGIT: Masked Generative Image Transformer	2022-02-08	Code
93	VQGAN+Transformer (k=mixed, p=1.0, a=0.005)	6.59	No	Taming Transformers for High-Resolution Image Sy...	2020-12-17	Code
94	Polarity-BigGAN	6.82	No	Polarity Sampling: Quality and Diversity Control...	2022-03-03	Code
95	BigGAN-deep	8.1	No	Large Scale GAN Training for High Fidelity Natur...	2018-09-28	Code
96	BigGAN+ [Brock et al.] (chx96)	8.1	No	Instance-Conditioned GAN	2021-09-10	Code
97	ADM	11.84	No	-	-	-
98	Improved DDPM	12.3	No	Improved Denoising Diffusion Probabilistic Models	2021-02-18	Code

#1SiT-XL/2 + UCGM-S (E2E-VAE + 40 sampling steps + CFG)SOTA
1.06
FID· 2025-05-12
Unified Continuous Generative Models Code
#2UCGM-XL/2 (VA-VAE + 30 sampling steps, without guidance)
1.21
FID· 2025-05-12
Unified Continuous Generative Models Code
#3UCGM-XL/2 (E2E-VAE + 40 sampling steps, without guidance)
1.21
FID· 2025-05-12
Unified Continuous Generative Models Code
#4EDM2-L + DDO (SD-VAE, 25 steps, DPM-Solver-v3)SOTA
1.21
FID· 2025-03-03
Direct Discriminative Optimization: Your Likelihood-Based Visual Generative Model is Secretly a GAN Discriminator Code
#5LightningDiT + UCGM-S (VA-VAE + 50 sampling steps + CFG)
1.21
FID· 2025-05-12
Unified Continuous Generative Models Code
#6xAR-HSOTA
1.24
FID· 2025-02-27
Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generation Code
#7SiT-XL/2 + REPA-E
1.26
FID· 2025-04-14
REPA-E: Unlocking VAE for End-to-End Tuning with Latent Diffusion Transformers Code
#8DDT-XL/2(22en6de 675M + guidance interval )
1.26
FID· 2025-04-08
DDT: Decoupled Diffusion Transformer Code
#9xAR-LSOTA
1.28
FID· 2025-02-27
Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generation Code
#10FACM (2-step)
1.32
FID· 2025-07-04
Flow-Anchored Consistency Models Code
#11GMem (with the guidance interval)SOTA
1.32
FID· 2024-12-11
Generative Modeling with Explicit Memory Code
#12SiT-XL/2 + MG
1.34
FID· 2025-02-17
Diffusion Models without Classifier-free Guidance Code
#13AliTok-XL, autoregressive, 662M
1.35
FID· 2025-06-05
AliTok: Towards Sequence Modeling Alignment between Tokenizer and Autoregressive Model Code
#14LightningDiT + VA-VAE (with the guidance interval)
1.35
FID· 2025-01-02
Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models Code
#15SiD2SOTA
1.38
FID· 2024-10-25
Simpler Diffusion (SiD2): 1.5 FID on ImageNet512 with pixel-space diffusion
#16SiT↓-XL/2+U-REPA (with the guidance interval)
1.41
FID· 2025-03-24
U-REPA: Aligning Diffusion U-Nets to ViTs Code
#17AliTok-XL, autoregressive, 318M
1.42
FID· 2025-06-05
AliTok: Towards Sequence Modeling Alignment between Tokenizer and Autoregressive Model Code
#18SiT-XL/2 + REPA (with the guidance interval)SOTA
1.42
FID· 2024-10-09
Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think Code
#19RAR-XXL, autoregressive
1.48
FID· 2024-11-01
Randomized Autoregressive Visual Generation Code
#20RAR-XL, autoregressive
1.5
FID· 2024-11-01
Randomized Autoregressive Visual Generation Code
#21MaskBitSOTA
1.52
FID· 2024-09-24
MaskBit: Embedding-free Image Generation via Bit Tokens Code
#22GMem (w/o guidance)
1.53
FID· 2024-12-11
Generative Modeling with Explicit Memory Code
#23ELM
1.54
FID· 2024-10-21
Elucidating the design space of language models for image generation Code
#24MAR-H, Diff LossSOTA
1.55
FID· 2024-06-17
Autoregressive Image Generation without Vector Quantization Code
#25PaGoDASOTA
1.56
FID· 2024-05-23
PaGoDA: Progressive Growing of a One-Step Generator from a Low-Resolution Diffusion Teacher Code
#26ViT-XL/2 with limited Interval GuidanceSOTA
1.57
FID· 2023-03-16
Efficient Diffusion Training via Min-SNR Weighting Strategy Code
#27MDTv2
1.58
FID· 2023-03-25
MDTv2: Masked Diffusion Transformer is a Strong Image Synthesizer Code
#28SiT-XL + SRA
1.58
FID· 2025-05-05
No Other Representation Component Is Needed: Diffusion Transformers Can Provide Representation Guidance by Themselves Code
#29RobustTok-L
1.6
FID· 2025-03-11
Robust Latent Matters: Boosting Image Generation with Sampling Error Code
#30DiMR-G/2R
1.63
FID· 2024-06-13
Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models and Time-Dependent Layer Normalization Code
#31FlowAR
1.65
FID· 2024-12-19
FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching Code
#32FACM (1-step)
1.7
FID· 2025-07-04
Flow-Anchored Consistency Models Code
#33DiT-XL/2 with CADS
1.7
FID· 2023-10-26
CADS: Unleashing the Diversity of Diffusion Models through Condition-Annealed Sampling
#34DiMR-XL/2R
1.7
FID· 2024-06-13
Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models and Time-Dependent Layer Normalization Code
#35RAR-L, autoregressive
1.7
FID· 2024-11-01
Randomized Autoregressive Visual Generation Code
#36DiffiT
1.73
FID· 2023-12-04
DiffiT: Diffusion Vision Transformers for Image Generation Code
#37VAR (Visual Autoregressive)
1.73
FID· 2024-04-03
Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction Code
#38MAGVIT-v2
1.78
FID· 2023-10-09
Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation Code
#39MAR-L, Diff Loss
1.78
FID· 2024-06-17
Autoregressive Image Generation without Vector Quantization Code
#40MDT
1.79
FID· 2023-03-25
MDTv2: Masked Diffusion Transformer is a Strong Image Synthesizer Code
#41Discriminator GuidanceSOTA
1.83
FID· 2022-11-28
Refining Generative Process with Discriminator Guidance in Score-based Diffusion Models Code
#42DoD-XL
1.83
FID· 2024-10-11
Diffusion Models Need Visual Priors for Image Generation
#43RobustTok-B
1.83
FID· 2025-03-11
Robust Latent Matters: Boosting Image Generation with Sampling Error Code
#44ARPG-XXL
1.94
FID· 2025-03-13
Autoregressive Image Generation with Randomized Parallel Decoding Code
#45RAR-B, autoregressive
1.95
FID· 2024-11-01
Randomized Autoregressive Visual Generation Code
#46TiTok-S-128
1.97
FID· 2024-06-11
An Image is Worth 32 Tokens for Reconstruction and Generation Code
#47PixelFlow
1.98
FID· 2025-04-10
PixelFlow: Pixel-Space Generative Models with Flow Code
#48RDM
1.99
FID· 2023-09-04
Relay Diffusion: Unifying diffusion process across resolutions for image synthesis Code
#49FasterDiT-XL/2
2.03
FID· 2024-10-14
FasterDiT: Towards Faster Diffusion Transformers Training without Architecture Modification Code
#50LEGO-XL
2.05
FID· 2023-10-10
Learning Stackable and Skippable LEGO Bricks for Efficient, Reconfigurable, and Variable-Resolution Diffusion Modeling Code
#51ARPG-XL
2.1
FID· 2025-03-13
Autoregressive Image Generation with Randomized Parallel Decoding Code
#52StyleSAN-XL
2.14
FID· 2023-01-30
SAN: Inducing Metrizability of GAN with Discriminative Normalized Linear Layer Code
#53LlamaGen
2.18
FID· 2024-06-10
Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation Code
#54DiT-XL/2
2.27
FID· 2022-12-19
Scalable Diffusion Models with Transformers Code
#55StyleGAN-XLSOTA
2.3
FID· 2022-02-01
StyleGAN-XL: Scaling StyleGAN to Large Diverse Datasets Code
#56MAR-B, Diff Loss
2.31
FID· 2024-06-17
Autoregressive Image Generation without Vector Quantization Code
#57Open-MAGVIT2-XL
2.33
FID· 2024-09-06
Open-MAGVIT2: An Open-Source Project Toward Democratizing Auto-regressive Visual Generation Code
#58ACDiT
2.37
FID· 2024-12-10
ACDiT: Interpolating Autoregressive Conditional Modeling and Diffusion Transformer Code
#59ARPG-L
2.44
FID· 2025-03-13
Autoregressive Image Generation with Randomized Parallel Decoding Code
#60TiTok-B-64
2.48
FID· 2024-06-11
An Image is Worth 32 Tokens for Reconstruction and Generation Code
#61GIVT-Causal-L+A
2.59
FID· 2023-12-04
GIVT: Generative Infinite-Vocabulary Transformers Code
#62Patch Diffusion
2.74
FID
No paper
#63TiTok-B-32
2.77
FID· 2024-06-11
An Image is Worth 32 Tokens for Reconstruction and Generation Code
#64DoD-B
2.79
FID· 2024-10-11
Diffusion Models Need Visual Priors for Image Generation
#65Poly-INR
2.86
FID· 2023-03-20
Polynomial Implicit Neural Representations For Large Diverse Datasets Code
#66MGVQ
3.02
FID· 2025-07-10
MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tokenizer with Multi-group Quantization Code
#67ADM-G++ (FID)
3.18
FID· 2022-11-28
Refining Generative Process with Discriminator Guidance in Score-based Diffusion Models Code
#68DiGIT-0.7B
3.39
FID· 2024-10-16
Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective Code
#69DiGIT
3.39
FID· 2024-10-16
Stabilize the Latent Space for Image Autoregressive Modeling: A Unified Perspective Code
#70Contextual RQ-Transformer
3.41
FID· 2022-06-09
Draft-and-Revise: Effective Image Generation with Contextual RQ-Transformer
#71GigaGAN
3.45
FID· 2023-03-09
Scaling up GANs for Text-to-Image Synthesis Code
#72RCG-L (w/o guidance)
3.49
FID· 2023-12-06
Return of Unconditional Generation: A Self-supervised Representation Generation Method Code
#73BIGRoC-gt (Guided-Diffusion)SOTA
3.63
FID· 2021-08-08
BIGRoC: Boosting Image Generation via a Robust Classifier Code
#74MAGVIT-v2 (w/o guidance)
3.65
FID· 2023-10-09
Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation Code
#75BIGRoC-pl (Guided-Diffusion)SOTA
3.69
FID· 2021-08-08
BIGRoC: Boosting Image Generation via a Robust Classifier Code
#76simple diffusion (U-Net)
3.71
FID· 2023-01-26
Simple diffusion: End-to-end diffusion for high resolution images Code
#77simple diffusion (U-ViT, L)
3.75
FID· 2023-01-26
Simple diffusion: End-to-end diffusion for high resolution images Code
#78RQ-Transformer
3.83
FID· 2022-03-03
Autoregressive Image Generation using Residual Quantization Code
#79ADM-G, ADM-USOTA
3.94
FID· 2021-05-11
Diffusion Models Beat GANs on Image Synthesis Code
#80ADM-G + EDS (ED-DPM, classifier_scale=0.75)
3.96
FID· 2022-06-23
Entropy-driven Sampling and Training Scheme for Conditional Diffusion Generation Code
#81MaskGIT (a=0.05)
4.02
FID· 2022-02-08
MaskGIT: Masked Generative Image Transformer Code
#82ADM-G + EDS + ECT (ED-DPM, classifier_scale=1.0)
4.09
FID· 2022-06-23
Entropy-driven Sampling and Training Scheme for Conditional Diffusion Generation Code
#83ADM-G + EDS + ECT (ED-DPM, classifier_scale=1.0)
4.09
FID· 2022-06-23
Entropy-driven Sampling and Training Scheme for Conditional Diffusion Generation Code
#84LDM
4.29
FID
No paper
#85ADM-G++ (Recall)
4.45
FID· 2022-11-28
Refining Generative Process with Discriminator Guidance in Score-based Diffusion Models Code
#86LFM
4.46
FID· 2023-07-17
Flow Matching in Latent Space Code
#87RIN
4.51
FID· 2022-12-22
Scalable Adaptive Computation for Iterative Generation Code
#88ADM-GSOTA
4.59
FID· 2021-05-11
Diffusion Models Beat GANs on Image Synthesis Code
#89ADM-G
4.59
FID· 2021-05-11
Diffusion Models Beat GANs on Image Synthesis Code
#90CDM
4.88
FID· 2021-05-30
Cascaded Diffusion Models for High Fidelity Image Generation
#91VQGAN+Transformer (k=600, p=1.0, a=0.05)SOTA
5.2
FID· 2020-12-17
Taming Transformers for High-Resolution Image Synthesis Code
#92MaskGIT
6.18
FID· 2022-02-08
MaskGIT: Masked Generative Image Transformer Code
#93VQGAN+Transformer (k=mixed, p=1.0, a=0.005)SOTA
6.59
FID· 2020-12-17
Taming Transformers for High-Resolution Image Synthesis Code
#94Polarity-BigGAN
6.82
FID· 2022-03-03
Polarity Sampling: Quality and Diversity Control of Pre-Trained Generative Networks via Singular Values Code
#95BigGAN-deepSOTA
8.1
FID· 2018-09-28
Large Scale GAN Training for High Fidelity Natural Image Synthesis Code
#96BigGAN+ [Brock et al.] (chx96)
8.1
FID· 2021-09-10
Instance-Conditioned GAN Code
#97ADM
11.84
FID
No paper
#98Improved DDPM
12.3
FID· 2021-02-18
Improved Denoising Diffusion Probabilistic Models Code