TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Medical/Image Generation/ImageNet 256x256

Image Generation on ImageNet 256x256

Metric: FID (lower is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕FID▲Extra DataPaperDate↕Code
1SiT-XL/2 + UCGM-S (E2E-VAE + 40 sampling steps + CFG)1.06NoUnified Continuous Generative Models2025-05-12Code
2UCGM-XL/2 (VA-VAE + 30 sampling steps, without guidance)1.21NoUnified Continuous Generative Models2025-05-12Code
3UCGM-XL/2 (E2E-VAE + 40 sampling steps, without guidance)1.21NoUnified Continuous Generative Models2025-05-12Code
4EDM2-L + DDO (SD-VAE, 25 steps, DPM-Solver-v3)1.21NoDirect Discriminative Optimization: Your Likelih...2025-03-03Code
5LightningDiT + UCGM-S (VA-VAE + 50 sampling steps + CFG)1.21NoUnified Continuous Generative Models2025-05-12Code
6xAR-H1.24NoBeyond Next-Token: Next-X Prediction for Autoreg...2025-02-27Code
7SiT-XL/2 + REPA-E1.26NoREPA-E: Unlocking VAE for End-to-End Tuning with...2025-04-14Code
8DDT-XL/2(22en6de 675M + guidance interval )1.26NoDDT: Decoupled Diffusion Transformer2025-04-08Code
9xAR-L1.28NoBeyond Next-Token: Next-X Prediction for Autoreg...2025-02-27Code
10FACM (2-step)1.32NoFlow-Anchored Consistency Models2025-07-04Code
11GMem (with the guidance interval)1.32NoGenerative Modeling with Explicit Memory2024-12-11Code
12SiT-XL/2 + MG1.34NoDiffusion Models without Classifier-free Guidance2025-02-17Code
13AliTok-XL, autoregressive, 662M1.35NoAliTok: Towards Sequence Modeling Alignment betw...2025-06-05Code
14LightningDiT + VA-VAE (with the guidance interval)1.35NoReconstruction vs. Generation: Taming Optimizati...2025-01-02Code
15SiD21.38NoSimpler Diffusion (SiD2): 1.5 FID on ImageNet512...2024-10-25-
16SiT↓-XL/2+U-REPA (with the guidance interval)1.41NoU-REPA: Aligning Diffusion U-Nets to ViTs2025-03-24Code
17AliTok-XL, autoregressive, 318M1.42NoAliTok: Towards Sequence Modeling Alignment betw...2025-06-05Code
18SiT-XL/2 + REPA (with the guidance interval)1.42NoRepresentation Alignment for Generation: Trainin...2024-10-09Code
19RAR-XXL, autoregressive1.48NoRandomized Autoregressive Visual Generation2024-11-01Code
20RAR-XL, autoregressive1.5NoRandomized Autoregressive Visual Generation2024-11-01Code
21MaskBit1.52NoMaskBit: Embedding-free Image Generation via Bit...2024-09-24Code
22GMem (w/o guidance)1.53NoGenerative Modeling with Explicit Memory2024-12-11Code
23ELM1.54NoElucidating the design space of language models ...2024-10-21Code
24MAR-H, Diff Loss1.55NoAutoregressive Image Generation without Vector Q...2024-06-17Code
25PaGoDA1.56NoPaGoDA: Progressive Growing of a One-Step Genera...2024-05-23Code
26ViT-XL/2 with limited Interval Guidance1.57NoEfficient Diffusion Training via Min-SNR Weighti...2023-03-16Code
27MDTv21.58NoMDTv2: Masked Diffusion Transformer is a Strong ...2023-03-25Code
28SiT-XL + SRA1.58NoNo Other Representation Component Is Needed: Dif...2025-05-05Code
29RobustTok-L1.6NoRobust Latent Matters: Boosting Image Generation...2025-03-11Code
30DiMR-G/2R1.63NoAlleviating Distortion in Image Generation via M...2024-06-13Code
31FlowAR1.65NoFlowAR: Scale-wise Autoregressive Image Generati...2024-12-19Code
32FACM (1-step)1.7NoFlow-Anchored Consistency Models2025-07-04Code
33DiT-XL/2 with CADS1.7NoCADS: Unleashing the Diversity of Diffusion Mode...2023-10-26-
34DiMR-XL/2R1.7NoAlleviating Distortion in Image Generation via M...2024-06-13Code
35RAR-L, autoregressive1.7NoRandomized Autoregressive Visual Generation2024-11-01Code
36DiffiT1.73NoDiffiT: Diffusion Vision Transformers for Image ...2023-12-04Code
37VAR (Visual Autoregressive)1.73NoVisual Autoregressive Modeling: Scalable Image G...2024-04-03Code
38MAGVIT-v21.78NoLanguage Model Beats Diffusion -- Tokenizer is K...2023-10-09Code
39MAR-L, Diff Loss1.78NoAutoregressive Image Generation without Vector Q...2024-06-17Code
40MDT1.79NoMDTv2: Masked Diffusion Transformer is a Strong ...2023-03-25Code
41Discriminator Guidance1.83NoRefining Generative Process with Discriminator G...2022-11-28Code
42DoD-XL1.83NoDiffusion Models Need Visual Priors for Image Ge...2024-10-11-
43RobustTok-B1.83NoRobust Latent Matters: Boosting Image Generation...2025-03-11Code
44ARPG-XXL1.94NoAutoregressive Image Generation with Randomized ...2025-03-13Code
45RAR-B, autoregressive1.95NoRandomized Autoregressive Visual Generation2024-11-01Code
46TiTok-S-1281.97NoAn Image is Worth 32 Tokens for Reconstruction a...2024-06-11Code
47PixelFlow1.98NoPixelFlow: Pixel-Space Generative Models with Flow2025-04-10Code
48RDM1.99NoRelay Diffusion: Unifying diffusion process acro...2023-09-04Code
49FasterDiT-XL/22.03NoFasterDiT: Towards Faster Diffusion Transformers...2024-10-14Code
50LEGO-XL2.05NoLearning Stackable and Skippable LEGO Bricks for...2023-10-10Code
51ARPG-XL2.1NoAutoregressive Image Generation with Randomized ...2025-03-13Code
52StyleSAN-XL2.14NoSAN: Inducing Metrizability of GAN with Discrimi...2023-01-30Code
53LlamaGen2.18NoAutoregressive Model Beats Diffusion: Llama for ...2024-06-10Code
54DiT-XL/22.27NoScalable Diffusion Models with Transformers2022-12-19Code
55StyleGAN-XL2.3NoStyleGAN-XL: Scaling StyleGAN to Large Diverse D...2022-02-01Code
56MAR-B, Diff Loss2.31NoAutoregressive Image Generation without Vector Q...2024-06-17Code
57Open-MAGVIT2-XL2.33NoOpen-MAGVIT2: An Open-Source Project Toward Demo...2024-09-06Code
58ACDiT2.37NoACDiT: Interpolating Autoregressive Conditional ...2024-12-10Code
59ARPG-L2.44NoAutoregressive Image Generation with Randomized ...2025-03-13Code
60TiTok-B-642.48NoAn Image is Worth 32 Tokens for Reconstruction a...2024-06-11Code
61GIVT-Causal-L+A 2.59NoGIVT: Generative Infinite-Vocabulary Transformers2023-12-04Code
62Patch Diffusion2.74No---
63TiTok-B-322.77NoAn Image is Worth 32 Tokens for Reconstruction a...2024-06-11Code
64DoD-B2.79NoDiffusion Models Need Visual Priors for Image Ge...2024-10-11-
65Poly-INR2.86NoPolynomial Implicit Neural Representations For L...2023-03-20Code
66MGVQ3.02NoMGVQ: Could VQ-VAE Beat VAE? A Generalizable Tok...2025-07-10Code
67ADM-G++ (FID)3.18NoRefining Generative Process with Discriminator G...2022-11-28Code
68DiGIT-0.7B3.39NoStabilize the Latent Space for Image Autoregress...2024-10-16Code
69DiGIT3.39NoStabilize the Latent Space for Image Autoregress...2024-10-16Code
70Contextual RQ-Transformer3.41NoDraft-and-Revise: Effective Image Generation wit...2022-06-09-
71GigaGAN3.45NoScaling up GANs for Text-to-Image Synthesis2023-03-09Code
72RCG-L (w/o guidance)3.49NoReturn of Unconditional Generation: A Self-super...2023-12-06Code
73BIGRoC-gt (Guided-Diffusion)3.63NoBIGRoC: Boosting Image Generation via a Robust C...2021-08-08Code
74MAGVIT-v2 (w/o guidance)3.65NoLanguage Model Beats Diffusion -- Tokenizer is K...2023-10-09Code
75BIGRoC-pl (Guided-Diffusion)3.69NoBIGRoC: Boosting Image Generation via a Robust C...2021-08-08Code
76simple diffusion (U-Net)3.71NoSimple diffusion: End-to-end diffusion for high ...2023-01-26Code
77simple diffusion (U-ViT, L)3.75NoSimple diffusion: End-to-end diffusion for high ...2023-01-26Code
78RQ-Transformer3.83NoAutoregressive Image Generation using Residual Q...2022-03-03Code
79ADM-G, ADM-U3.94NoDiffusion Models Beat GANs on Image Synthesis2021-05-11Code
80ADM-G + EDS (ED-DPM, classifier_scale=0.75)3.96NoEntropy-driven Sampling and Training Scheme for ...2022-06-23Code
81MaskGIT (a=0.05)4.02NoMaskGIT: Masked Generative Image Transformer2022-02-08Code
82ADM-G + EDS + ECT (ED-DPM, classifier_scale=1.0)4.09NoEntropy-driven Sampling and Training Scheme for ...2022-06-23Code
83ADM-G + EDS + ECT (ED-DPM, classifier_scale=1.0)4.09NoEntropy-driven Sampling and Training Scheme for ...2022-06-23Code
84LDM4.29No---
85ADM-G++ (Recall)4.45NoRefining Generative Process with Discriminator G...2022-11-28Code
86LFM4.46NoFlow Matching in Latent Space2023-07-17Code
87RIN4.51NoScalable Adaptive Computation for Iterative Gene...2022-12-22Code
88ADM-G4.59NoDiffusion Models Beat GANs on Image Synthesis2021-05-11Code
89ADM-G4.59NoDiffusion Models Beat GANs on Image Synthesis2021-05-11Code
90CDM4.88NoCascaded Diffusion Models for High Fidelity Imag...2021-05-30-
91VQGAN+Transformer (k=600, p=1.0, a=0.05)5.2NoTaming Transformers for High-Resolution Image Sy...2020-12-17Code
92MaskGIT6.18NoMaskGIT: Masked Generative Image Transformer2022-02-08Code
93VQGAN+Transformer (k=mixed, p=1.0, a=0.005)6.59NoTaming Transformers for High-Resolution Image Sy...2020-12-17Code
94Polarity-BigGAN6.82NoPolarity Sampling: Quality and Diversity Control...2022-03-03Code
95BigGAN-deep8.1NoLarge Scale GAN Training for High Fidelity Natur...2018-09-28Code
96BigGAN+ [Brock et al.] (chx96)8.1NoInstance-Conditioned GAN2021-09-10Code
97ADM11.84No---
98Improved DDPM12.3NoImproved Denoising Diffusion Probabilistic Models2021-02-18Code