TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Diffusion Models Beat GANs on Image Synthesis

Diffusion Models Beat GANs on Image Synthesis

Prafulla Dhariwal, Alex Nichol

2021-05-11NeurIPS 2021 12Image GenerationConditional Image Generation
PaperPDFCodeCodeCodeCodeCodeCodeCodeCodeCodeCode(official)CodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCode

Abstract

We show that diffusion models can achieve image sample quality superior to the current state-of-the-art generative models. We achieve this on unconditional image synthesis by finding a better architecture through a series of ablations. For conditional image synthesis, we further improve sample quality with classifier guidance: a simple, compute-efficient method for trading off diversity for fidelity using gradients from a classifier. We achieve an FID of 2.97 on ImageNet 128$\times$128, 4.59 on ImageNet 256$\times$256, and 7.72 on ImageNet 512$\times$512, and we match BigGAN-deep even with as few as 25 forward passes per sample, all while maintaining better coverage of the distribution. Finally, we find that classifier guidance combines well with upsampling diffusion models, further improving FID to 3.94 on ImageNet 256$\times$256 and 3.85 on ImageNet 512$\times$512. We release our code at https://github.com/openai/guided-diffusion

Results

TaskDatasetMetricValueModel
Image GenerationLSUN Cat 256 x 256FID5.57ADM (dropout)
Image GenerationLSUN Horse 256 x 256FID2.57ADM (dropout)
Image GenerationImageNet 64x64FID2.07ADM (dropout)
Image GenerationLSUN Bedroom 256 x 256FID1.9ADM (dropout)
Image GenerationLSUN Bedroom 256 x 256FD59.64ADM (dropout, DINOv2)
Image GenerationLSUN Bedroom 256 x 256Precision0.85ADM (dropout, DINOv2)
Image GenerationLSUN Bedroom 256 x 256Recall0.75ADM (dropout, DINOv2)
Image GenerationImageNet 128x128FID2.97ADM-G
Image GenerationImageNet 512x512FID3.85ADM-G, ADM-U
Image GenerationImageNet 512x512Inception score221.72ADM-G, ADM-U
Image GenerationImageNet 512x512FID7.72ADM-G
Image GenerationImageNet 512x512Inception score172.71ADM-G
Image GenerationImageNet 256x256FID3.94ADM-G, ADM-U
Image GenerationImageNet 256x256FID4.59ADM-G
Image GenerationImageNet 256x256FID4.59ADM-G
Image GenerationImageNet 256x256Inception score186.7ADM-G
Image GenerationImageNet 128x128FID2.97ADM-G (classifier_scale=0.5)
Conditional Image GenerationImageNet 256x256FID4.59ADM-G
Conditional Image GenerationImageNet 256x256Inception score186.7ADM-G
Conditional Image GenerationImageNet 128x128FID2.97ADM-G (classifier_scale=0.5)

Related Papers

fastWDM3D: Fast and Accurate 3D Healthy Tissue Inpainting2025-07-17Synthesizing Reality: Leveraging the Generative AI-Powered Platform Midjourney for Construction Worker Detection2025-07-17FashionPose: Text to Pose to Relight Image Generation for Personalized Fashion Visualization2025-07-17A Distributed Generative AI Approach for Heterogeneous Multi-Domain Environments under Data Sharing constraints2025-07-17Pixel Perfect MegaMed: A Megapixel-Scale Vision-Language Foundation Model for Generating High Resolution Medical Images2025-07-17FADE: Adversarial Concept Erasure in Flow Models2025-07-16CharaConsist: Fine-Grained Consistent Character Generation2025-07-15CATVis: Context-Aware Thought Visualization2025-07-15