Diffusion Models Beat GANs on Image Synthesis

Prafulla Dhariwal, Alex Nichol

2021-05-11NeurIPS 2021 12Image Generation Conditional Image Generation

Paper PDF Code Code Code Code Code Code Code Code Code Code(official)Code Code Code Code Code Code Code Code Code Code Code

Abstract

We show that diffusion models can achieve image sample quality superior to the current state-of-the-art generative models. We achieve this on unconditional image synthesis by finding a better architecture through a series of ablations. For conditional image synthesis, we further improve sample quality with classifier guidance: a simple, compute-efficient method for trading off diversity for fidelity using gradients from a classifier. We achieve an FID of 2.97 on ImageNet 128$\times$128, 4.59 on ImageNet 256$\times$256, and 7.72 on ImageNet 512$\times$512, and we match BigGAN-deep even with as few as 25 forward passes per sample, all while maintaining better coverage of the distribution. Finally, we find that classifier guidance combines well with upsampling diffusion models, further improving FID to 3.94 on ImageNet 256$\times$256 and 3.85 on ImageNet 512$\times$512. We release our code at https://github.com/openai/guided-diffusion

Results

Task	Dataset	Metric	Value	Model
Image Generation	LSUN Cat 256 x 256	FID	5.57	ADM (dropout)
Image Generation	LSUN Horse 256 x 256	FID	2.57	ADM (dropout)
Image Generation	ImageNet 64x64	FID	2.07	ADM (dropout)
Image Generation	LSUN Bedroom 256 x 256	FID	1.9	ADM (dropout)
Image Generation	LSUN Bedroom 256 x 256	FD	59.64	ADM (dropout, DINOv2)
Image Generation	LSUN Bedroom 256 x 256	Precision	0.85	ADM (dropout, DINOv2)
Image Generation	LSUN Bedroom 256 x 256	Recall	0.75	ADM (dropout, DINOv2)
Image Generation	ImageNet 128x128	FID	2.97	ADM-G
Image Generation	ImageNet 512x512	FID	3.85	ADM-G, ADM-U
Image Generation	ImageNet 512x512	Inception score	221.72	ADM-G, ADM-U
Image Generation	ImageNet 512x512	FID	7.72	ADM-G
Image Generation	ImageNet 512x512	Inception score	172.71	ADM-G
Image Generation	ImageNet 256x256	FID	3.94	ADM-G, ADM-U
Image Generation	ImageNet 256x256	FID	4.59	ADM-G
Image Generation	ImageNet 256x256	FID	4.59	ADM-G
Image Generation	ImageNet 256x256	Inception score	186.7	ADM-G
Image Generation	ImageNet 128x128	FID	2.97	ADM-G (classifier_scale=0.5)
Conditional Image Generation	ImageNet 256x256	FID	4.59	ADM-G
Conditional Image Generation	ImageNet 256x256	Inception score	186.7	ADM-G
Conditional Image Generation	ImageNet 128x128	FID	2.97	ADM-G (classifier_scale=0.5)

Diffusion Models Beat GANs on Image Synthesis

Abstract

Results

Related Papers

Diffusion Models Beat GANs on Image Synthesis

Abstract

Results

Related Papers