Truncated Diffusion Probabilistic Models and Diffusion-based Adversarial Auto-Encoders

Huangjie Zheng, Pengcheng He, Weizhu Chen, Mingyuan Zhou

2022-02-19Text-to-Image Generation Image Generation

Abstract

Employing a forward diffusion chain to gradually map the data to a noise distribution, diffusion-based generative models learn how to generate the data by inferring a reverse diffusion chain. However, this approach is slow and costly because it needs many forward and reverse steps. We propose a faster and cheaper approach that adds noise not until the data become pure random noise, but until they reach a hidden noisy data distribution that we can confidently learn. Then, we use fewer reverse steps to generate data by starting from this hidden distribution that is made similar to the noisy data. We reveal that the proposed model can be cast as an adversarial auto-encoder empowered by both the diffusion process and a learnable implicit prior. Experimental results show even with a significantly smaller number of reverse diffusion steps, the proposed truncated diffusion probabilistic models can provide consistent improvements over the non-truncated ones in terms of performance in both unconditional and text-guided image generations.

Results

Task	Dataset	Metric	Value	Model
Image Generation	LSUN Bedroom 256 x 256	FID	1.88	TDPM+ (TTrunc=99)
Image Generation	LSUN Bedroom 256 x 256	NFE	100	TDPM+ (TTrunc=99)
Image Generation	LSUN Churches 256 x 256	FID	3.98	TDPM+ (TTrunc=99)
Image Generation	LSUN Churches 256 x 256	NFE	100	TDPM+ (TTrunc=99)
Image Generation	COCO (Common Objects in Context)	FID	6.29	TLDM
Image Generation	CUB	FID	6.72	TLDM
Text-to-Image Generation	COCO (Common Objects in Context)	FID	6.29	TLDM
Text-to-Image Generation	CUB	FID	6.72	TLDM
10-shot image generation	COCO (Common Objects in Context)	FID	6.29	TLDM
10-shot image generation	CUB	FID	6.72	TLDM
1 Image, 2*2 Stitchi	COCO (Common Objects in Context)	FID	6.29	TLDM
1 Image, 2*2 Stitchi	CUB	FID	6.72	TLDM

Truncated Diffusion Probabilistic Models and Diffusion-based Adversarial Auto-Encoders

Abstract

Results

Related Papers

Truncated Diffusion Probabilistic Models and Diffusion-based Adversarial Auto-Encoders

Abstract

Results

Related Papers