Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


A U-Net Based Discriminator for Generative Adversarial Networks

Edgar Schönfeld, Bernt Schiele, Anna Khoreva

Published: 2020-02-28
Tasks: Data Augmentation, Image Generation, Conditional Image Generation

Abstract

Among the major remaining challenges for generative adversarial networks (GANs) is the capacity to synthesize globally and locally coherent images with object shapes and textures indistinguishable from real images. To target this issue we propose an alternative U-Net based discriminator architecture, borrowing insights from the segmentation literature. The proposed U-Net based architecture provides detailed per-pixel feedback to the generator while maintaining the global coherence of synthesized images by also supplying global, image-level feedback. Empowered by the per-pixel response of the discriminator, we further propose a per-pixel consistency regularization technique based on the CutMix data augmentation, encouraging the U-Net discriminator to focus more on semantic and structural changes between real and fake images. This improves the U-Net discriminator training, further enhancing the quality of generated samples. The novel discriminator improves over the state of the art in terms of standard distribution and image quality metrics, enabling the generator to synthesize images with varying structure, appearance, and levels of detail while maintaining global and local realism. Compared to the BigGAN baseline, we achieve an average improvement of 2.7 FID points across FFHQ, CelebA, and the newly introduced COCO-Animals dataset. The code is available at https://github.com/boschresearch/unetgan.
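The CutMix-based consistency regularization described above relies on mixing real and fake images patch-wise and asking the U-Net discriminator's decoder to predict, per pixel, which source each pixel came from. A minimal, framework-free sketch of that mixing step is below; the function names (`random_box`, `cutmix`) and the use of numpy arrays in place of image tensors are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

def random_box(h, w, lam, rng):
    """Sample a box covering roughly a (1 - lam) fraction of the image area."""
    cut_h = int(h * np.sqrt(1.0 - lam))
    cut_w = int(w * np.sqrt(1.0 - lam))
    cy = rng.integers(0, h - cut_h + 1)
    cx = rng.integers(0, w - cut_w + 1)
    return cy, cx, cut_h, cut_w

def cutmix(real, fake, lam, rng):
    """Paste a fake patch into a real image (hypothetical helper).

    Returns the mixed image and the binary per-pixel mask
    (1 = real pixel, 0 = fake pixel) that serves as the target
    for the U-Net discriminator's per-pixel prediction.
    """
    h, w = real.shape[:2]
    mask = np.ones((h, w), dtype=real.dtype)
    cy, cx, ch, cw = random_box(h, w, lam, rng)
    mask[cy:cy + ch, cx:cx + cw] = 0.0
    mixed = mask[..., None] * real + (1.0 - mask[..., None]) * fake
    return mixed, mask

rng = np.random.default_rng(0)
real = np.ones((8, 8, 3))   # stand-in for a real image
fake = np.zeros((8, 8, 3))  # stand-in for a generated image
mixed, mask = cutmix(real, fake, lam=0.5, rng=rng)
```

The consistency term then penalizes the discriminator when its per-pixel output on `mixed` differs from the CutMix of its per-pixel outputs on `real` and `fake`, pushing it to base decisions on local content rather than global shortcuts.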

Results

Task                         | Dataset           | Metric          | Value | Model
-----------------------------|-------------------|-----------------|-------|----------
Image Generation             | CelebA-HQ 128x128 | FID             | 2.03  | U-Net GAN
Image Generation             | CelebA-HQ 128x128 | Inception score | 3.33  | U-Net GAN
Image Generation             | FFHQ 256x256      | FID             | 7.48  | U-Net GAN
Image Generation             | FFHQ 256x256      | FID             | 11.48 | BigGAN
Image Generation             | CelebA 128x128    | FID             | 2.95  | U-Net GAN
Image Generation             | CelebA 128x128    | Inception score | 3.43  | U-Net GAN
Image Generation             | COCO-Animals      | FID             | 13.73 | U-Net GAN
Image Generation             | COCO-Animals      | Inception score | 12.29 | U-Net GAN
Image Generation             | COCO-Animals      | FID             | 16.37 | BigGAN
Image Generation             | COCO-Animals      | Inception score | 11.77 | BigGAN
Conditional Image Generation | COCO-Animals      | FID             | 13.73 | U-Net GAN
Conditional Image Generation | COCO-Animals      | Inception score | 12.29 | U-Net GAN
Conditional Image Generation | COCO-Animals      | FID             | 16.37 | BigGAN
Conditional Image Generation | COCO-Animals      | Inception score | 11.77 | BigGAN

Related Papers

- Overview of the TalentCLEF 2025: Skill and Job Title Intelligence for Human Capital Management (2025-07-17)
- Pixel Perfect MegaMed: A Megapixel-Scale Vision-Language Foundation Model for Generating High Resolution Medical Images (2025-07-17)
- fastWDM3D: Fast and Accurate 3D Healthy Tissue Inpainting (2025-07-17)
- Synthesizing Reality: Leveraging the Generative AI-Powered Platform Midjourney for Construction Worker Detection (2025-07-17)
- FashionPose: Text to Pose to Relight Image Generation for Personalized Fashion Visualization (2025-07-17)
- A Distributed Generative AI Approach for Heterogeneous Multi-Domain Environments under Data Sharing Constraints (2025-07-17)
- Similarity-Guided Diffusion for Contrastive Sequential Recommendation (2025-07-16)
- FADE: Adversarial Concept Erasure in Flow Models (2025-07-16)