StarGAN v2: Diverse Image Synthesis for Multiple Domains

Yunjey Choi, Youngjung Uh, Jaejun Yoo, Jung-Woo Ha

2019-12-04CVPR 2020 6Multimodal Unsupervised Image-To-Image Translation Translation Image Generation Image-to-Image Translation

Paper PDF Code Code Code Code Code Code Code Code Code Code Code(official)Code Code Code

Abstract

A good image-to-image translation model should learn a mapping between different visual domains while satisfying the following properties: 1) diversity of generated images and 2) scalability over multiple domains. Existing methods address either of the issues, having limited diversity or multiple models for all domains. We propose StarGAN v2, a single framework that tackles both and shows significantly improved results over the baselines. Experiments on CelebA-HQ and a new animal faces dataset (AFHQ) validate our superiority in terms of visual quality, diversity, and scalability. To better assess image-to-image translation models, we release AFHQ, high-quality animal faces with large inter- and intra-domain differences. The code, pretrained models, and dataset can be found at https://github.com/clovaai/stargan-v2.

Results

Task	Dataset	Metric	Value	Model
Image-to-Image Translation	CelebA-HQ	FID	13.73	StarGAN v2
Image-to-Image Translation	CelebA-HQ	LPIPS	0.428	StarGAN v2
Image-to-Image Translation	AFHQ	FID	24.4	StarGAN v2
Image-to-Image Translation	AFHQ	LPIPS	0.524	StarGAN v2
Image-to-Image Translation	CelebA-HQ	FID	13.73	StarGAN v2
Image-to-Image Translation	AFHQ	FID	16.2	StarGAN v2
Image-to-Image Translation	Fundus Fluorescein Angiogram Photographs & Colour Fundus Images of Diabetic Patients	FID	27.7	StarGAN-v2
Image-to-Image Translation	Fundus Fluorescein Angiogram Photographs & Colour Fundus Images of Diabetic Patients	Kernel Inception Distance	0.00118	StarGAN-v2
Image Generation	CelebA-HQ	FID	13.73	StarGAN v2
Image Generation	CelebA-HQ	LPIPS	0.428	StarGAN v2
Image Generation	AFHQ	FID	24.4	StarGAN v2
Image Generation	AFHQ	LPIPS	0.524	StarGAN v2
Image Generation	CelebA-HQ	FID	13.73	StarGAN v2
Image Generation	AFHQ	FID	16.2	StarGAN v2
Image Generation	Fundus Fluorescein Angiogram Photographs & Colour Fundus Images of Diabetic Patients	FID	27.7	StarGAN-v2
Image Generation	Fundus Fluorescein Angiogram Photographs & Colour Fundus Images of Diabetic Patients	Kernel Inception Distance	0.00118	StarGAN-v2
1 Image, 2*2 Stitching	CelebA-HQ	FID	13.73	StarGAN v2
1 Image, 2*2 Stitching	CelebA-HQ	LPIPS	0.428	StarGAN v2
1 Image, 2*2 Stitching	AFHQ	FID	24.4	StarGAN v2
1 Image, 2*2 Stitching	AFHQ	LPIPS	0.524	StarGAN v2
1 Image, 2*2 Stitching	CelebA-HQ	FID	13.73	StarGAN v2
1 Image, 2*2 Stitching	AFHQ	FID	16.2	StarGAN v2
1 Image, 2*2 Stitching	Fundus Fluorescein Angiogram Photographs & Colour Fundus Images of Diabetic Patients	FID	27.7	StarGAN-v2
1 Image, 2*2 Stitching	Fundus Fluorescein Angiogram Photographs & Colour Fundus Images of Diabetic Patients	Kernel Inception Distance	0.00118	StarGAN-v2

StarGAN v2: Diverse Image Synthesis for Multiple Domains

Abstract

Results

Related Papers

StarGAN v2: Diverse Image Synthesis for Multiple Domains

Abstract

Results

Related Papers