Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks

Jun-Yan Zhu, Taesung Park, Phillip Isola, Alexei A. Efros

2017-03-30ICCV 2017 10Multimodal Unsupervised Image-To-Image Translation Style Transfer Unsupervised Image-To-Image Translation Image Colorization Translation Image-to-Image Translation

Abstract

Image-to-image translation is a class of vision and graphics problems where the goal is to learn the mapping between an input image and an output image using a training set of aligned image pairs. However, for many tasks, paired training data will not be available. We present an approach for learning to translate an image from a source domain $X$ to a target domain $Y$ in the absence of paired examples. Our goal is to learn a mapping $G: X \rightarrow Y$ such that the distribution of images from $G(X)$ is indistinguishable from the distribution $Y$ using an adversarial loss. Because this mapping is highly under-constrained, we couple it with an inverse mapping $F: Y \rightarrow X$ and introduce a cycle consistency loss to push $F(G(X)) \approx X$ (and vice versa). Qualitative results are presented on several tasks where paired training data does not exist, including collection style transfer, object transfiguration, season transfer, photo enhancement, etc. Quantitative comparisons against several prior methods demonstrate the superiority of our approach.

Results

Task	Dataset	Metric	Value	Model
Image-to-Image Translation	vangogh2photo	Frechet Inception Distance	163.4	CycleGAN
Image-to-Image Translation	zebra2horse	Frechet Inception Distance	110.5	CycleGAN
Image-to-Image Translation	photo2vangogh	Frechet Inception Distance	151.4	CycleGAN
Image-to-Image Translation	horse2zebra	Frechet Inception Distance	89.7	CycleGAN
Image-to-Image Translation	Cityscapes Labels-to-Photo	Class IOU	0.11	CycleGAN
Image-to-Image Translation	Cityscapes Photo-to-Labels	Class IOU	0.16	CycleGAN
Image-to-Image Translation	Freiburg Forest Dataset	PSNR	18.57	cycGAN
Image-to-Image Translation	EPFL NIR-VIS	PSNR	17.38	cycGAN
Image-to-Image Translation	Edge-to-Shoes	Diversity	0.01	CycleGAN
Image-to-Image Translation	Cats-and-Dogs	CIS	0.076	CycleGAN
Image-to-Image Translation	Cats-and-Dogs	IS	0.813	CycleGAN
Image-to-Image Translation	Edge-to-Handbags	Diversity	0.012	CycleGAN
Image Generation	vangogh2photo	Frechet Inception Distance	163.4	CycleGAN
Image Generation	zebra2horse	Frechet Inception Distance	110.5	CycleGAN
Image Generation	photo2vangogh	Frechet Inception Distance	151.4	CycleGAN
Image Generation	horse2zebra	Frechet Inception Distance	89.7	CycleGAN
Image Generation	Cityscapes Labels-to-Photo	Class IOU	0.11	CycleGAN
Image Generation	Cityscapes Photo-to-Labels	Class IOU	0.16	CycleGAN
Image Generation	Freiburg Forest Dataset	PSNR	18.57	cycGAN
Image Generation	EPFL NIR-VIS	PSNR	17.38	cycGAN
Image Generation	Edge-to-Shoes	Diversity	0.01	CycleGAN
Image Generation	Cats-and-Dogs	CIS	0.076	CycleGAN
Image Generation	Cats-and-Dogs	IS	0.813	CycleGAN
Image Generation	Edge-to-Handbags	Diversity	0.012	CycleGAN
Unsupervised Image-To-Image Translation	Freiburg Forest Dataset	PSNR	18.57	cycGAN
Image Colorization	NIR2RGB VCIP Challange Dataset	PSNR	19.59	CycleGAN
1 Image, 2*2 Stitching	vangogh2photo	Frechet Inception Distance	163.4	CycleGAN
1 Image, 2*2 Stitching	zebra2horse	Frechet Inception Distance	110.5	CycleGAN
1 Image, 2*2 Stitching	photo2vangogh	Frechet Inception Distance	151.4	CycleGAN
1 Image, 2*2 Stitching	horse2zebra	Frechet Inception Distance	89.7	CycleGAN
1 Image, 2*2 Stitching	Cityscapes Labels-to-Photo	Class IOU	0.11	CycleGAN
1 Image, 2*2 Stitching	Cityscapes Photo-to-Labels	Class IOU	0.16	CycleGAN
1 Image, 2*2 Stitching	Freiburg Forest Dataset	PSNR	18.57	cycGAN
1 Image, 2*2 Stitching	EPFL NIR-VIS	PSNR	17.38	cycGAN
1 Image, 2*2 Stitching	Edge-to-Shoes	Diversity	0.01	CycleGAN
1 Image, 2*2 Stitching	Cats-and-Dogs	CIS	0.076	CycleGAN
1 Image, 2*2 Stitching	Cats-and-Dogs	IS	0.813	CycleGAN
1 Image, 2*2 Stitching	Edge-to-Handbags	Diversity	0.012	CycleGAN

Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks

Abstract

Results

Related Papers

Unpaired Image-to-Image Translation using Cycle-Consistent Adversarial Networks

Abstract

Results

Related Papers