Jun-Yan Zhu, Taesung Park, Phillip Isola, Alexei A. Efros
Image-to-image translation is a class of vision and graphics problems where the goal is to learn the mapping between an input image and an output image using a training set of aligned image pairs. However, for many tasks, paired training data will not be available. We present an approach for learning to translate an image from a source domain $X$ to a target domain $Y$ in the absence of paired examples. Our goal is to learn a mapping $G: X \rightarrow Y$ such that the distribution of images from $G(X)$ is indistinguishable from the distribution $Y$ using an adversarial loss. Because this mapping is highly under-constrained, we couple it with an inverse mapping $F: Y \rightarrow X$ and introduce a cycle consistency loss to push $F(G(X)) \approx X$ (and vice versa). Qualitative results are presented on several tasks where paired training data does not exist, including collection style transfer, object transfiguration, season transfer, photo enhancement, etc. Quantitative comparisons against several prior methods demonstrate the superiority of our approach.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Image-to-Image Translation | vangogh2photo | Frechet Inception Distance | 163.4 | CycleGAN |
| Image-to-Image Translation | zebra2horse | Frechet Inception Distance | 110.5 | CycleGAN |
| Image-to-Image Translation | photo2vangogh | Frechet Inception Distance | 151.4 | CycleGAN |
| Image-to-Image Translation | horse2zebra | Frechet Inception Distance | 89.7 | CycleGAN |
| Image-to-Image Translation | Cityscapes Labels-to-Photo | Class IOU | 0.11 | CycleGAN |
| Image-to-Image Translation | Cityscapes Photo-to-Labels | Class IOU | 0.16 | CycleGAN |
| Image-to-Image Translation | Freiburg Forest Dataset | PSNR | 18.57 | cycGAN |
| Image-to-Image Translation | EPFL NIR-VIS | PSNR | 17.38 | cycGAN |
| Image-to-Image Translation | Edge-to-Shoes | Diversity | 0.01 | CycleGAN |
| Image-to-Image Translation | Cats-and-Dogs | CIS | 0.076 | CycleGAN |
| Image-to-Image Translation | Cats-and-Dogs | IS | 0.813 | CycleGAN |
| Image-to-Image Translation | Edge-to-Handbags | Diversity | 0.012 | CycleGAN |
| Image Generation | vangogh2photo | Frechet Inception Distance | 163.4 | CycleGAN |
| Image Generation | zebra2horse | Frechet Inception Distance | 110.5 | CycleGAN |
| Image Generation | photo2vangogh | Frechet Inception Distance | 151.4 | CycleGAN |
| Image Generation | horse2zebra | Frechet Inception Distance | 89.7 | CycleGAN |
| Image Generation | Cityscapes Labels-to-Photo | Class IOU | 0.11 | CycleGAN |
| Image Generation | Cityscapes Photo-to-Labels | Class IOU | 0.16 | CycleGAN |
| Image Generation | Freiburg Forest Dataset | PSNR | 18.57 | cycGAN |
| Image Generation | EPFL NIR-VIS | PSNR | 17.38 | cycGAN |
| Image Generation | Edge-to-Shoes | Diversity | 0.01 | CycleGAN |
| Image Generation | Cats-and-Dogs | CIS | 0.076 | CycleGAN |
| Image Generation | Cats-and-Dogs | IS | 0.813 | CycleGAN |
| Image Generation | Edge-to-Handbags | Diversity | 0.012 | CycleGAN |
| Unsupervised Image-To-Image Translation | Freiburg Forest Dataset | PSNR | 18.57 | cycGAN |
| Image Colorization | NIR2RGB VCIP Challange Dataset | PSNR | 19.59 | CycleGAN |
| 1 Image, 2*2 Stitching | vangogh2photo | Frechet Inception Distance | 163.4 | CycleGAN |
| 1 Image, 2*2 Stitching | zebra2horse | Frechet Inception Distance | 110.5 | CycleGAN |
| 1 Image, 2*2 Stitching | photo2vangogh | Frechet Inception Distance | 151.4 | CycleGAN |
| 1 Image, 2*2 Stitching | horse2zebra | Frechet Inception Distance | 89.7 | CycleGAN |
| 1 Image, 2*2 Stitching | Cityscapes Labels-to-Photo | Class IOU | 0.11 | CycleGAN |
| 1 Image, 2*2 Stitching | Cityscapes Photo-to-Labels | Class IOU | 0.16 | CycleGAN |
| 1 Image, 2*2 Stitching | Freiburg Forest Dataset | PSNR | 18.57 | cycGAN |
| 1 Image, 2*2 Stitching | EPFL NIR-VIS | PSNR | 17.38 | cycGAN |
| 1 Image, 2*2 Stitching | Edge-to-Shoes | Diversity | 0.01 | CycleGAN |
| 1 Image, 2*2 Stitching | Cats-and-Dogs | CIS | 0.076 | CycleGAN |
| 1 Image, 2*2 Stitching | Cats-and-Dogs | IS | 0.813 | CycleGAN |
| 1 Image, 2*2 Stitching | Edge-to-Handbags | Diversity | 0.012 | CycleGAN |