StarGAN v2: Diverse Image Synthesis for Multiple Domains

Yunjey Choi, Youngjung Uh, Jaejun Yoo, Jung-Woo Ha

2019-12-04 | CVPR 2020 6
Tasks: Multimodal Unsupervised Image-To-Image Translation, Translation, Image Generation, Image-to-Image Translation

Abstract

A good image-to-image translation model should learn a mapping between different visual domains while satisfying the following properties: 1) diversity of generated images and 2) scalability over multiple domains. Existing methods address either of the issues, having limited diversity or multiple models for all domains. We propose StarGAN v2, a single framework that tackles both and shows significantly improved results over the baselines. Experiments on CelebA-HQ and a new animal faces dataset (AFHQ) validate our superiority in terms of visual quality, diversity, and scalability. To better assess image-to-image translation models, we release AFHQ, high-quality animal faces with large inter- and intra-domain differences. The code, pretrained models, and dataset can be found at https://github.com/clovaai/stargan-v2.

Results

| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Image-to-Image Translation | CelebA-HQ | FID | 13.73 | StarGAN v2 |
| Image-to-Image Translation | CelebA-HQ | LPIPS | 0.428 | StarGAN v2 |
| Image-to-Image Translation | AFHQ | FID | 24.4 | StarGAN v2 |
| Image-to-Image Translation | AFHQ | LPIPS | 0.524 | StarGAN v2 |
| Image-to-Image Translation | AFHQ | FID | 16.2 | StarGAN v2 |
| Image-to-Image Translation | Fundus Fluorescein Angiogram Photographs & Colour Fundus Images of Diabetic Patients | FID | 27.7 | StarGAN-v2 |
| Image-to-Image Translation | Fundus Fluorescein Angiogram Photographs & Colour Fundus Images of Diabetic Patients | Kernel Inception Distance | 0.00118 | StarGAN-v2 |
| Image Generation | CelebA-HQ | FID | 13.73 | StarGAN v2 |
| Image Generation | CelebA-HQ | LPIPS | 0.428 | StarGAN v2 |
| Image Generation | AFHQ | FID | 24.4 | StarGAN v2 |
| Image Generation | AFHQ | LPIPS | 0.524 | StarGAN v2 |
| Image Generation | AFHQ | FID | 16.2 | StarGAN v2 |
| Image Generation | Fundus Fluorescein Angiogram Photographs & Colour Fundus Images of Diabetic Patients | FID | 27.7 | StarGAN-v2 |
| Image Generation | Fundus Fluorescein Angiogram Photographs & Colour Fundus Images of Diabetic Patients | Kernel Inception Distance | 0.00118 | StarGAN-v2 |
| 1 Image, 2*2 Stitching | CelebA-HQ | FID | 13.73 | StarGAN v2 |
| 1 Image, 2*2 Stitching | CelebA-HQ | LPIPS | 0.428 | StarGAN v2 |
| 1 Image, 2*2 Stitching | AFHQ | FID | 24.4 | StarGAN v2 |
| 1 Image, 2*2 Stitching | AFHQ | LPIPS | 0.524 | StarGAN v2 |
| 1 Image, 2*2 Stitching | AFHQ | FID | 16.2 | StarGAN v2 |
| 1 Image, 2*2 Stitching | Fundus Fluorescein Angiogram Photographs & Colour Fundus Images of Diabetic Patients | FID | 27.7 | StarGAN-v2 |
| 1 Image, 2*2 Stitching | Fundus Fluorescein Angiogram Photographs & Colour Fundus Images of Diabetic Patients | Kernel Inception Distance | 0.00118 | StarGAN-v2 |
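
Most of the values above are FID scores (lower is better). As a rough illustration of what that metric measures, the following is a minimal sketch of the Fréchet distance between two Gaussians fitted to feature vectors. It is not the authors' evaluation code: FID is normally computed on Inception-v3 features, and that feature-extraction step is omitted here. The function names `frechet_distance` and `feature_statistics` and the synthetic inputs are hypothetical, chosen only for this example.

```python
import numpy as np


def frechet_distance(mu1, sigma1, mu2, sigma2):
    """Frechet distance between the Gaussians N(mu1, sigma1) and N(mu2, sigma2)."""
    mu1 = np.asarray(mu1, dtype=float)
    mu2 = np.asarray(mu2, dtype=float)
    sigma1 = np.asarray(sigma1, dtype=float)
    sigma2 = np.asarray(sigma2, dtype=float)
    diff = mu1 - mu2
    # Tr((sigma1 sigma2)^(1/2)) equals the sum of the square roots of the
    # eigenvalues of sigma1 @ sigma2, which are real and non-negative for
    # positive semi-definite covariances. Clip tiny negatives from round-off.
    eigvals = np.linalg.eigvals(sigma1 @ sigma2)
    trace_sqrt = np.sqrt(np.clip(eigvals.real, 0.0, None)).sum()
    return float(diff @ diff + np.trace(sigma1) + np.trace(sigma2) - 2.0 * trace_sqrt)


def feature_statistics(features):
    """Mean vector and covariance matrix of an (n_samples, n_features) array."""
    features = np.asarray(features, dtype=float)
    return features.mean(axis=0), np.cov(features, rowvar=False)


if __name__ == "__main__":
    # Synthetic stand-ins for real and generated image features (assumption).
    rng = np.random.default_rng(0)
    real = rng.normal(size=(500, 4))
    fake = real + 0.5  # same covariance, mean shifted by 0.5 in each dimension
    score = frechet_distance(*feature_statistics(real), *feature_statistics(fake))
    print(score)  # approximately 1.0, i.e. 4 dimensions x 0.5**2
```

With identical covariances the covariance terms cancel, so the score reduces to the squared distance between the means. Real FID values such as those in the table also reflect differences in the feature covariances.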

Related Papers

A Translation of Probabilistic Event Calculus into Markov Decision Processes (2025-07-17)
fastWDM3D: Fast and Accurate 3D Healthy Tissue Inpainting (2025-07-17)
Synthesizing Reality: Leveraging the Generative AI-Powered Platform Midjourney for Construction Worker Detection (2025-07-17)
FashionPose: Text to Pose to Relight Image Generation for Personalized Fashion Visualization (2025-07-17)
A Distributed Generative AI Approach for Heterogeneous Multi-Domain Environments under Data Sharing constraints (2025-07-17)
Pixel Perfect MegaMed: A Megapixel-Scale Vision-Language Foundation Model for Generating High Resolution Medical Images (2025-07-17)
FADE: Adversarial Concept Erasure in Flow Models (2025-07-16)
Function-to-Style Guidance of LLMs for Code Translation (2025-07-15)