TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/High-Resolution Image Synthesis and Semantic Manipulation ...

High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs

Ting-Chun Wang, Ming-Yu Liu, Jun-Yan Zhu, Andrew Tao, Jan Kautz, Bryan Catanzaro

2017-11-30CVPR 2018 6Vocal Bursts Intensity PredictionSketch-to-Image TranslationSemantic SegmentationInstance SegmentationImage GenerationConditional Image GenerationImage-to-Image Translation
PaperPDFCodeCodeCodeCodeCodeCodeCodeCodeCodeCode(official)CodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCode

Abstract

We present a new method for synthesizing high-resolution photo-realistic images from semantic label maps using conditional generative adversarial networks (conditional GANs). Conditional GANs have enabled a variety of applications, but the results are often limited to low-resolution and still far from realistic. In this work, we generate 2048x1024 visually appealing results with a novel adversarial loss, as well as new multi-scale generator and discriminator architectures. Furthermore, we extend our framework to interactive visual manipulation with two additional features. First, we incorporate object instance segmentation information, which enables object manipulations such as removing/adding objects and changing the object category. Second, we propose a method to generate diverse results given the same input, allowing users to edit the object appearance interactively. Human opinion studies demonstrate that our method significantly outperforms existing methods, advancing both the quality and the resolution of deep image synthesis and editing.

Results

TaskDatasetMetricValueModel
Image-to-Image TranslationCOCO-Stuff Labels-to-PhotosFID111.5pix2pixHD
Image-to-Image TranslationCOCO-Stuff Labels-to-PhotosmIoU14.6pix2pixHD
Image-to-Image TranslationCityscapes Labels-to-PhotoFID95pix2pixHD
Image-to-Image TranslationCityscapes Labels-to-PhotomIoU58.3pix2pixHD
Image-to-Image TranslationADE20K Labels-to-PhotosFID81.8pix2pixHD
Image-to-Image TranslationADE20K Labels-to-PhotosmIoU20.3pix2pixHD
Image-to-Image TranslationADE20K-Outdoor Labels-to-PhotosFID97.8pix2pixHD
Image-to-Image TranslationADE20K-Outdoor Labels-to-PhotosmIoU17.4pix2pixHD
Image-to-Image TranslationFundus Fluorescein Angiogram Photographs & Colour Fundus Images of Diabetic PatientsFID42.8pix2pixHD
Image-to-Image TranslationFundus Fluorescein Angiogram Photographs & Colour Fundus Images of Diabetic PatientsKernel Inception Distance0.00258pix2pixHD
Image GenerationCOCO-Stuff Labels-to-PhotosFID111.5pix2pixHD
Image GenerationCOCO-Stuff Labels-to-PhotosmIoU14.6pix2pixHD
Image GenerationCityscapes Labels-to-PhotoFID95pix2pixHD
Image GenerationCityscapes Labels-to-PhotomIoU58.3pix2pixHD
Image GenerationADE20K Labels-to-PhotosFID81.8pix2pixHD
Image GenerationADE20K Labels-to-PhotosmIoU20.3pix2pixHD
Image GenerationADE20K-Outdoor Labels-to-PhotosFID97.8pix2pixHD
Image GenerationADE20K-Outdoor Labels-to-PhotosmIoU17.4pix2pixHD
Image GenerationFundus Fluorescein Angiogram Photographs & Colour Fundus Images of Diabetic PatientsFID42.8pix2pixHD
Image GenerationFundus Fluorescein Angiogram Photographs & Colour Fundus Images of Diabetic PatientsKernel Inception Distance0.00258pix2pixHD
Sketch-to-Image TranslationCOCO-StuffFID38.7Pix2PixHD
Sketch-to-Image TranslationCOCO-StuffFID-C27.1Pix2PixHD
1 Image, 2*2 StitchingCOCO-Stuff Labels-to-PhotosFID111.5pix2pixHD
1 Image, 2*2 StitchingCOCO-Stuff Labels-to-PhotosmIoU14.6pix2pixHD
1 Image, 2*2 StitchingCityscapes Labels-to-PhotoFID95pix2pixHD
1 Image, 2*2 StitchingCityscapes Labels-to-PhotomIoU58.3pix2pixHD
1 Image, 2*2 StitchingADE20K Labels-to-PhotosFID81.8pix2pixHD
1 Image, 2*2 StitchingADE20K Labels-to-PhotosmIoU20.3pix2pixHD
1 Image, 2*2 StitchingADE20K-Outdoor Labels-to-PhotosFID97.8pix2pixHD
1 Image, 2*2 StitchingADE20K-Outdoor Labels-to-PhotosmIoU17.4pix2pixHD
1 Image, 2*2 StitchingFundus Fluorescein Angiogram Photographs & Colour Fundus Images of Diabetic PatientsFID42.8pix2pixHD
1 Image, 2*2 StitchingFundus Fluorescein Angiogram Photographs & Colour Fundus Images of Diabetic PatientsKernel Inception Distance0.00258pix2pixHD

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17fastWDM3D: Fast and Accurate 3D Healthy Tissue Inpainting2025-07-17Synthesizing Reality: Leveraging the Generative AI-Powered Platform Midjourney for Construction Worker Detection2025-07-17FashionPose: Text to Pose to Relight Image Generation for Personalized Fashion Visualization2025-07-17