Semantic Image Synthesis with Spatially-Adaptive Normalization

Taesung Park, Ming-Yu Liu, Ting-Chun Wang, Jun-Yan Zhu

Date: 2019-03-18
Venue: CVPR 2019
Tasks: Sketch-to-Image Translation, Image Generation, Image-to-Image Translation

Abstract

We propose spatially-adaptive normalization, a simple but effective layer for synthesizing photorealistic images given an input semantic layout. Previous methods directly feed the semantic layout as input to the deep network, which is then processed through stacks of convolution, normalization, and nonlinearity layers. We show that this is suboptimal as the normalization layers tend to "wash away" semantic information. To address the issue, we propose using the input layout for modulating the activations in normalization layers through a spatially-adaptive, learned transformation. Experiments on several challenging datasets demonstrate the advantage of the proposed method over existing approaches, regarding both visual fidelity and alignment with input layouts. Finally, our model allows user control over both semantics and style. Code is available at https://github.com/NVlabs/SPADE.
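
The idea in the abstract can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation: the function names are hypothetical, and the per-class lookup tables `gamma_table` and `beta_table` are an assumed stand-in for the learned transformation that maps the layout to scale and shift values. The sketch normalizes each channel to zero mean and unit variance, then applies a scale and shift chosen per pixel from the label map. The demo at the end shows one way normalization can "wash away" layout information: a constant activation map normalizes to all zeros, while the modulated output still depends on the label.

```python
import math

def normalize_channel(channel, eps=1e-5):
    """Normalize one H x W channel to zero mean and unit variance."""
    values = [v for row in channel for v in row]
    mean = sum(values) / len(values)
    var = sum((v - mean) ** 2 for v in values) / len(values)
    scale = 1.0 / math.sqrt(var + eps)
    return [[(v - mean) * scale for v in row] for row in channel]

def spade_modulate(features, label_map, gamma_table, beta_table):
    """Spatially-adaptive scale and shift of normalized activations.

    features:    C x H x W nested lists of activations.
    label_map:   H x W nested lists of integer class ids.
    gamma_table: gamma_table[k][c] is the scale for class k, channel c (assumed).
    beta_table:  beta_table[k][c] is the shift for class k, channel c (assumed).
    """
    out = []
    for c, channel in enumerate(features):
        normed = normalize_channel(channel)
        out.append([
            [gamma_table[label_map[y][x]][c] * v + beta_table[label_map[y][x]][c]
             for x, v in enumerate(row)]
            for y, row in enumerate(normed)
        ])
    return out

# Demo: one channel, 2 x 2, with constant activations and a uniform layout.
flat = [[[3.0, 3.0], [3.0, 3.0]]]
uniform_labels = [[1, 1], [1, 1]]
gamma_table = [[1.0], [2.0]]   # illustrative values for classes 0 and 1
beta_table = [[0.0], [0.5]]

plain = normalize_channel(flat[0])   # every entry becomes 0.0
modulated = spade_modulate(flat, uniform_labels, gamma_table, beta_table)
print(plain)
print(modulated)   # every entry equals the class-1 shift, 0.5
```

With plain normalization the constant input carries no information after the layer, whichever class produced it. With the modulated version, the class-dependent shift survives, so later layers can still tell which label covered each pixel.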

Results

| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Image-to-Image Translation | COCO-Stuff Labels-to-Photos | FID | 22.6 | SPADE |
| Image-to-Image Translation | COCO-Stuff Labels-to-Photos | mIoU | 37.4 | SPADE |
| Image-to-Image Translation | Cityscapes Labels-to-Photo | FID | 71.8 | SPADE |
| Image-to-Image Translation | Cityscapes Labels-to-Photo | mIoU | 62.3 | SPADE |
| Image-to-Image Translation | ADE20K Labels-to-Photos | FID | 33.9 | SPADE |
| Image-to-Image Translation | ADE20K Labels-to-Photos | mIoU | 38.5 | SPADE |
| Image-to-Image Translation | ADE20K-Outdoor Labels-to-Photos | FID | 63.3 | SPADE |
| Image-to-Image Translation | ADE20K-Outdoor Labels-to-Photos | mIoU | 30.8 | SPADE |
| Image Generation | COCO-Stuff Labels-to-Photos | FID | 22.6 | SPADE |
| Image Generation | COCO-Stuff Labels-to-Photos | mIoU | 37.4 | SPADE |
| Image Generation | Cityscapes Labels-to-Photo | FID | 71.8 | SPADE |
| Image Generation | Cityscapes Labels-to-Photo | mIoU | 62.3 | SPADE |
| Image Generation | ADE20K Labels-to-Photos | FID | 33.9 | SPADE |
| Image Generation | ADE20K Labels-to-Photos | mIoU | 38.5 | SPADE |
| Image Generation | ADE20K-Outdoor Labels-to-Photos | FID | 63.3 | SPADE |
| Image Generation | ADE20K-Outdoor Labels-to-Photos | mIoU | 30.8 | SPADE |
| Sketch-to-Image Translation | COCO-Stuff | FID | 89.2 | SPADE |
| Sketch-to-Image Translation | COCO-Stuff | FID-C | 48.9 | SPADE |
| 1 Image, 2*2 Stitching | COCO-Stuff Labels-to-Photos | FID | 22.6 | SPADE |
| 1 Image, 2*2 Stitching | COCO-Stuff Labels-to-Photos | mIoU | 37.4 | SPADE |
| 1 Image, 2*2 Stitching | Cityscapes Labels-to-Photo | FID | 71.8 | SPADE |
| 1 Image, 2*2 Stitching | Cityscapes Labels-to-Photo | mIoU | 62.3 | SPADE |
| 1 Image, 2*2 Stitching | ADE20K Labels-to-Photos | FID | 33.9 | SPADE |
| 1 Image, 2*2 Stitching | ADE20K Labels-to-Photos | mIoU | 38.5 | SPADE |
| 1 Image, 2*2 Stitching | ADE20K-Outdoor Labels-to-Photos | FID | 63.3 | SPADE |
| 1 Image, 2*2 Stitching | ADE20K-Outdoor Labels-to-Photos | mIoU | 30.8 | SPADE |
