TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Wavelet-based Unsupervised Label-to-Image Translation

Wavelet-based Unsupervised Label-to-Image Translation

George Eskandar, Mohamed Abdelsamad, Karim Armanious, Shuai Zhang, Bin Yang

2023-05-16IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2022 5Multimodal Unsupervised Image-To-Image TranslationUnsupervised Image-To-Image TranslationTranslationImage GenerationImage-to-Image Translation
PaperPDFCode(official)

Abstract

Semantic Image Synthesis (SIS) is a subclass of image-to-image translation where a semantic layout is used to generate a photorealistic image. State-of-the-art conditional Generative Adversarial Networks (GANs) need a huge amount of paired data to accomplish this task while generic unpaired image-to-image translation frameworks underperform in comparison, because they color-code semantic layouts and learn correspondences in appearance instead of semantic content. Starting from the assumption that a high quality generated image should be segmented back to its semantic layout, we propose a new Unsupervised paradigm for SIS (USIS) that makes use of a self-supervised segmentation loss and whole image wavelet based discrimination. Furthermore, in order to match the high-frequency distribution of real images, a novel generator architecture in the wavelet domain is proposed. We test our methodology on 3 challenging datasets and demonstrate its ability to bridge the performance gap between paired and unpaired models.

Results

TaskDatasetMetricValueModel
Image-to-Image TranslationCOCO-Stuff Labels-to-PhotosFID28.6USIS-Wavelet
Image-to-Image TranslationCOCO-Stuff Labels-to-PhotosmIoU13.4USIS-Wavelet
Image-to-Image TranslationCityscapes Labels-to-PhotoFID50.14USIS-Wavelet
Image-to-Image TranslationCityscapes Labels-to-PhotomIoU42.32USIS-Wavelet
Image-to-Image TranslationADE20K Labels-to-PhotosFID34.5USIS-Wavelet
Image-to-Image TranslationADE20K Labels-to-PhotosmIoU16.95USIS-Wavelet
Image GenerationCOCO-Stuff Labels-to-PhotosFID28.6USIS-Wavelet
Image GenerationCOCO-Stuff Labels-to-PhotosmIoU13.4USIS-Wavelet
Image GenerationCityscapes Labels-to-PhotoFID50.14USIS-Wavelet
Image GenerationCityscapes Labels-to-PhotomIoU42.32USIS-Wavelet
Image GenerationADE20K Labels-to-PhotosFID34.5USIS-Wavelet
Image GenerationADE20K Labels-to-PhotosmIoU16.95USIS-Wavelet
1 Image, 2*2 StitchingCOCO-Stuff Labels-to-PhotosFID28.6USIS-Wavelet
1 Image, 2*2 StitchingCOCO-Stuff Labels-to-PhotosmIoU13.4USIS-Wavelet
1 Image, 2*2 StitchingCityscapes Labels-to-PhotoFID50.14USIS-Wavelet
1 Image, 2*2 StitchingCityscapes Labels-to-PhotomIoU42.32USIS-Wavelet
1 Image, 2*2 StitchingADE20K Labels-to-PhotosFID34.5USIS-Wavelet
1 Image, 2*2 StitchingADE20K Labels-to-PhotosmIoU16.95USIS-Wavelet

Related Papers

A Translation of Probabilistic Event Calculus into Markov Decision Processes2025-07-17fastWDM3D: Fast and Accurate 3D Healthy Tissue Inpainting2025-07-17Synthesizing Reality: Leveraging the Generative AI-Powered Platform Midjourney for Construction Worker Detection2025-07-17FashionPose: Text to Pose to Relight Image Generation for Personalized Fashion Visualization2025-07-17A Distributed Generative AI Approach for Heterogeneous Multi-Domain Environments under Data Sharing constraints2025-07-17Pixel Perfect MegaMed: A Megapixel-Scale Vision-Language Foundation Model for Generating High Resolution Medical Images2025-07-17FADE: Adversarial Concept Erasure in Flow Models2025-07-16Function-to-Style Guidance of LLMs for Code Translation2025-07-15