TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Image Synthesis From Reconfigurable Layout and Style

Image Synthesis From Reconfigurable Layout and Style

Wei Sun, Tianfu Wu

2019-08-20ICCV 2019 10Layout-to-Image GenerationImage Generation
PaperPDFCodeCode(official)CodeCode

Abstract

Despite remarkable recent progress on both unconditional and conditional image synthesis, it remains a long-standing problem to learn generative models that are capable of synthesizing realistic and sharp images from reconfigurable spatial layout (i.e., bounding boxes + class labels in an image lattice) and style (i.e., structural and appearance variations encoded by latent vectors), especially at high resolution. By reconfigurable, it means that a model can preserve the intrinsic one-to-many mapping from a given layout to multiple plausible images with different styles, and is adaptive with respect to perturbations of a layout and style latent code. In this paper, we present a layout- and style-based architecture for generative adversarial networks (termed LostGANs) that can be trained end-to-end to generate images from reconfigurable layout and style. Inspired by the vanilla StyleGAN, the proposed LostGAN consists of two new components: (i) learning fine-grained mask maps in a weakly-supervised manner to bridge the gap between layouts and images, and (ii) learning object instance-specific layout-aware feature normalization (ISLA-Norm) in the generator to realize multi-object style generation. In experiments, the proposed method is tested on the COCO-Stuff dataset and the Visual Genome dataset with state-of-the-art performance obtained. The code and pretrained models are available at \url{https://github.com/iVMCL/LostGANs}.

Results

TaskDatasetMetricValueModel
Image GenerationCOCO-Stuff 128x128FID29.65LostGAN
Image GenerationCOCO-Stuff 128x128Inception Score13.8LostGAN
Image GenerationCOCO-Stuff 128x128SceneFID20.03LostGAN
Image GenerationCOCO-Stuff 64x64FID34.31LostGAN
Image GenerationCOCO-Stuff 64x64Inception Score9.8LostGAN
Image GenerationVisual Genome 64x64FID34.75LostGAN
Image GenerationVisual Genome 64x64Inception Score8.7LostGAN
Image GenerationVisual Genome 128x128FID29.36LostGAN
Image GenerationVisual Genome 128x128Inception Score11.1LostGAN
Image GenerationVisual Genome 128x128SceneFID13.17LostGAN

Related Papers

fastWDM3D: Fast and Accurate 3D Healthy Tissue Inpainting2025-07-17Synthesizing Reality: Leveraging the Generative AI-Powered Platform Midjourney for Construction Worker Detection2025-07-17FashionPose: Text to Pose to Relight Image Generation for Personalized Fashion Visualization2025-07-17A Distributed Generative AI Approach for Heterogeneous Multi-Domain Environments under Data Sharing constraints2025-07-17Pixel Perfect MegaMed: A Megapixel-Scale Vision-Language Foundation Model for Generating High Resolution Medical Images2025-07-17FADE: Adversarial Concept Erasure in Flow Models2025-07-16CharaConsist: Fine-Grained Consistent Character Generation2025-07-15CATVis: Context-Aware Thought Visualization2025-07-15