TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Local Class-Specific and Global Image-Level Generative Adv...

Local Class-Specific and Global Image-Level Generative Adversarial Networks for Semantic-Guided Scene Generation

Hao Tang, Dan Xu, Yan Yan, Philip H. S. Torr, Nicu Sebe

2019-12-27CVPR 2020 6Scene GenerationImage Generation
PaperPDFCode(official)Code

Abstract

In this paper, we address the task of semantic-guided scene generation. One open challenge in scene generation is the difficulty of the generation of small objects and detailed local texture, which has been widely observed in global image-level generation methods. To tackle this issue, in this work we consider learning the scene generation in a local context, and correspondingly design a local class-specific generative network with semantic maps as a guidance, which separately constructs and learns sub-generators concentrating on the generation of different classes, and is able to provide more scene details. To learn more discriminative class-specific feature representations for the local generation, a novel classification module is also proposed. To combine the advantage of both the global image-level and the local class-specific generation, a joint generation network is designed with an attention fusion module and a dual-discriminator structure embedded. Extensive experiments on two scene image generation tasks show superior generation performance of the proposed model. The state-of-the-art results are established by large margins on both tasks and on challenging public benchmarks. The source code and trained models are available at https://github.com/Ha0Tang/LGGAN.

Results

TaskDatasetMetricValueModel
Image-to-Image TranslationcvusaKL2.55LGGAN
Image-to-Image TranslationcvusaPSNR22.5766LGGAN
Image-to-Image TranslationcvusaSD19.744LGGAN
Image-to-Image TranslationcvusaSSIM0.5238LGGAN
Image-to-Image TranslationDayton (256×256) - aerial-to-groundKL2.18LGGAN
Image-to-Image TranslationDayton (256×256) - aerial-to-groundPSNR22.9949LGGAN
Image-to-Image TranslationDayton (256×256) - aerial-to-groundSD19.6145LGGAN
Image-to-Image TranslationDayton (256×256) - aerial-to-groundSSIM0.5457LGGAN
Image GenerationcvusaKL2.55LGGAN
Image GenerationcvusaPSNR22.5766LGGAN
Image GenerationcvusaSD19.744LGGAN
Image GenerationcvusaSSIM0.5238LGGAN
Image GenerationDayton (256×256) - aerial-to-groundKL2.18LGGAN
Image GenerationDayton (256×256) - aerial-to-groundPSNR22.9949LGGAN
Image GenerationDayton (256×256) - aerial-to-groundSD19.6145LGGAN
Image GenerationDayton (256×256) - aerial-to-groundSSIM0.5457LGGAN
1 Image, 2*2 StitchingcvusaKL2.55LGGAN
1 Image, 2*2 StitchingcvusaPSNR22.5766LGGAN
1 Image, 2*2 StitchingcvusaSD19.744LGGAN
1 Image, 2*2 StitchingcvusaSSIM0.5238LGGAN
1 Image, 2*2 StitchingDayton (256×256) - aerial-to-groundKL2.18LGGAN
1 Image, 2*2 StitchingDayton (256×256) - aerial-to-groundPSNR22.9949LGGAN
1 Image, 2*2 StitchingDayton (256×256) - aerial-to-groundSD19.6145LGGAN
1 Image, 2*2 StitchingDayton (256×256) - aerial-to-groundSSIM0.5457LGGAN

Related Papers

World Model-Based End-to-End Scene Generation for Accident Anticipation in Autonomous Driving2025-07-17fastWDM3D: Fast and Accurate 3D Healthy Tissue Inpainting2025-07-17Synthesizing Reality: Leveraging the Generative AI-Powered Platform Midjourney for Construction Worker Detection2025-07-17FashionPose: Text to Pose to Relight Image Generation for Personalized Fashion Visualization2025-07-17A Distributed Generative AI Approach for Heterogeneous Multi-Domain Environments under Data Sharing constraints2025-07-17Pixel Perfect MegaMed: A Megapixel-Scale Vision-Language Foundation Model for Generating High Resolution Medical Images2025-07-17FADE: Adversarial Concept Erasure in Flow Models2025-07-16CharaConsist: Fine-Grained Consistent Character Generation2025-07-15