TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Large Scale Image Completion via Co-Modulated Generative A...

Large Scale Image Completion via Co-Modulated Generative Adversarial Networks

Shengyu Zhao, Jonathan Cui, Yilun Sheng, Yue Dong, Xiao Liang, Eric I Chang, Yan Xu

2021-03-18ICLR 2021 1Image InpaintingTranslationImage-to-Image Translation
PaperPDFCode(official)

Abstract

Numerous task-specific variants of conditional generative adversarial networks have been developed for image completion. Yet, a serious limitation remains that all existing algorithms tend to fail when handling large-scale missing regions. To overcome this challenge, we propose a generic new approach that bridges the gap between image-conditional and recent modulated unconditional generative architectures via co-modulation of both conditional and stochastic style representations. Also, due to the lack of good quantitative metrics for image completion, we propose the new Paired/Unpaired Inception Discriminative Score (P-IDS/U-IDS), which robustly measures the perceptual fidelity of inpainted images compared to real images via linear separability in a feature space. Experiments demonstrate superior performance in terms of both quality and diversity over state-of-the-art methods in free-form image completion and easy generalization to image-to-image translation. Code is available at https://github.com/zsyzzsoft/co-mod-gan.

Results

TaskDatasetMetricValueModel
Image GenerationFFHQ 512 x 512FID3.7CoModGAN
Image GenerationPlaces2FID2.92CoModGAN
Image GenerationPlaces2P-IDS19.64CoModGAN
Image GenerationPlaces2U-IDS35.78CoModGAN
Image GenerationCelebA-HQFID5.65CoModGAN
Image GenerationCelebA-HQP-IDS11.23CoModGAN
Image GenerationCelebA-HQU-IDS22.54CoModGAN
Image InpaintingFFHQ 512 x 512FID3.7CoModGAN
Image InpaintingPlaces2FID2.92CoModGAN
Image InpaintingPlaces2P-IDS19.64CoModGAN
Image InpaintingPlaces2U-IDS35.78CoModGAN
Image InpaintingCelebA-HQFID5.65CoModGAN
Image InpaintingCelebA-HQP-IDS11.23CoModGAN
Image InpaintingCelebA-HQU-IDS22.54CoModGAN

Related Papers

A Translation of Probabilistic Event Calculus into Markov Decision Processes2025-07-17Function-to-Style Guidance of LLMs for Code Translation2025-07-15RePaintGS: Reference-Guided Gaussian Splatting for Realistic and View-Consistent 3D Scene Inpainting2025-07-11Speak2Sign3D: A Multi-modal Pipeline for English Speech to American Sign Language Animation2025-07-09Pun Intended: Multi-Agent Translation of Wordplay with Contrastive Learning and Phonetic-Semantic Embeddings2025-07-09Unconditional Diffusion for Generative Sequential Recommendation2025-07-08GRAFT: A Graph-based Flow-aware Agentic Framework for Document-level Machine Translation2025-07-04TransLaw: Benchmarking Large Language Models in Multi-Agent Simulation of the Collaborative Translation2025-07-01