Feature Quantization Improves GAN Training

Yang Zhao, Chunyuan Li, Ping Yu, Jianfeng Gao, Changyou Chen

Date: 2020-04-05
Venue: ICML 2020
Tasks: Unsupervised Image-To-Image Translation, Quantization, Translation, Image Generation, Face Generation, Conditional Image Generation, Image-to-Image Translation

Abstract

The instability in GAN training has been a long-standing problem despite remarkable research efforts. We identify that instability issues stem from difficulties of performing feature matching with mini-batch statistics, due to a fragile balance between the fixed target distribution and the progressively generated distribution. In this work, we propose Feature Quantization (FQ) for the discriminator, to embed both true and fake data samples into a shared discrete space. The quantized values of FQ are constructed as an evolving dictionary, which is consistent with feature statistics of the recent distribution history. Hence, FQ implicitly enables robust feature matching in a compact space. Our method can be easily plugged into existing GAN models, with little computational overhead in training. We apply FQ to 3 representative GAN models on 9 benchmarks: BigGAN for image generation, StyleGAN for face synthesis, and U-GAT-IT for unsupervised image-to-image translation. Extensive experimental results show that the proposed FQ-GAN can improve the FID scores of baseline methods by a large margin on a variety of tasks, achieving new state-of-the-art performance.
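
The quantization step described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names `quantize` and `momentum_update`, the squared L2 nearest-neighbour lookup, and the simplified per-entry momentum rule with a `decay` parameter are all assumptions chosen to show how discriminator features could be snapped to a shared dictionary that drifts toward recent feature statistics.

```python
import numpy as np


def quantize(features, dictionary):
    """Replace each feature vector with its nearest dictionary entry.

    features:   (n, d) array of discriminator features (real and fake alike).
    dictionary: (k, d) array of dictionary entries.
    Returns the quantized features (n, d) and the chosen indices (n,).
    """
    # Pairwise squared L2 distances between features and entries: shape (n, k).
    diffs = features[:, None, :] - dictionary[None, :, :]
    dists = (diffs ** 2).sum(axis=-1)
    idx = dists.argmin(axis=1)
    return dictionary[idx], idx


def momentum_update(dictionary, features, idx, decay=0.9):
    """Move each used entry toward the mean of the features assigned to it.

    Assumed simplified rule: entry <- decay * entry + (1 - decay) * batch_mean.
    Entries that received no features in this batch are left unchanged.
    """
    updated = dictionary.copy()
    for j in np.unique(idx):
        batch_mean = features[idx == j].mean(axis=0)
        updated[j] = decay * dictionary[j] + (1.0 - decay) * batch_mean
    return updated


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    dictionary = rng.normal(size=(8, 4))   # 8 hypothetical entries of dimension 4
    features = rng.normal(size=(16, 4))    # a mixed batch of real and fake features
    quantized, idx = quantize(features, dictionary)
    dictionary = momentum_update(dictionary, features, idx)
    print(quantized.shape, idx.shape, dictionary.shape)
```

Because real and fake samples share one dictionary, both are compared in the same discrete space, and the momentum update keeps that space aligned with the recent history of features rather than a single mini-batch. In a trainable model the quantized features would also need a gradient path back to the encoder (for example a straight-through estimator), which this sketch omits.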

Results

Task | Dataset | Metric | Value | Model
Image-to-Image Translation | anime-to-selfie | Kernel Inception Distance | 10.23 | FQ-GAN
Image-to-Image Translation | selfie-to-anime | Kernel Inception Distance | 11.4 | FQ-GAN
Image Generation | FFHQ 1024 x 1024 | FID | 3.19 | FQ-GAN
Image Generation | anime-to-selfie | Kernel Inception Distance | 10.23 | FQ-GAN
Image Generation | selfie-to-anime | Kernel Inception Distance | 11.4 | FQ-GAN
Image Generation | CIFAR-10 | FID | 5.34 | FQ-GAN
Image Generation | CIFAR-10 | Inception Score | 8.5 | FQ-GAN
Image Generation | CIFAR-100 | FID | 7.15 | FQ-GAN
Image Generation | CIFAR-100 | Inception Score | 9.74 | FQ-GAN
Image Generation | ImageNet 64x64 | FID | 9.67 | FQ-GAN
Image Generation | ImageNet 64x64 | Inception Score | 25.96 | FQ-GAN
Image Generation | ImageNet 128x128 | FID | 13.77 | FQ-GAN
Image Generation | ImageNet 128x128 | Inception Score | 54.36 | FQ-GAN
Conditional Image Generation | CIFAR-10 | FID | 5.34 | FQ-GAN
Conditional Image Generation | CIFAR-10 | Inception Score | 8.5 | FQ-GAN
Conditional Image Generation | CIFAR-100 | FID | 7.15 | FQ-GAN
Conditional Image Generation | CIFAR-100 | Inception Score | 9.74 | FQ-GAN
Conditional Image Generation | ImageNet 64x64 | FID | 9.67 | FQ-GAN
Conditional Image Generation | ImageNet 64x64 | Inception Score | 25.96 | FQ-GAN
Conditional Image Generation | ImageNet 128x128 | FID | 13.77 | FQ-GAN
Conditional Image Generation | ImageNet 128x128 | Inception Score | 54.36 | FQ-GAN
1 Image, 2*2 Stitching | anime-to-selfie | Kernel Inception Distance | 10.23 | FQ-GAN
1 Image, 2*2 Stitching | selfie-to-anime | Kernel Inception Distance | 11.4 | FQ-GAN
