TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/StyleGAN-XL: Scaling StyleGAN to Large Diverse Datasets

StyleGAN-XL: Scaling StyleGAN to Large Diverse Datasets

Axel Sauer, Katja Schwarz, Andreas Geiger

2022-02-01Image Generation
PaperPDFCodeCode(official)

Abstract

Computer graphics has experienced a recent surge of data-centric approaches for photorealistic and controllable content creation. StyleGAN in particular sets new standards for generative modeling regarding image quality and controllability. However, StyleGAN's performance severely degrades on large unstructured datasets such as ImageNet. StyleGAN was designed for controllability; hence, prior works suspect its restrictive design to be unsuitable for diverse datasets. In contrast, we find the main limiting factor to be the current training strategy. Following the recently introduced Projected GAN paradigm, we leverage powerful neural network priors and a progressive growing strategy to successfully train the latest StyleGAN3 generator on ImageNet. Our final model, StyleGAN-XL, sets a new state-of-the-art on large-scale image synthesis and is the first to generate images at a resolution of $1024^2$ at such a dataset scale. We demonstrate that this model can invert and edit images beyond the narrow domain of portraits or specific object classes.

Results

TaskDatasetMetricValueModel
Image GenerationPokemon 1024x1024FID25.47StyleGAN-XL
Image GenerationImageNet 64x64FID1.51StyleGAN-XL
Image GenerationImageNet 64x64NFE1StyleGAN-XL
Image GenerationImageNet 32x32FID1.1StyleGAN-XL
Image GenerationFFHQ 256 x 256FID2.19StyleGAN-XL
Image GenerationFFHQ 256 x 256FD240.07StyleGAN-XL (DINOv2)
Image GenerationFFHQ 256 x 256Precision0.77StyleGAN-XL (DINOv2)
Image GenerationFFHQ 256 x 256Recall0.43StyleGAN-XL (DINOv2)
Image GenerationFFHQ 1024 x 1024FID2.02StyleGAN-XL
Image GenerationFFHQ 512 x 512FID2.41StyleGAN-XL
Image GenerationPokemon 256x256FID23.97StyleGAN-XL
Image GenerationImageNet 128x128FID1.81StyleGAN-XL
Image GenerationImageNet 512x512FID2.4StyleGAN-XL
Image GenerationImageNet 256x256FID2.3StyleGAN-XL

Related Papers

fastWDM3D: Fast and Accurate 3D Healthy Tissue Inpainting2025-07-17Synthesizing Reality: Leveraging the Generative AI-Powered Platform Midjourney for Construction Worker Detection2025-07-17FashionPose: Text to Pose to Relight Image Generation for Personalized Fashion Visualization2025-07-17A Distributed Generative AI Approach for Heterogeneous Multi-Domain Environments under Data Sharing constraints2025-07-17Pixel Perfect MegaMed: A Megapixel-Scale Vision-Language Foundation Model for Generating High Resolution Medical Images2025-07-17FADE: Adversarial Concept Erasure in Flow Models2025-07-16CharaConsist: Fine-Grained Consistent Character Generation2025-07-15CATVis: Context-Aware Thought Visualization2025-07-15