TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Wavelet Diffusion Models are fast and scalable Image Gener...

Wavelet Diffusion Models are fast and scalable Image Generators

Hao Phung, Quan Dao, Anh Tran

2022-11-29CVPR 2023 1Image GenerationBlocking
PaperPDFCode(official)

Abstract

Diffusion models are rising as a powerful solution for high-fidelity image generation, which exceeds GANs in quality in many circumstances. However, their slow training and inference speed is a huge bottleneck, blocking them from being used in real-time applications. A recent DiffusionGAN method significantly decreases the models' running time by reducing the number of sampling steps from thousands to several, but their speeds still largely lag behind the GAN counterparts. This paper aims to reduce the speed gap by proposing a novel wavelet-based diffusion scheme. We extract low-and-high frequency components from both image and feature levels via wavelet decomposition and adaptively handle these components for faster processing while maintaining good generation quality. Furthermore, we propose to use a reconstruction term, which effectively boosts the model training convergence. Experimental results on CelebA-HQ, CIFAR-10, LSUN-Church, and STL-10 datasets prove our solution is a stepping-stone to offering real-time and high-fidelity diffusion models. Our code and pre-trained checkpoints are available at \url{https://github.com/VinAIResearch/WaveDiff.git}.

Results

TaskDatasetMetricValueModel
Image GenerationSTL-10FID12.93WaveDiff
Image GenerationSTL-10NFE4WaveDiff
Image GenerationSTL-10Recall0.41WaveDiff
Image GenerationCelebA-HQ 1024x1024FID5.98WaveDiff
Image GenerationCelebA-HQ 1024x1024NFE2WaveDiff
Image GenerationCelebA-HQ 256x256FID5.94WaveDiff
Image GenerationCelebA-HQ 256x256NFE2WaveDiff
Image GenerationCelebA-HQ 256x256Recall0.37WaveDiff
Image GenerationCelebA-HQ 512x512FID6.4WaveDiff
Image GenerationCelebA-HQ 512x512NFE2WaveDiff
Image GenerationCelebA-HQ 512x512Recall0.35WaveDiff
Image GenerationLSUN Churches 256 x 256FID5.06WaveDiff
Image GenerationLSUN Churches 256 x 256NFE4WaveDiff
Image GenerationLSUN Churches 256 x 256Recall0.4WaveDiff

Related Papers

fastWDM3D: Fast and Accurate 3D Healthy Tissue Inpainting2025-07-17Synthesizing Reality: Leveraging the Generative AI-Powered Platform Midjourney for Construction Worker Detection2025-07-17FashionPose: Text to Pose to Relight Image Generation for Personalized Fashion Visualization2025-07-17A Distributed Generative AI Approach for Heterogeneous Multi-Domain Environments under Data Sharing constraints2025-07-17Pixel Perfect MegaMed: A Megapixel-Scale Vision-Language Foundation Model for Generating High Resolution Medical Images2025-07-17FADE: Adversarial Concept Erasure in Flow Models2025-07-16CharaConsist: Fine-Grained Consistent Character Generation2025-07-15CATVis: Context-Aware Thought Visualization2025-07-15