TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Soft Truncation: A Universal Training Technique of Score-b...

Soft Truncation: A Universal Training Technique of Score-based Diffusion Model for High Precision Score Estimation

Dongjun Kim, Seungjae Shin, Kyungwoo Song, Wanmo Kang, Il-Chul Moon

2021-06-10Density EstimationImage Generation
PaperPDFCode(official)

Abstract

Recent advances in diffusion models bring state-of-the-art performance on image generation tasks. However, empirical results from previous research in diffusion models imply an inverse correlation between density estimation and sample generation performances. This paper investigates with sufficient empirical evidence that such inverse correlation happens because density estimation is significantly contributed by small diffusion time, whereas sample generation mainly depends on large diffusion time. However, training a score network well across the entire diffusion time is demanding because the loss scale is significantly imbalanced at each diffusion time. For successful training, therefore, we introduce Soft Truncation, a universally applicable training technique for diffusion models, that softens the fixed and static truncation hyperparameter into a random variable. In experiments, Soft Truncation achieves state-of-the-art performance on CIFAR-10, CelebA, CelebA-HQ 256x256, and STL-10 datasets.

Results

TaskDatasetMetricValueModel
Image GenerationSTL-10FID7.71UNCSN++ (RVE) + ST
Image GenerationSTL-10Inception score13.43UNCSN++ (RVE) + ST
Image GenerationImageNet 32x32FID8.42DDPM++ (VP, NLL) + ST
Image GenerationImageNet 32x32Inception score11.82DDPM++ (VP, NLL) + ST
Image GenerationImageNet 32x32bpd3.85DDPM++ (VP, NLL) + ST
Image GenerationCelebA 64x64FID1.9DDPM++ (VP, FID) + ST
Image GenerationCelebA 64x64bits/dimension2.1DDPM++ (VP, FID) + ST
Image GenerationCelebA 64x64FID2.9DDPM++ (VP, NLL) + ST
Image GenerationCelebA 64x64bits/dimension1.96DDPM++ (VP, NLL) + ST
Image GenerationCelebA 64x64bits/dimension1.97UNCSN++ (RVE) + ST
Image GenerationFFHQ 256 x 256FID5.54UDM (RVE) + ST
Image GenerationLSUN Bedroom 256 x 256FID4.57UDM (RVE) + ST
Image GenerationCelebA-HQ 256x256FID7.16UNCSN++ (RVE) + ST

Related Papers

Missing value imputation with adversarial random forests -- MissARF2025-07-21fastWDM3D: Fast and Accurate 3D Healthy Tissue Inpainting2025-07-17Synthesizing Reality: Leveraging the Generative AI-Powered Platform Midjourney for Construction Worker Detection2025-07-17FashionPose: Text to Pose to Relight Image Generation for Personalized Fashion Visualization2025-07-17A Distributed Generative AI Approach for Heterogeneous Multi-Domain Environments under Data Sharing constraints2025-07-17Pixel Perfect MegaMed: A Megapixel-Scale Vision-Language Foundation Model for Generating High Resolution Medical Images2025-07-17FADE: Adversarial Concept Erasure in Flow Models2025-07-163C-FBI: A Combinatorial method using Convolutions for Circle Fitting in Blurry Images2025-07-15