Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Style Injection in Diffusion: A Training-free Approach for Adapting Large-scale Diffusion Models for Style Transfer

Jiwoo Chung, Sangeek Hyun, Jae-Pil Heo

2023-12-11 · CVPR 2024 · Style Transfer

Abstract

Despite the impressive generative capabilities of diffusion models, existing diffusion model-based style transfer methods require inference-stage optimization (e.g. fine-tuning or textual inversion of style) which is time-consuming, or fails to leverage the generative ability of large-scale diffusion models. To address these issues, we introduce a novel artistic style transfer method based on a pre-trained large-scale diffusion model without any optimization. Specifically, we manipulate the features of self-attention layers as the way the cross-attention mechanism works; in the generation process, substituting the key and value of content with those of style image. This approach provides several desirable characteristics for style transfer including 1) preservation of content by transferring similar styles into similar image patches and 2) transfer of style based on similarity of local texture (e.g. edge) between content and style images. Furthermore, we introduce query preservation and attention temperature scaling to mitigate the issue of disruption of original content, and initial latent Adaptive Instance Normalization (AdaIN) to deal with the disharmonious color (failure to transfer the colors of style). Our experimental results demonstrate that our proposed method surpasses state-of-the-art methods in both conventional and diffusion-based style transfer baselines.
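The mechanism described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function names, the linear query blend, and the default values of `gamma` and `tau` are assumptions chosen for illustration, and the sketch works on single-head token matrices rather than on a real diffusion U-Net. It shows self-attention in which the content queries attend to the keys and values of the style image, with a blended (preserved) query and a temperature applied to the attention logits, followed by per-channel AdaIN on an initial latent.

```python
import numpy as np


def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def style_injected_attention(q_content, q_stylized, k_style, v_style,
                             gamma=0.75, tau=1.5):
    """Self-attention with style key/value substitution (illustrative).

    q_content:  (N, d) queries from the content image
    q_stylized: (N, d) queries from the image being generated
    k_style:    (M, d) keys from the style image
    v_style:    (M, d) values from the style image
    gamma, tau: hypothetical defaults for query blending and temperature
    """
    # Query preservation (assumed here to be a linear blend).
    q = gamma * q_content + (1.0 - gamma) * q_stylized
    d = q.shape[-1]
    # Temperature scaling: tau > 1 sharpens the attention map.
    logits = tau * (q @ k_style.T) / np.sqrt(d)
    weights = softmax(logits, axis=-1)
    # Each output token is a weighted mix of style values.
    return weights @ v_style


def adain(content_latent, style_latent, eps=1e-5):
    """Per-channel AdaIN on latents of shape (C, H, W)."""
    mu_c = content_latent.mean(axis=(1, 2), keepdims=True)
    sd_c = content_latent.std(axis=(1, 2), keepdims=True)
    mu_s = style_latent.mean(axis=(1, 2), keepdims=True)
    sd_s = style_latent.std(axis=(1, 2), keepdims=True)
    # Normalize content statistics, then apply the style statistics.
    return sd_s * (content_latent - mu_c) / (sd_c + eps) + mu_s
```

Because the softmax weights are non-negative and sum to one, every output token is a convex combination of style values, which is one way to see why content patches that produce similar queries receive similar style features.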

Results

Task                      Dataset     Metric      Value  Model
Sketch                    StyleBench  CLIP Score  0.604  StyleID
Style Transfer            StyleBench  CLIP Score  0.604  StyleID
2D Human Pose Estimation  StyleBench  CLIP Score  0.604  StyleID
2D Classification         StyleBench  CLIP Score  0.604  StyleID
1 Image, 2*2 Stitchi      StyleBench  CLIP Score  0.604  StyleID
Drawing Pictures          StyleBench  CLIP Score  0.604  StyleID

Related Papers

Transferring Styles for Reduced Texture Bias and Improved Robustness in Semantic Segmentation Networks (2025-07-14)
AnyI2V: Animating Any Conditional Image with Motion Control (2025-07-03)
Hita: Holistic Tokenizer for Autoregressive Image Generation (2025-07-03)
SA-LUT: Spatial Adaptive 4D Look-Up Table for Photorealistic Style Transfer (2025-06-16)
Fine-Grained control over Music Generation with Activation Steering (2025-06-11)
Training-Free Identity Preservation in Stylized Image Generation Using Diffusion Models (2025-06-07)
Towards Better Disentanglement in Non-Autoregressive Zero-Shot Expressive Voice Conversion (2025-06-04)
SVGenius: Benchmarking LLMs in SVG Understanding, Editing and Generation (2025-06-03)