TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Spatial-Separated Curve Rendering Network for Efficient an...

Spatial-Separated Curve Rendering Network for Efficient and High-Resolution Image Harmonization

Jingtang Liang, Xiaodong Cun, Chi-Man Pun, Jue Wang

2021-09-13Vocal Bursts Intensity PredictionImage HarmonizationPlaying the Game of 2048Image-to-Image Translation
PaperPDFCode(official)Code

Abstract

Image harmonization aims to modify the color of the composited region with respect to the specific background. Previous works model this task as a pixel-wise image-to-image translation using UNet family structures. However, the model size and computational cost limit the ability of their models on edge devices and higher-resolution images. To this end, we propose a novel spatial-separated curve rendering network(S$^2$CRNet) for efficient and high-resolution image harmonization for the first time. In S$^2$CRNet, we firstly extract the spatial-separated embeddings from the thumbnails of the masked foreground and background individually. Then, we design a curve rendering module(CRM), which learns and combines the spatial-specific knowledge using linear layers to generate the parameters of the piece-wise curve mapping in the foreground region. Finally, we directly render the original high-resolution images using the learned color curve. Besides, we also make two extensions of the proposed framework via the Cascaded-CRM and Semantic-CRM for cascaded refinement and semantic guidance, respectively. Experiments show that the proposed method reduces more than 90% parameters compared with previous methods but still achieves the state-of-the-art performance on both synthesized iHarmony4 and real-world DIH test sets. Moreover, our method can work smoothly on higher resolution images(eg., $2048\times2048$) in 0.1 seconds with much lower GPU computational resources than all existing methods. The code will be made available at \url{http://github.com/stefanLeong/S2CRNet}.

Results

TaskDatasetMetricValueModel
Image GenerationiHarmony4MSE35.58S2CRNet-VGG
Image GenerationiHarmony4PSNR37.18S2CRNet-VGG
Image GenerationiHarmony4fMSE274.99S2CRNet-VGG

Related Papers

CycleVAR: Repurposing Autoregressive Model for Unsupervised One-Step Image Translation2025-06-29ThermalDiffusion: Visual-to-Thermal Image-to-Image Translation for Autonomous Navigation2025-06-26Transforming H&E images into IHC: A Variance-Penalized GAN for Precision Oncology2025-06-23Optimal Transport Driven Asymmetric Image-to-Image Translation for Nuclei Segmentation of Histological Images2025-06-08Gen-n-Val: Agentic Image Data Generation and Validation2025-06-05Deep learning image burst stacking to reconstruct high-resolution ground-based solar observations2025-06-05Multi-Platform Methane Plume Detection via Model and Domain Adaptation2025-06-02Segmenting France Across Four Centuries2025-05-30