TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Hierarchical Dynamic Image Harmonization

Hierarchical Dynamic Image Harmonization

Haoxing Chen, Zhangxuan Gu, Yaohui Li, Jun Lan, Changhua Meng, Weiqiang Wang, Huaxiong Li

2022-11-16Image Harmonization
PaperPDFCode(official)

Abstract

Image harmonization is a critical task in computer vision, which aims to adjust the foreground to make it compatible with the background. Recent works mainly focus on using global transformations (i.e., normalization and color curve rendering) to achieve visual consistency. However, these models ignore local visual consistency and their huge model sizes limit their harmonization ability on edge devices. In this paper, we propose a hierarchical dynamic network (HDNet) to adapt features from local to global view for better feature transformation in efficient image harmonization. Inspired by the success of various dynamic models, local dynamic (LD) module and mask-aware global dynamic (MGD) module are proposed in this paper. Specifically, LD matches local representations between the foreground and background regions based on semantic similarities, then adaptively adjust every foreground local representation according to the appearance of its $K$-nearest neighbor background regions. In this way, LD can produce more realistic images at a more fine-grained level, and simultaneously enjoy the characteristic of semantic alignment. The MGD effectively applies distinct convolution to the foreground and background, learning the representations of foreground and background regions as well as their correlations to the global harmonization, facilitating local visual consistency for the images much more efficiently. Experimental results demonstrate that the proposed HDNet significantly reduces the total model parameters by more than 80\% compared to previous methods, while still attaining state-of-the-art performance on the popular iHarmony4 dataset. Notably, the HDNet achieves a 4\% improvement in PSNR and a 19\% reduction in MSE compared to the prior state-of-the-art methods.

Results

TaskDatasetMetricValueModel
Image GenerationiHarmony4MSE16.55HDNet
Image GenerationiHarmony4PSNR40.46HDNet
Image GenerationiHarmony4fMSE179.49HDNet
Image GenerationiHarmony4MSE24.99HDNet-lite
Image GenerationiHarmony4PSNR38.63HDNet-lite
Image GenerationiHarmony4fMSE260.65HDNet-lite
Image GenerationHAdobe5k(1024$\times$1024)MSE13.24HDNet
Image GenerationHAdobe5k(1024$\times$1024)PSNR41.56HDNet
Image GenerationHAdobe5k(1024$\times$1024)SSIM0.9931HDNet
Image GenerationHAdobe5k(1024$\times$1024)fMSE102.53HDNet

Related Papers

Gen-n-Val: Agentic Image Data Generation and Validation2025-06-05DISARM++: Beyond scanner-free harmonization2025-05-06Efficient and robust 3D blind harmonization for large domain gaps2025-04-30Learning to Harmonize Cross-vendor X-ray Images by Non-linear Image Dynamics Correction2025-04-14SceneBooth: Diffusion-based Framework for Subject-preserved Text-to-Image Generation2025-01-07HarmonyIQA: Pioneering Benchmark and Model for Image Harmonization Quality Assessment2025-01-02Retrieval Augmented Image Harmonization2024-12-18Image Harmonization using Robust Restricted CDF Matching2024-11-20