TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/High-Resolution Photorealistic Image Translation in Real-T...

High-Resolution Photorealistic Image Translation in Real-Time: A Laplacian Pyramid Translation Network

Jie Liang, Hui Zeng, Lei Zhang

2021-05-19CVPR 2021 1Image EnhancementAttributePhoto RetouchingTranslation4kImage-to-Image Translation
PaperPDFCode(official)

Abstract

Existing image-to-image translation (I2IT) methods are either constrained to low-resolution images or long inference time due to their heavy computational burden on the convolution of high-resolution feature maps. In this paper, we focus on speeding-up the high-resolution photorealistic I2IT tasks based on closed-form Laplacian pyramid decomposition and reconstruction. Specifically, we reveal that the attribute transformations, such as illumination and color manipulation, relate more to the low-frequency component, while the content details can be adaptively refined on high-frequency components. We consequently propose a Laplacian Pyramid Translation Network (LPTN) to simultaneously perform these two tasks, where we design a lightweight network for translating the low-frequency component with reduced resolution and a progressive masking strategy to efficiently refine the high-frequency ones. Our model avoids most of the heavy computation consumed by processing high-resolution feature maps and faithfully preserves the image details. Extensive experimental results on various tasks demonstrate that the proposed method can translate 4K images in real-time using one normal GPU while achieving comparable transformation performance against existing methods. Datasets and codes are available: https://github.com/csjliang/LPTN.

Results

TaskDatasetMetricValueModel
Photo RetouchingMIT-Adobe 5k (480p)PSNR22.12LPTN (L=3)
Photo RetouchingMIT-Adobe 5k (480p)SSIM0.878LPTN (L=3)
Photo RetouchingMIT-Adobe 5k (480p)PSNR21.99DPE
Photo RetouchingMIT-Adobe 5k (480p)SSIM0.875DPE
Photo RetouchingMIT-Adobe 5kPSNR22.02LPTN (L=3)
Photo RetouchingMIT-Adobe 5kSSIM0.879LPTN (L=3)
Photo RetouchingMIT-Adobe 5k (1080p)PSNR22.09LPTN (L=3)
Photo RetouchingMIT-Adobe 5k (1080p)SSIM0.883LPTN (L=3)
Photo RetouchingMIT-Adobe 5k (1080p)PSNR21.94DPE
Photo RetouchingMIT-Adobe 5k (1080p)SSIM0.885DPE

Related Papers

A Translation of Probabilistic Event Calculus into Markov Decision Processes2025-07-17MGFFD-VLM: Multi-Granularity Prompt Learning for Face Forgery Detection with VLM2025-07-16Non-Adaptive Adversarial Face Generation2025-07-16Attributes Shape the Embedding Space of Face Recognition Models2025-07-15COLIBRI Fuzzy Model: Color Linguistic-Based Representation and Interpretation2025-07-15Function-to-Style Guidance of LLMs for Code Translation2025-07-15Ref-Long: Benchmarking the Long-context Referencing Capability of Long-context Language Models2025-07-13Model Parallelism With Subnetwork Data Parallelism2025-07-11