TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Instruct-IPT: All-in-One Image Processing Transformer via ...

Instruct-IPT: All-in-One Image Processing Transformer via Weight Modulation

Yuchuan Tian, Jianhong Han, Hanting Chen, Yuanyuan Xi, Ning Ding, Jie Hu, Chao Xu, Yunhe Wang

2024-06-30DenoisingDeblurringRain RemovalImage RestorationAll
PaperPDFCode(official)

Abstract

Due to the unaffordable size and intensive computation costs of low-level vision models, All-in-One models that are designed to address a handful of low-level vision tasks simultaneously have been popular. However, existing All-in-One models are limited in terms of the range of tasks and performance. To overcome these limitations, we propose Instruct-IPT -- an All-in-One Image Processing Transformer (IPT) that could effectively address manifold image restoration tasks with large inter-task gaps, such as denoising, deblurring, deraining, dehazing, and desnowing. While most research propose feature adaptation methods, we reveal their failure in addressing highly distinct tasks, and suggest weight modulation that adapts weights to specific tasks. Firstly, we search for task-sensitive weights and introduce task-specific biases on top of them. Secondly, we conduct rank analysis for a good compression strategy and perform low-rank decomposition on the biases. Thirdly, we propose synchronous training that updates the task-general backbone model and the task-specific biases simultaneously. In this way, the model is instructed to learn both general and task-specific knowledge. Via our simple yet effective method that instructs the IPT to be task experts, Instruct-IPT could better cooperate between tasks with distinct characteristics at humble costs. As an additional feature, we enable Instruct-IPT to receive human prompts. We have conducted experiments on Instruct-IPT to demonstrate the effectiveness of our method on manifold tasks, and we have effectively extended our method to diffusion denoisers as well. The code is available at https://github.com/huawei-noah/Pretrained-IPT.

Results

TaskDatasetMetricValueModel
Rain RemovalRain100LPSNR39.35Instruct-IPT
Rain RemovalRain100LSSIM0.977Instruct-IPT
DehazingSOTS OutdoorPSNR39.95Instruct-IPT
DehazingSOTS OutdoorSSIM0.992Instruct-IPT
Image RestorationCSDAverage PSNR (dB)40.12Instruct-IPT
Image DehazingSOTS OutdoorPSNR39.95Instruct-IPT
Image DehazingSOTS OutdoorSSIM0.992Instruct-IPT
DenoisingCBSD68 sigma50PSNR28.61Instruct-IPT
Image DeblurringGoProPSNR33.86Instruct-IPT
Image DeblurringGoProSSIM0.967Instruct-IPT
3D ArchitectureCBSD68 sigma50PSNR28.61Instruct-IPT
10-shot image generationCSDAverage PSNR (dB)40.12Instruct-IPT
10-shot image generationGoProPSNR33.86Instruct-IPT
10-shot image generationGoProSSIM0.967Instruct-IPT
1 Image, 2*2 StitchiGoProPSNR33.86Instruct-IPT
1 Image, 2*2 StitchiGoProSSIM0.967Instruct-IPT
16kGoProPSNR33.86Instruct-IPT
16kGoProSSIM0.967Instruct-IPT

Related Papers

fastWDM3D: Fast and Accurate 3D Healthy Tissue Inpainting2025-07-17Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models2025-07-17Similarity-Guided Diffusion for Contrastive Sequential Recommendation2025-07-16Unsupervised Part Discovery via Descriptor-Based Masked Image Restoration with Optimized Constraints2025-07-16HUG-VAS: A Hierarchical NURBS-Based Generative Model for Aortic Geometry Synthesis and Controllable Editing2025-07-15AirLLM: Diffusion Policy-based Adaptive LoRA for Remote Fine-Tuning of LLM over the Air2025-07-15Modeling Code: Is Text All You Need?2025-07-15All Eyes, no IMU: Learning Flight Attitude from Vision Alone2025-07-15