Instruct-IPT: All-in-One Image Processing Transformer via Weight Modulation

Yuchuan Tian, Jianhong Han, Hanting Chen, Yuanyuan Xi, Ning Ding, Jie Hu, Chao Xu, Yunhe Wang

2024-06-30Denoising Deblurring Rain Removal Image Restoration All

Abstract

Due to the unaffordable size and intensive computation costs of low-level vision models, All-in-One models that are designed to address a handful of low-level vision tasks simultaneously have been popular. However, existing All-in-One models are limited in terms of the range of tasks and performance. To overcome these limitations, we propose Instruct-IPT -- an All-in-One Image Processing Transformer (IPT) that could effectively address manifold image restoration tasks with large inter-task gaps, such as denoising, deblurring, deraining, dehazing, and desnowing. While most research propose feature adaptation methods, we reveal their failure in addressing highly distinct tasks, and suggest weight modulation that adapts weights to specific tasks. Firstly, we search for task-sensitive weights and introduce task-specific biases on top of them. Secondly, we conduct rank analysis for a good compression strategy and perform low-rank decomposition on the biases. Thirdly, we propose synchronous training that updates the task-general backbone model and the task-specific biases simultaneously. In this way, the model is instructed to learn both general and task-specific knowledge. Via our simple yet effective method that instructs the IPT to be task experts, Instruct-IPT could better cooperate between tasks with distinct characteristics at humble costs. As an additional feature, we enable Instruct-IPT to receive human prompts. We have conducted experiments on Instruct-IPT to demonstrate the effectiveness of our method on manifold tasks, and we have effectively extended our method to diffusion denoisers as well. The code is available at https://github.com/huawei-noah/Pretrained-IPT.

Results

Task	Dataset	Metric	Value	Model
Rain Removal	Rain100L	PSNR	39.35	Instruct-IPT
Rain Removal	Rain100L	SSIM	0.977	Instruct-IPT
Dehazing	SOTS Outdoor	PSNR	39.95	Instruct-IPT
Dehazing	SOTS Outdoor	SSIM	0.992	Instruct-IPT
Image Restoration	CSD	Average PSNR (dB)	40.12	Instruct-IPT
Image Dehazing	SOTS Outdoor	PSNR	39.95	Instruct-IPT
Image Dehazing	SOTS Outdoor	SSIM	0.992	Instruct-IPT
Denoising	CBSD68 sigma50	PSNR	28.61	Instruct-IPT
Image Deblurring	GoPro	PSNR	33.86	Instruct-IPT
Image Deblurring	GoPro	SSIM	0.967	Instruct-IPT
3D Architecture	CBSD68 sigma50	PSNR	28.61	Instruct-IPT
10-shot image generation	CSD	Average PSNR (dB)	40.12	Instruct-IPT
10-shot image generation	GoPro	PSNR	33.86	Instruct-IPT
10-shot image generation	GoPro	SSIM	0.967	Instruct-IPT
1 Image, 2*2 Stitchi	GoPro	PSNR	33.86	Instruct-IPT
1 Image, 2*2 Stitchi	GoPro	SSIM	0.967	Instruct-IPT
16k	GoPro	PSNR	33.86	Instruct-IPT
16k	GoPro	SSIM	0.967	Instruct-IPT

Abstract

Results

Task	Dataset	Metric	Value	Model
Rain Removal	Rain100L	PSNR	39.35	Instruct-IPT
Rain Removal	Rain100L	SSIM	0.977	Instruct-IPT
Dehazing	SOTS Outdoor	PSNR	39.95	Instruct-IPT
Dehazing	SOTS Outdoor	SSIM	0.992	Instruct-IPT
Image Restoration	CSD	Average PSNR (dB)	40.12	Instruct-IPT
Image Dehazing	SOTS Outdoor	PSNR	39.95	Instruct-IPT
Image Dehazing	SOTS Outdoor	SSIM	0.992	Instruct-IPT
Denoising	CBSD68 sigma50	PSNR	28.61	Instruct-IPT
Image Deblurring	GoPro	PSNR	33.86	Instruct-IPT
Image Deblurring	GoPro	SSIM	0.967	Instruct-IPT
3D Architecture	CBSD68 sigma50	PSNR	28.61	Instruct-IPT
10-shot image generation	CSD	Average PSNR (dB)	40.12	Instruct-IPT
10-shot image generation	GoPro	PSNR	33.86	Instruct-IPT
10-shot image generation	GoPro	SSIM	0.967	Instruct-IPT
1 Image, 2*2 Stitchi	GoPro	PSNR	33.86	Instruct-IPT
1 Image, 2*2 Stitchi	GoPro	SSIM	0.967	Instruct-IPT
16k	GoPro	PSNR	33.86	Instruct-IPT
16k	GoPro	SSIM	0.967	Instruct-IPT

Instruct-IPT: All-in-One Image Processing Transformer via Weight Modulation

Abstract

Results

Related Papers

Instruct-IPT: All-in-One Image Processing Transformer via Weight Modulation

Abstract

Results

Related Papers