Pre-Trained Image Processing Transformer

Hanting Chen, Yunhe Wang, Tianyu Guo, Chang Xu, Yiping Deng, Zhenhua Liu, Siwei Ma, Chunjing Xu, Chao Xu, Wen Gao

2020-12-01CVPR 2021 1Denoising Super-Resolution Rain Removal Color Image Denoising Image Super-Resolution Contrastive Learning Single Image Deraining

Paper PDF Code Code Code(official)Code Code Code

Abstract

As the computing power of modern hardware is increasing strongly, pre-trained deep learning models (e.g., BERT, GPT-3) learned on large-scale datasets have shown their effectiveness over conventional methods. The big progress is mainly contributed to the representation ability of transformer and its variant architectures. In this paper, we study the low-level computer vision task (e.g., denoising, super-resolution and deraining) and develop a new pre-trained model, namely, image processing transformer (IPT). To maximally excavate the capability of transformer, we present to utilize the well-known ImageNet benchmark for generating a large amount of corrupted image pairs. The IPT model is trained on these images with multi-heads and multi-tails. In addition, the contrastive learning is introduced for well adapting to different image processing tasks. The pre-trained model can therefore efficiently employed on desired task after fine-tuning. With only one pre-trained model, IPT outperforms the current state-of-the-art methods on various low-level benchmarks. Code is available at https://github.com/huawei-noah/Pretrained-IPT and https://gitee.com/mindspore/mindspore/tree/master/model_zoo/research/cv/IPT

Results

Task	Dataset	Metric	Value	Model
Super-Resolution	BSD100 - 2x upscaling	PSNR	32.48	IPT
Super-Resolution	Set14 - 3x upscaling	PSNR	30.85	IPT
Super-Resolution	Urban100 - 3x upscaling	PSNR	29.49	IPT
Rain Removal	Rain100L	PSNR	41.62	IPT
Rain Removal	Rain100L	SSIM	0.988	IPT
Denoising	Urban100 sigma50	PSNR	29.71	IPT
Denoising	CBSD68 sigma50	PSNR	29.39	IPT
Image Super-Resolution	BSD100 - 2x upscaling	PSNR	32.48	IPT
Image Super-Resolution	Set14 - 3x upscaling	PSNR	30.85	IPT
Image Super-Resolution	Urban100 - 3x upscaling	PSNR	29.49	IPT
3D Architecture	Urban100 sigma50	PSNR	29.71	IPT
3D Architecture	CBSD68 sigma50	PSNR	29.39	IPT
3D Object Super-Resolution	BSD100 - 2x upscaling	PSNR	32.48	IPT
3D Object Super-Resolution	Set14 - 3x upscaling	PSNR	30.85	IPT
3D Object Super-Resolution	Urban100 - 3x upscaling	PSNR	29.49	IPT
16k	BSD100 - 2x upscaling	PSNR	32.48	IPT
16k	Set14 - 3x upscaling	PSNR	30.85	IPT
16k	Urban100 - 3x upscaling	PSNR	29.49	IPT

Pre-Trained Image Processing Transformer

Abstract

Results

Related Papers

Pre-Trained Image Processing Transformer

Abstract

Results

Related Papers