TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Pre-Trained Image Processing Transformer

Pre-Trained Image Processing Transformer

Hanting Chen, Yunhe Wang, Tianyu Guo, Chang Xu, Yiping Deng, Zhenhua Liu, Siwei Ma, Chunjing Xu, Chao Xu, Wen Gao

2020-12-01CVPR 2021 1DenoisingSuper-ResolutionRain RemovalColor Image DenoisingImage Super-ResolutionContrastive LearningSingle Image Deraining
PaperPDFCodeCodeCode(official)CodeCodeCode

Abstract

As the computing power of modern hardware is increasing strongly, pre-trained deep learning models (e.g., BERT, GPT-3) learned on large-scale datasets have shown their effectiveness over conventional methods. The big progress is mainly contributed to the representation ability of transformer and its variant architectures. In this paper, we study the low-level computer vision task (e.g., denoising, super-resolution and deraining) and develop a new pre-trained model, namely, image processing transformer (IPT). To maximally excavate the capability of transformer, we present to utilize the well-known ImageNet benchmark for generating a large amount of corrupted image pairs. The IPT model is trained on these images with multi-heads and multi-tails. In addition, the contrastive learning is introduced for well adapting to different image processing tasks. The pre-trained model can therefore efficiently employed on desired task after fine-tuning. With only one pre-trained model, IPT outperforms the current state-of-the-art methods on various low-level benchmarks. Code is available at https://github.com/huawei-noah/Pretrained-IPT and https://gitee.com/mindspore/mindspore/tree/master/model_zoo/research/cv/IPT

Results

TaskDatasetMetricValueModel
Super-ResolutionBSD100 - 2x upscalingPSNR32.48IPT
Super-ResolutionSet14 - 3x upscalingPSNR30.85IPT
Super-ResolutionUrban100 - 3x upscalingPSNR29.49IPT
Rain RemovalRain100LPSNR41.62IPT
Rain RemovalRain100LSSIM0.988IPT
DenoisingUrban100 sigma50PSNR29.71IPT
DenoisingCBSD68 sigma50PSNR29.39IPT
Image Super-ResolutionBSD100 - 2x upscalingPSNR32.48IPT
Image Super-ResolutionSet14 - 3x upscalingPSNR30.85IPT
Image Super-ResolutionUrban100 - 3x upscalingPSNR29.49IPT
3D ArchitectureUrban100 sigma50PSNR29.71IPT
3D ArchitectureCBSD68 sigma50PSNR29.39IPT
3D Object Super-ResolutionBSD100 - 2x upscalingPSNR32.48IPT
3D Object Super-ResolutionSet14 - 3x upscalingPSNR30.85IPT
3D Object Super-ResolutionUrban100 - 3x upscalingPSNR29.49IPT
16kBSD100 - 2x upscalingPSNR32.48IPT
16kSet14 - 3x upscalingPSNR30.85IPT
16kUrban100 - 3x upscalingPSNR29.49IPT

Related Papers

fastWDM3D: Fast and Accurate 3D Healthy Tissue Inpainting2025-07-17Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models2025-07-17SpectraLift: Physics-Guided Spectral-Inversion Network for Self-Supervised Hyperspectral Image Super-Resolution2025-07-17SemCSE: Semantic Contrastive Sentence Embeddings Using LLM-Generated Summaries For Scientific Abstracts2025-07-17HapticCap: A Multimodal Dataset and Task for Understanding User Experience of Vibration Haptic Signals2025-07-17Overview of the TalentCLEF 2025: Skill and Job Title Intelligence for Human Capital Management2025-07-17SGCL: Unifying Self-Supervised and Supervised Learning for Graph Recommendation2025-07-17Similarity-Guided Diffusion for Contrastive Sequential Recommendation2025-07-16