TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Uformer: A General U-Shaped Transformer for Image Restorat...

Uformer: A General U-Shaped Transformer for Image Restoration

Zhendong Wang, Xiaodong Cun, Jianmin Bao, Wengang Zhou, Jianzhuang Liu, Houqiang Li

2021-06-06CVPR 2022 1DenoisingDeblurringImage Defocus DeblurringImage DenoisingImage EnhancementImage DeblurringSingle Image DesnowingRain RemovalImage Restoration
PaperPDFCodeCode(official)CodeCode

Abstract

In this paper, we present Uformer, an effective and efficient Transformer-based architecture for image restoration, in which we build a hierarchical encoder-decoder network using the Transformer block. In Uformer, there are two core designs. First, we introduce a novel locally-enhanced window (LeWin) Transformer block, which performs nonoverlapping window-based self-attention instead of global self-attention. It significantly reduces the computational complexity on high resolution feature map while capturing local context. Second, we propose a learnable multi-scale restoration modulator in the form of a multi-scale spatial bias to adjust features in multiple layers of the Uformer decoder. Our modulator demonstrates superior capability for restoring details for various image restoration tasks while introducing marginal extra parameters and computational cost. Powered by these two designs, Uformer enjoys a high capability for capturing both local and global dependencies for image restoration. To evaluate our approach, extensive experiments are conducted on several image restoration tasks, including image denoising, motion deblurring, defocus deblurring and deraining. Without bells and whistles, our Uformer achieves superior or comparable performance compared with the state-of-the-art algorithms. The code and models are available at https://github.com/ZhendongWang6/Uformer.

Results

TaskDatasetMetricValueModel
DeblurringGoProPSNR32.97Uformer-B
DeblurringGoProSSIM0.967Uformer-B
DeblurringRealBlur-R (trained on GoPro)PSNR (sRGB)36.22Uformer-B
DeblurringRealBlur-R (trained on GoPro)SSIM (sRGB)0.957Uformer-B
DeblurringRealBlur-J (trained on GoPro)PSNR (sRGB)29.06Uformer-B
DeblurringRealBlur-J (trained on GoPro)SSIM (sRGB)0.884Uformer-B
DeblurringHIDE (trained on GOPRO)PSNR (sRGB)30.83Uformer-B
DeblurringHIDE (trained on GOPRO)Params (M)50.88Uformer-B
DeblurringHIDE (trained on GOPRO)SSIM (sRGB)0.952Uformer-B
DeblurringRSBlurAverage PSNR33.98Uformer-B
Image EnhancementTIP 2018PSNR29.28Uformer-B
Image EnhancementTIP 2018SSIM0.917Uformer-B
DehazingSOTS IndoorPSNR31.91Uformer
DehazingSOTS IndoorSSIM0.971Uformer
DehazingSOTS OutdoorPSNR26.52Uformer
DehazingSOTS OutdoorSSIM0.945Uformer
Image RestorationCSDAverage PSNR (dB)33.8UFormer
Image DehazingSOTS IndoorPSNR31.91Uformer
Image DehazingSOTS IndoorSSIM0.971Uformer
Image DehazingSOTS OutdoorPSNR26.52Uformer
Image DehazingSOTS OutdoorSSIM0.945Uformer
DenoisingSIDDPSNR (sRGB)39.89Uformer-B
DenoisingSIDDSSIM (sRGB)0.96Uformer-B
DenoisingDNDPSNR (sRGB)39.98Uformer-B
DenoisingDNDSSIM (sRGB)0.955Uformer-B
Image DenoisingSIDDPSNR (sRGB)39.89Uformer-B
Image DenoisingSIDDSSIM (sRGB)0.96Uformer-B
Image DenoisingDNDPSNR (sRGB)39.98Uformer-B
Image DenoisingDNDSSIM (sRGB)0.955Uformer-B
2D ClassificationGoProPSNR32.97Uformer-B
2D ClassificationGoProSSIM0.967Uformer-B
2D ClassificationRealBlur-R (trained on GoPro)PSNR (sRGB)36.22Uformer-B
2D ClassificationRealBlur-R (trained on GoPro)SSIM (sRGB)0.957Uformer-B
2D ClassificationRealBlur-J (trained on GoPro)PSNR (sRGB)29.06Uformer-B
2D ClassificationRealBlur-J (trained on GoPro)SSIM (sRGB)0.884Uformer-B
2D ClassificationHIDE (trained on GOPRO)PSNR (sRGB)30.83Uformer-B
2D ClassificationHIDE (trained on GOPRO)Params (M)50.88Uformer-B
2D ClassificationHIDE (trained on GOPRO)SSIM (sRGB)0.952Uformer-B
2D ClassificationRSBlurAverage PSNR33.98Uformer-B
Image DeblurringGoProPSNR32.97Uformer-B
Image DeblurringGoProParams (M)50.88Uformer-B
Image DeblurringGoProSSIM0.967Uformer-B
3D ArchitectureSIDDPSNR (sRGB)39.89Uformer-B
3D ArchitectureSIDDSSIM (sRGB)0.96Uformer-B
3D ArchitectureDNDPSNR (sRGB)39.98Uformer-B
3D ArchitectureDNDSSIM (sRGB)0.955Uformer-B
10-shot image generationCSDAverage PSNR (dB)33.8UFormer
10-shot image generationGoProPSNR32.97Uformer-B
10-shot image generationGoProSSIM0.967Uformer-B
10-shot image generationRealBlur-R (trained on GoPro)PSNR (sRGB)36.22Uformer-B
10-shot image generationRealBlur-R (trained on GoPro)SSIM (sRGB)0.957Uformer-B
10-shot image generationRealBlur-J (trained on GoPro)PSNR (sRGB)29.06Uformer-B
10-shot image generationRealBlur-J (trained on GoPro)SSIM (sRGB)0.884Uformer-B
10-shot image generationHIDE (trained on GOPRO)PSNR (sRGB)30.83Uformer-B
10-shot image generationHIDE (trained on GOPRO)Params (M)50.88Uformer-B
10-shot image generationHIDE (trained on GOPRO)SSIM (sRGB)0.952Uformer-B
10-shot image generationRSBlurAverage PSNR33.98Uformer-B
10-shot image generationGoProPSNR32.97Uformer-B
10-shot image generationGoProParams (M)50.88Uformer-B
10-shot image generationGoProSSIM0.967Uformer-B
1 Image, 2*2 StitchiGoProPSNR32.97Uformer-B
1 Image, 2*2 StitchiGoProParams (M)50.88Uformer-B
1 Image, 2*2 StitchiGoProSSIM0.967Uformer-B
16kGoProPSNR32.97Uformer-B
16kGoProParams (M)50.88Uformer-B
16kGoProSSIM0.967Uformer-B
Blind Image DeblurringGoProPSNR32.97Uformer-B
Blind Image DeblurringGoProSSIM0.967Uformer-B
Blind Image DeblurringRealBlur-R (trained on GoPro)PSNR (sRGB)36.22Uformer-B
Blind Image DeblurringRealBlur-R (trained on GoPro)SSIM (sRGB)0.957Uformer-B
Blind Image DeblurringRealBlur-J (trained on GoPro)PSNR (sRGB)29.06Uformer-B
Blind Image DeblurringRealBlur-J (trained on GoPro)SSIM (sRGB)0.884Uformer-B
Blind Image DeblurringHIDE (trained on GOPRO)PSNR (sRGB)30.83Uformer-B
Blind Image DeblurringHIDE (trained on GOPRO)Params (M)50.88Uformer-B
Blind Image DeblurringHIDE (trained on GOPRO)SSIM (sRGB)0.952Uformer-B
Blind Image DeblurringRSBlurAverage PSNR33.98Uformer-B

Related Papers

fastWDM3D: Fast and Accurate 3D Healthy Tissue Inpainting2025-07-17Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models2025-07-17Similarity-Guided Diffusion for Contrastive Sequential Recommendation2025-07-16Unsupervised Part Discovery via Descriptor-Based Masked Image Restoration with Optimized Constraints2025-07-16HUG-VAS: A Hierarchical NURBS-Based Generative Model for Aortic Geometry Synthesis and Controllable Editing2025-07-15AirLLM: Diffusion Policy-based Adaptive LoRA for Remote Fine-Tuning of LLM over the Air2025-07-15Generative Latent Kernel Modeling for Blind Motion Deblurring2025-07-12A statistical physics framework for optimal learning2025-07-10