TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/MAXIM: Multi-Axis MLP for Image Processing

MAXIM: Multi-Axis MLP for Image Processing

Zhengzhong Tu, Hossein Talebi, Han Zhang, Feng Yang, Peyman Milanfar, Alan Bovik, Yinxiao Li

2022-01-09CVPR 2022 1DenoisingDeblurringImage DenoisingImage DeblurringPhoto RetouchingRain RemovalImage DehazingImage RestorationSingle Image DerainingLow-Light Image Enhancement
PaperPDFCode(official)CodeCode

Abstract

Recent progress on Transformers and multi-layer perceptron (MLP) models provide new network architectural designs for computer vision tasks. Although these models proved to be effective in many vision tasks such as image recognition, there remain challenges in adapting them for low-level vision. The inflexibility to support high-resolution images and limitations of local attention are perhaps the main bottlenecks. In this work, we present a multi-axis MLP based architecture called MAXIM, that can serve as an efficient and flexible general-purpose vision backbone for image processing tasks. MAXIM uses a UNet-shaped hierarchical structure and supports long-range interactions enabled by spatially-gated MLPs. Specifically, MAXIM contains two MLP-based building blocks: a multi-axis gated MLP that allows for efficient and scalable spatial mixing of local and global visual cues, and a cross-gating block, an alternative to cross-attention, which accounts for cross-feature conditioning. Both these modules are exclusively based on MLPs, but also benefit from being both global and `fully-convolutional', two properties that are desirable for image processing. Our extensive experimental results show that the proposed MAXIM model achieves state-of-the-art performance on more than ten benchmarks across a range of image processing tasks, including denoising, deblurring, deraining, dehazing, and enhancement while requiring fewer or comparable numbers of parameters and FLOPs than competitive models. The source code and trained models will be available at \url{https://github.com/google-research/maxim}.

Results

TaskDatasetMetricValueModel
DeblurringRealBlur-JPSNR (sRGB)32.84MAXIM
DeblurringRealBlur-JParams(M)22.2MAXIM
DeblurringRealBlur-JSSIM (sRGB)0.935MAXIM
DeblurringBASEDPSNR30.65728MAXIM (REDS)
DeblurringRealBlur-RPSNR (sRGB)39.45MAXIM
DeblurringRealBlur-RSSIM (sRGB)0.961MAXIM-3S
DeblurringGoProPSNR32.86MAXIM-3S
DeblurringRealBlur-R (trained on GoPro)PSNR (sRGB)35.78MAXIM
DeblurringMSU BASEDERQAv2.00.74277MAXIM (REDS)
DeblurringMSU BASEDLPIPS0.07836MAXIM (REDS)
DeblurringMSU BASEDSSIM0.94959MAXIM (REDS)
DeblurringMSU BASEDSubjective1.0081MAXIM (REDS)
DeblurringMSU BASEDVMAF67.3502MAXIM (REDS)
DeblurringMSU BASEDLPIPS0.09188MAXIM (GoPro)
DeblurringMSU BASEDPSNR31.36344MAXIM (GoPro)
DeblurringMSU BASEDSSIM0.94386MAXIM (GoPro)
DeblurringMSU BASEDSubjective0.207MAXIM (GoPro)
DeblurringMSU BASEDVMAF67.7557MAXIM (GoPro)
DeblurringHIDEPSNR32.83MAXIM-3S
DeblurringRealBlur-J (trained on GoPro)PSNR (sRGB)28.83MAXIM
DeblurringRealBlur-J (trained on GoPro)SSIM (sRGB)0.875MAXIM
DeblurringHIDE (trained on GOPRO)PSNR (sRGB)32.83MAXIM
DeblurringHIDE (trained on GOPRO)Params (M)22.2MAXIM
DeblurringHIDE (trained on GOPRO)SSIM (sRGB)0.956MAXIM
Image EnhancementLOLAverage PSNR23.43MAXIM
Image EnhancementLOLSSIM0.863MAXIM
Rain RemovalTest1200SSIM0.922MAXIM
Rain RemovalRain100HSSIM0.903MAXIM
Rain RemovalTest2800PSNR33.8MAXIM
Rain RemovalTest100PSNR31.17MAXIM
Rain RemovalTest100SSIM0.922MAXIM
Rain RemovalRain100LSSIM0.977MAXIM
DehazingSOTS IndoorPSNR38.11MAXIM-2S
DehazingSOTS OutdoorPSNR34.19MAXIM-2S
Image DehazingSOTS IndoorPSNR38.11MAXIM-2S
Image DehazingSOTS OutdoorPSNR34.19MAXIM-2S
DenoisingSIDDPSNR (sRGB)39.96MAXIM-3S
DenoisingSIDDSSIM (sRGB)0.96MAXIM-3S
DenoisingDNDPSNR (sRGB)39.84MAXIM-3S
DenoisingDNDSSIM (sRGB)0.954MAXIM-3S
Image DenoisingSIDDPSNR (sRGB)39.96MAXIM-3S
Image DenoisingSIDDSSIM (sRGB)0.96MAXIM-3S
Image DenoisingDNDPSNR (sRGB)39.84MAXIM-3S
Image DenoisingDNDSSIM (sRGB)0.954MAXIM-3S
2D ClassificationRealBlur-JPSNR (sRGB)32.84MAXIM
2D ClassificationRealBlur-JParams(M)22.2MAXIM
2D ClassificationRealBlur-JSSIM (sRGB)0.935MAXIM
2D ClassificationBASEDPSNR30.65728MAXIM (REDS)
2D ClassificationRealBlur-RPSNR (sRGB)39.45MAXIM
2D ClassificationRealBlur-RSSIM (sRGB)0.961MAXIM-3S
2D ClassificationGoProPSNR32.86MAXIM-3S
2D ClassificationRealBlur-R (trained on GoPro)PSNR (sRGB)35.78MAXIM
2D ClassificationMSU BASEDERQAv2.00.74277MAXIM (REDS)
2D ClassificationMSU BASEDLPIPS0.07836MAXIM (REDS)
2D ClassificationMSU BASEDSSIM0.94959MAXIM (REDS)
2D ClassificationMSU BASEDSubjective1.0081MAXIM (REDS)
2D ClassificationMSU BASEDVMAF67.3502MAXIM (REDS)
2D ClassificationMSU BASEDLPIPS0.09188MAXIM (GoPro)
2D ClassificationMSU BASEDPSNR31.36344MAXIM (GoPro)
2D ClassificationMSU BASEDSSIM0.94386MAXIM (GoPro)
2D ClassificationMSU BASEDSubjective0.207MAXIM (GoPro)
2D ClassificationMSU BASEDVMAF67.7557MAXIM (GoPro)
2D ClassificationHIDEPSNR32.83MAXIM-3S
2D ClassificationRealBlur-J (trained on GoPro)PSNR (sRGB)28.83MAXIM
2D ClassificationRealBlur-J (trained on GoPro)SSIM (sRGB)0.875MAXIM
2D ClassificationHIDE (trained on GOPRO)PSNR (sRGB)32.83MAXIM
2D ClassificationHIDE (trained on GOPRO)Params (M)22.2MAXIM
2D ClassificationHIDE (trained on GOPRO)SSIM (sRGB)0.956MAXIM
Photo RetouchingMIT-Adobe 5kPSNR26.15MAXIM
Photo RetouchingMIT-Adobe 5kSSIM0.945MAXIM
Image DeblurringHIDESSIM0.956MAXIM-3S
Image DeblurringGoProPSNR32.86MAXIM-3S
3D ArchitectureSIDDPSNR (sRGB)39.96MAXIM-3S
3D ArchitectureSIDDSSIM (sRGB)0.96MAXIM-3S
3D ArchitectureDNDPSNR (sRGB)39.84MAXIM-3S
3D ArchitectureDNDSSIM (sRGB)0.954MAXIM-3S
10-shot image generationRealBlur-JPSNR (sRGB)32.84MAXIM
10-shot image generationRealBlur-JParams(M)22.2MAXIM
10-shot image generationRealBlur-JSSIM (sRGB)0.935MAXIM
10-shot image generationBASEDPSNR30.65728MAXIM (REDS)
10-shot image generationRealBlur-RPSNR (sRGB)39.45MAXIM
10-shot image generationRealBlur-RSSIM (sRGB)0.961MAXIM-3S
10-shot image generationGoProPSNR32.86MAXIM-3S
10-shot image generationRealBlur-R (trained on GoPro)PSNR (sRGB)35.78MAXIM
10-shot image generationMSU BASEDERQAv2.00.74277MAXIM (REDS)
10-shot image generationMSU BASEDLPIPS0.07836MAXIM (REDS)
10-shot image generationMSU BASEDSSIM0.94959MAXIM (REDS)
10-shot image generationMSU BASEDSubjective1.0081MAXIM (REDS)
10-shot image generationMSU BASEDVMAF67.3502MAXIM (REDS)
10-shot image generationMSU BASEDLPIPS0.09188MAXIM (GoPro)
10-shot image generationMSU BASEDPSNR31.36344MAXIM (GoPro)
10-shot image generationMSU BASEDSSIM0.94386MAXIM (GoPro)
10-shot image generationMSU BASEDSubjective0.207MAXIM (GoPro)
10-shot image generationMSU BASEDVMAF67.7557MAXIM (GoPro)
10-shot image generationHIDEPSNR32.83MAXIM-3S
10-shot image generationRealBlur-J (trained on GoPro)PSNR (sRGB)28.83MAXIM
10-shot image generationRealBlur-J (trained on GoPro)SSIM (sRGB)0.875MAXIM
10-shot image generationHIDE (trained on GOPRO)PSNR (sRGB)32.83MAXIM
10-shot image generationHIDE (trained on GOPRO)Params (M)22.2MAXIM
10-shot image generationHIDE (trained on GOPRO)SSIM (sRGB)0.956MAXIM
10-shot image generationHIDESSIM0.956MAXIM-3S
10-shot image generationGoProPSNR32.86MAXIM-3S
1 Image, 2*2 StitchiHIDESSIM0.956MAXIM-3S
1 Image, 2*2 StitchiGoProPSNR32.86MAXIM-3S
16kHIDESSIM0.956MAXIM-3S
16kGoProPSNR32.86MAXIM-3S
Blind Image DeblurringRealBlur-JPSNR (sRGB)32.84MAXIM
Blind Image DeblurringRealBlur-JParams(M)22.2MAXIM
Blind Image DeblurringRealBlur-JSSIM (sRGB)0.935MAXIM
Blind Image DeblurringBASEDPSNR30.65728MAXIM (REDS)
Blind Image DeblurringRealBlur-RPSNR (sRGB)39.45MAXIM
Blind Image DeblurringRealBlur-RSSIM (sRGB)0.961MAXIM-3S
Blind Image DeblurringGoProPSNR32.86MAXIM-3S
Blind Image DeblurringRealBlur-R (trained on GoPro)PSNR (sRGB)35.78MAXIM
Blind Image DeblurringMSU BASEDERQAv2.00.74277MAXIM (REDS)
Blind Image DeblurringMSU BASEDLPIPS0.07836MAXIM (REDS)
Blind Image DeblurringMSU BASEDSSIM0.94959MAXIM (REDS)
Blind Image DeblurringMSU BASEDSubjective1.0081MAXIM (REDS)
Blind Image DeblurringMSU BASEDVMAF67.3502MAXIM (REDS)
Blind Image DeblurringMSU BASEDLPIPS0.09188MAXIM (GoPro)
Blind Image DeblurringMSU BASEDPSNR31.36344MAXIM (GoPro)
Blind Image DeblurringMSU BASEDSSIM0.94386MAXIM (GoPro)
Blind Image DeblurringMSU BASEDSubjective0.207MAXIM (GoPro)
Blind Image DeblurringMSU BASEDVMAF67.7557MAXIM (GoPro)
Blind Image DeblurringHIDEPSNR32.83MAXIM-3S
Blind Image DeblurringRealBlur-J (trained on GoPro)PSNR (sRGB)28.83MAXIM
Blind Image DeblurringRealBlur-J (trained on GoPro)SSIM (sRGB)0.875MAXIM
Blind Image DeblurringHIDE (trained on GOPRO)PSNR (sRGB)32.83MAXIM
Blind Image DeblurringHIDE (trained on GOPRO)Params (M)22.2MAXIM
Blind Image DeblurringHIDE (trained on GOPRO)SSIM (sRGB)0.956MAXIM

Related Papers

fastWDM3D: Fast and Accurate 3D Healthy Tissue Inpainting2025-07-17Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models2025-07-17Similarity-Guided Diffusion for Contrastive Sequential Recommendation2025-07-16Unsupervised Part Discovery via Descriptor-Based Masked Image Restoration with Optimized Constraints2025-07-16HUG-VAS: A Hierarchical NURBS-Based Generative Model for Aortic Geometry Synthesis and Controllable Editing2025-07-15AirLLM: Diffusion Policy-based Adaptive LoRA for Remote Fine-Tuning of LLM over the Air2025-07-15Generative Latent Kernel Modeling for Blind Motion Deblurring2025-07-12A statistical physics framework for optimal learning2025-07-10