TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Recursive Generalization Transformer for Image Super-Resol...

Recursive Generalization Transformer for Image Super-Resolution

Zheng Chen, Yulun Zhang, Jinjin Gu, Linghe Kong, Xiaokang Yang

2023-03-11Super-ResolutionImage ReconstructionImage Super-Resolution
PaperPDFCode(official)

Abstract

Transformer architectures have exhibited remarkable performance in image super-resolution (SR). Since the quadratic computational complexity of the self-attention (SA) in Transformer, existing methods tend to adopt SA in a local region to reduce overheads. However, the local design restricts the global context exploitation, which is crucial for accurate image reconstruction. In this work, we propose the Recursive Generalization Transformer (RGT) for image SR, which can capture global spatial information and is suitable for high-resolution images. Specifically, we propose the recursive-generalization self-attention (RG-SA). It recursively aggregates input features into representative feature maps, and then utilizes cross-attention to extract global information. Meanwhile, the channel dimensions of attention matrices (query, key, and value) are further scaled to mitigate the redundancy in the channel domain. Furthermore, we combine the RG-SA with local self-attention to enhance the exploitation of the global context, and propose the hybrid adaptive integration (HAI) for module integration. The HAI allows the direct and effective fusion between features at different levels (local or global). Extensive experiments demonstrate that our RGT outperforms recent state-of-the-art methods quantitatively and qualitatively. Code and pre-trained models are available at https://github.com/zhengchen1999/RGT.

Results

TaskDatasetMetricValueModel
Super-ResolutionSet14 - 4x upscalingPSNR29.28RGT+
Super-ResolutionSet14 - 4x upscalingSSIM0.7979RGT+
Super-ResolutionSet14 - 4x upscalingPSNR29.23RGT
Super-ResolutionSet14 - 4x upscalingSSIM0.7972RGT
Super-ResolutionManga109 - 4x upscalingPSNR32.68RGT+
Super-ResolutionManga109 - 4x upscalingSSIM0.9303RGT+
Super-ResolutionManga109 - 4x upscalingPSNR32.5RGT
Super-ResolutionManga109 - 4x upscalingSSIM0.9291RGT
Image Super-ResolutionSet14 - 4x upscalingPSNR29.28RGT+
Image Super-ResolutionSet14 - 4x upscalingSSIM0.7979RGT+
Image Super-ResolutionSet14 - 4x upscalingPSNR29.23RGT
Image Super-ResolutionSet14 - 4x upscalingSSIM0.7972RGT
Image Super-ResolutionManga109 - 4x upscalingPSNR32.68RGT+
Image Super-ResolutionManga109 - 4x upscalingSSIM0.9303RGT+
Image Super-ResolutionManga109 - 4x upscalingPSNR32.5RGT
Image Super-ResolutionManga109 - 4x upscalingSSIM0.9291RGT
3D Object Super-ResolutionSet14 - 4x upscalingPSNR29.28RGT+
3D Object Super-ResolutionSet14 - 4x upscalingSSIM0.7979RGT+
3D Object Super-ResolutionSet14 - 4x upscalingPSNR29.23RGT
3D Object Super-ResolutionSet14 - 4x upscalingSSIM0.7972RGT
3D Object Super-ResolutionManga109 - 4x upscalingPSNR32.68RGT+
3D Object Super-ResolutionManga109 - 4x upscalingSSIM0.9303RGT+
3D Object Super-ResolutionManga109 - 4x upscalingPSNR32.5RGT
3D Object Super-ResolutionManga109 - 4x upscalingSSIM0.9291RGT
16kSet14 - 4x upscalingPSNR29.28RGT+
16kSet14 - 4x upscalingSSIM0.7979RGT+
16kSet14 - 4x upscalingPSNR29.23RGT
16kSet14 - 4x upscalingSSIM0.7972RGT
16kManga109 - 4x upscalingPSNR32.68RGT+
16kManga109 - 4x upscalingSSIM0.9303RGT+
16kManga109 - 4x upscalingPSNR32.5RGT
16kManga109 - 4x upscalingSSIM0.9291RGT

Related Papers

SpectraLift: Physics-Guided Spectral-Inversion Network for Self-Supervised Hyperspectral Image Super-Resolution2025-07-17The model is the message: Lightweight convolutional autoencoders applied to noisy imaging data for planetary science and astrobiology2025-07-153D Magnetic Inverse Routine for Single-Segment Magnetic Field Images2025-07-15IM-LUT: Interpolation Mixing Look-Up Tables for Image Super-Resolution2025-07-14MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tokenizer with Multi-group Quantization2025-07-14PanoDiff-SR: Synthesizing Dental Panoramic Radiographs using Diffusion and Super-resolution2025-07-12Vision Foundation Models as Effective Visual Tokenizers for Autoregressive Image Generation2025-07-11HNOSeg-XS: Extremely Small Hartley Neural Operator for Efficient and Resolution-Robust 3D Image Segmentation2025-07-10