TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Swin2-MoSE: A New Single Image Super-Resolution Model for ...

Swin2-MoSE: A New Single Image Super-Resolution Model for Remote Sensing

Leonardo Rossi, Vittorio Bernuzzi, Tomaso Fontanini, Massimo Bertozzi, Andrea Prati

2024-04-29Super-ResolutionMultispectral Image Super-resolutionSSIMImage Super-ResolutionSemantic Segmentation
PaperPDFCodeCode(official)

Abstract

Due to the limitations of current optical and sensor technologies and the high cost of updating them, the spectral and spatial resolution of satellites may not always meet desired requirements. For these reasons, Remote-Sensing Single-Image Super-Resolution (RS-SISR) techniques have gained significant interest. In this paper, we propose Swin2-MoSE model, an enhanced version of Swin2SR. Our model introduces MoE-SM, an enhanced Mixture-of-Experts (MoE) to replace the Feed-Forward inside all Transformer block. MoE-SM is designed with Smart-Merger, and new layer for merging the output of individual experts, and with a new way to split the work between experts, defining a new per-example strategy instead of the commonly used per-token one. Furthermore, we analyze how positional encodings interact with each other, demonstrating that per-channel bias and per-head bias can positively cooperate. Finally, we propose to use a combination of Normalized-Cross-Correlation (NCC) and Structural Similarity Index Measure (SSIM) losses, to avoid typical MSE loss limitations. Experimental results demonstrate that Swin2-MoSE outperforms any Swin derived models by up to 0.377 - 0.958 dB (PSNR) on task of 2x, 3x and 4x resolution-upscaling (Sen2Venus and OLI2MSI datasets). It also outperforms SOTA models by a good margin, proving to be competitive and with excellent potential, especially for complex tasks. Additionally, an analysis of computational costs is also performed. Finally, we show the efficacy of Swin2-MoSE, applying it to a semantic segmentation task (SeasoNet dataset). Code and pretrained are available on https://github.com/IMPLabUniPr/swin2-mose/tree/official_code

Results

TaskDatasetMetricValueModel
Super-ResolutionSen2venus - 2x upscalingPSNR49.4784Swin2SR-MoSE
Super-ResolutionSen2venus - 2x upscalingSSIM0.9948Swin2SR-MoSE
Super-ResolutionSen2venus - 4x upscalingPSNR45.9272Swin2SR-MoSE
Super-ResolutionSen2venus - 4x upscalingSSIM0.9849Swin2SR-MoSE
Super-ResolutionOLI2MSI - 3x upscalingPSNR45.9194Swin2SR-MoSE
Super-ResolutionOLI2MSI - 3x upscalingSSIM0.9912Swin2SR-MoSE
Image Super-ResolutionSen2venus - 2x upscalingPSNR49.4784Swin2SR-MoSE
Image Super-ResolutionSen2venus - 2x upscalingSSIM0.9948Swin2SR-MoSE
Image Super-ResolutionSen2venus - 4x upscalingPSNR45.9272Swin2SR-MoSE
Image Super-ResolutionSen2venus - 4x upscalingSSIM0.9849Swin2SR-MoSE
Image Super-ResolutionOLI2MSI - 3x upscalingPSNR45.9194Swin2SR-MoSE
Image Super-ResolutionOLI2MSI - 3x upscalingSSIM0.9912Swin2SR-MoSE
3D Object Super-ResolutionSen2venus - 2x upscalingPSNR49.4784Swin2SR-MoSE
3D Object Super-ResolutionSen2venus - 2x upscalingSSIM0.9948Swin2SR-MoSE
3D Object Super-ResolutionSen2venus - 4x upscalingPSNR45.9272Swin2SR-MoSE
3D Object Super-ResolutionSen2venus - 4x upscalingSSIM0.9849Swin2SR-MoSE
3D Object Super-ResolutionOLI2MSI - 3x upscalingPSNR45.9194Swin2SR-MoSE
3D Object Super-ResolutionOLI2MSI - 3x upscalingSSIM0.9912Swin2SR-MoSE
16kSen2venus - 2x upscalingPSNR49.4784Swin2SR-MoSE
16kSen2venus - 2x upscalingSSIM0.9948Swin2SR-MoSE
16kSen2venus - 4x upscalingPSNR45.9272Swin2SR-MoSE
16kSen2venus - 4x upscalingSSIM0.9849Swin2SR-MoSE
16kOLI2MSI - 3x upscalingPSNR45.9194Swin2SR-MoSE
16kOLI2MSI - 3x upscalingSSIM0.9912Swin2SR-MoSE

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21SpectraLift: Physics-Guided Spectral-Inversion Network for Self-Supervised Hyperspectral Image Super-Resolution2025-07-17fastWDM3D: Fast and Accurate 3D Healthy Tissue Inpainting2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17SAMST: A Transformer framework based on SAM pseudo label filtering for remote sensing semi-supervised semantic segmentation2025-07-16