TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Multi-image Super Resolution of Remotely Sensed Images usi...

Multi-image Super Resolution of Remotely Sensed Images using Residual Feature Attention Deep Neural Networks

Francesco Salvetti, Vittorio Mazzia, Aleem Khaliq, Marcello Chiaberge

2020-07-06Super-ResolutionMulti-Frame Super-ResolutionRepresentation LearningVideo Super-ResolutionImage Super-Resolution
PaperPDFCode(official)Code

Abstract

Convolutional Neural Networks (CNNs) have been consistently proved state-of-the-art results in image Super-Resolution (SR), representing an exceptional opportunity for the remote sensing field to extract further information and knowledge from captured data. However, most of the works published in the literature have been focusing on the Single-Image Super-Resolution problem so far. At present, satellite based remote sensing platforms offer huge data availability with high temporal resolution and low spatial resolution. In this context, the presented research proposes a novel residual attention model (RAMS) that efficiently tackles the multi-image super-resolution task, simultaneously exploiting spatial and temporal correlations to combine multiple images. We introduce the mechanism of visual feature attention with 3D convolutions in order to obtain an aware data fusion and information extraction of the multiple low-resolution images, transcending limitations of the local region of convolutional operations. Moreover, having multiple inputs with the same scene, our representation learning network makes extensive use of nestled residual connections to let flow redundant low-frequency signals and focus the computation on more important high-frequency components. Extensive experimentation and evaluations against other available solutions, either for single or multi-image super-resolution, have demonstrated that the proposed deep learning-based solution can be considered state-of-the-art for Multi-Image Super-Resolution for remote sensing applications.

Results

TaskDatasetMetricValueModel
Super-ResolutionEPFL NIR-VISSSIM0.9875RAMS (ours)
Super-ResolutionPROBA-VNormalized cPSNR0.9336790819983855RAMS
Super-ResolutionUltra Video Group HD - 4x upscalingAverage PSNR48.23RAMS (ours)
Super-ResolutionUltra Video Group HD - 4x upscalingAverage PSNR47.84DeepSUM[41]
3D Human Pose EstimationUltra Video Group HD - 4x upscalingAverage PSNR48.23RAMS (ours)
3D Human Pose EstimationUltra Video Group HD - 4x upscalingAverage PSNR47.84DeepSUM[41]
VideoUltra Video Group HD - 4x upscalingAverage PSNR48.23RAMS (ours)
VideoUltra Video Group HD - 4x upscalingAverage PSNR47.84DeepSUM[41]
Pose EstimationUltra Video Group HD - 4x upscalingAverage PSNR48.23RAMS (ours)
Pose EstimationUltra Video Group HD - 4x upscalingAverage PSNR47.84DeepSUM[41]
3DUltra Video Group HD - 4x upscalingAverage PSNR48.23RAMS (ours)
3DUltra Video Group HD - 4x upscalingAverage PSNR47.84DeepSUM[41]
3D Face AnimationUltra Video Group HD - 4x upscalingAverage PSNR48.23RAMS (ours)
3D Face AnimationUltra Video Group HD - 4x upscalingAverage PSNR47.84DeepSUM[41]
Image Super-ResolutionEPFL NIR-VISSSIM0.9875RAMS (ours)
Image Super-ResolutionPROBA-VNormalized cPSNR0.9336790819983855RAMS
2D Human Pose EstimationUltra Video Group HD - 4x upscalingAverage PSNR48.23RAMS (ours)
2D Human Pose EstimationUltra Video Group HD - 4x upscalingAverage PSNR47.84DeepSUM[41]
3D Absolute Human Pose EstimationUltra Video Group HD - 4x upscalingAverage PSNR48.23RAMS (ours)
3D Absolute Human Pose EstimationUltra Video Group HD - 4x upscalingAverage PSNR47.84DeepSUM[41]
Video Super-ResolutionUltra Video Group HD - 4x upscalingAverage PSNR48.23RAMS (ours)
Video Super-ResolutionUltra Video Group HD - 4x upscalingAverage PSNR47.84DeepSUM[41]
3D Object Super-ResolutionEPFL NIR-VISSSIM0.9875RAMS (ours)
3D Object Super-ResolutionPROBA-VNormalized cPSNR0.9336790819983855RAMS
3D Object Super-ResolutionUltra Video Group HD - 4x upscalingAverage PSNR48.23RAMS (ours)
3D Object Super-ResolutionUltra Video Group HD - 4x upscalingAverage PSNR47.84DeepSUM[41]
1 Image, 2*2 StitchiUltra Video Group HD - 4x upscalingAverage PSNR48.23RAMS (ours)
1 Image, 2*2 StitchiUltra Video Group HD - 4x upscalingAverage PSNR47.84DeepSUM[41]
16kEPFL NIR-VISSSIM0.9875RAMS (ours)
16kPROBA-VNormalized cPSNR0.9336790819983855RAMS

Related Papers

Touch in the Wild: Learning Fine-Grained Manipulation with a Portable Visuo-Tactile Gripper2025-07-20SpectraLift: Physics-Guided Spectral-Inversion Network for Self-Supervised Hyperspectral Image Super-Resolution2025-07-17Spectral Bellman Method: Unifying Representation and Exploration in RL2025-07-17Boosting Team Modeling through Tempo-Relational Representation Learning2025-07-17Similarity-Guided Diffusion for Contrastive Sequential Recommendation2025-07-16Are encoders able to learn landmarkers for warm-starting of Hyperparameter Optimization?2025-07-16Language-Guided Contrastive Audio-Visual Masked Autoencoder with Automatically Generated Audio-Visual-Text Triplets from Videos2025-07-16A Mixed-Primitive-based Gaussian Splatting Method for Surface Reconstruction2025-07-15