Multi-image Super Resolution of Remotely Sensed Images using Residual Feature Attention Deep Neural Networks

Francesco Salvetti, Vittorio Mazzia, Aleem Khaliq, Marcello Chiaberge

2020-07-06

Super-Resolution, Multi-Frame Super-Resolution, Representation Learning, Video Super-Resolution, Image Super-Resolution


Abstract

Convolutional Neural Networks (CNNs) have consistently achieved state-of-the-art results in image Super-Resolution (SR), representing an exceptional opportunity for the remote sensing field to extract further information and knowledge from captured data. However, most works published in the literature so far have focused on the Single-Image Super-Resolution problem. At present, satellite-based remote sensing platforms offer huge data availability with high temporal resolution and low spatial resolution. In this context, the presented research proposes a novel residual attention model (RAMS) that efficiently tackles the multi-image super-resolution task, simultaneously exploiting spatial and temporal correlations to combine multiple images. We introduce the mechanism of visual feature attention with 3D convolutions in order to obtain an aware data fusion and information extraction of the multiple low-resolution images, transcending the limitations of the local region of convolutional operations. Moreover, having multiple inputs of the same scene, our representation learning network makes extensive use of nested residual connections to let redundant low-frequency signals flow through and to focus the computation on the more important high-frequency components. Extensive experimentation and evaluation against other available solutions, for either single- or multi-image super-resolution, have demonstrated that the proposed deep learning-based solution can be considered state-of-the-art for Multi-Image Super-Resolution in remote sensing applications.
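To make the idea of feature attention combined with a residual connection more concrete, the following is a minimal NumPy sketch and not the authors' implementation. It assumes a squeeze-and-excitation style of channel attention over a stack of temporally aligned feature maps; the function names, the tensor layout `(T, H, W, C)`, the reduction ratio, and the choice of 9 input views are all illustrative assumptions. The 3D convolutions mentioned in the abstract are omitted for brevity, so only the attention and skip-connection parts are shown.

```python
import numpy as np


def sigmoid(x):
    """Logistic function, maps any real value into (0, 1)."""
    return 1.0 / (1.0 + np.exp(-x))


def feature_attention(features, w_down, w_up):
    """Rescale each feature channel by a learned weight in (0, 1).

    features: array of shape (T, H, W, C), T low-resolution views (assumed layout).
    w_down:   array of shape (C, C // r), hypothetical bottleneck projection.
    w_up:     array of shape (C // r, C), hypothetical expansion projection.
    """
    # Squeeze: one global descriptor per channel, averaged over time and space.
    descriptor = features.mean(axis=(0, 1, 2))        # shape (C,)
    # Excite: two pointwise projections with a ReLU in between.
    hidden = np.maximum(descriptor @ w_down, 0.0)     # shape (C // r,)
    weights = sigmoid(hidden @ w_up)                  # shape (C,)
    # Broadcast the per-channel weights over the T, H and W axes.
    return features * weights


def residual_feature_attention_block(features, w_down, w_up):
    """Add the attended features back onto the input (skip connection)."""
    return features + feature_attention(features, w_down, w_up)


# Usage: 9 views of a 32x32 patch with 16 feature channels (illustrative sizes).
rng = np.random.default_rng(0)
x = rng.standard_normal((9, 32, 32, 16))
w_down = rng.standard_normal((16, 4)) * 0.1   # reduction ratio r = 4 (assumed)
w_up = rng.standard_normal((4, 16)) * 0.1
out = residual_feature_attention_block(x, w_down, w_up)
print(out.shape)  # (9, 32, 32, 16)
```

The skip connection lets the input pass through unchanged, so the attention branch only has to model what should be added on top of it. In a trained network the two projection matrices would be learned rather than drawn at random.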

Results

Task	Dataset	Metric	Value	Model
Super-Resolution	EPFL NIR-VIS	SSIM	0.9875	RAMS (ours)
Super-Resolution	PROBA-V	Normalized cPSNR	0.9336790819983855	RAMS
Super-Resolution	Ultra Video Group HD - 4x upscaling	Average PSNR	48.23	RAMS (ours)
Super-Resolution	Ultra Video Group HD - 4x upscaling	Average PSNR	47.84	DeepSUM[41]
Image Super-Resolution	EPFL NIR-VIS	SSIM	0.9875	RAMS (ours)
Image Super-Resolution	PROBA-V	Normalized cPSNR	0.9336790819983855	RAMS
Video Super-Resolution	Ultra Video Group HD - 4x upscaling	Average PSNR	48.23	RAMS (ours)
Video Super-Resolution	Ultra Video Group HD - 4x upscaling	Average PSNR	47.84	DeepSUM[41]
