TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Real-Time Video Super-Resolution with Spatio-Temporal Netw...

Real-Time Video Super-Resolution with Spatio-Temporal Networks and Motion Compensation

Jose Caballero, Christian Ledig, Andrew Aitken, Alejandro Acosta, Johannes Totz, Zehan Wang, Wenzhe Shi

2016-11-16CVPR 2017 7Motion CompensationVideo Super-Resolution
PaperPDF

Abstract

Convolutional neural networks have enabled accurate image super-resolution in real-time. However, recent attempts to benefit from temporal correlations in video super-resolution have been limited to naive or inefficient architectures. In this paper, we introduce spatio-temporal sub-pixel convolution networks that effectively exploit temporal redundancies and improve reconstruction accuracy while maintaining real-time speed. Specifically, we discuss the use of early fusion, slow fusion and 3D convolutions for the joint processing of multiple consecutive video frames. We also propose a novel joint motion compensation and video super-resolution algorithm that is orders of magnitude more efficient than competing methods, relying on a fast multi-resolution spatial transformer module that is end-to-end trainable. These contributions provide both higher accuracy and temporally more consistent videos, which we confirm qualitatively and quantitatively. Relative to single-frame models, spatio-temporal networks can either reduce the computational cost by 30% whilst maintaining the same quality or provide a 0.2dB gain for a similar computational cost. Results on publicly available datasets demonstrate that the proposed algorithms surpass current state-of-the-art performance in both accuracy and efficiency.

Results

TaskDatasetMetricValueModel
Super-ResolutionMSU Video Upscalers: Quality EnhancementPSNR26.92VESPCN
Super-ResolutionMSU Video Upscalers: Quality EnhancementSSIM0.932VESPCN
Super-ResolutionMSU Video Upscalers: Quality EnhancementVMAF53.96VESPCN
Super-ResolutionVid4 - 4x upscalingMOVIE5.82VESPCN
Super-ResolutionVid4 - 4x upscalingPSNR25.35VESPCN
Super-ResolutionVid4 - 4x upscalingSSIM0.7557VESPCN
Super-ResolutionVid4 - 4x upscalingMOVIE9.31bicubic
Super-ResolutionVid4 - 4x upscalingPSNR23.82bicubic
Super-ResolutionVid4 - 4x upscalingSSIM0.6548bicubic
3D Human Pose EstimationMSU Video Upscalers: Quality EnhancementPSNR26.92VESPCN
3D Human Pose EstimationMSU Video Upscalers: Quality EnhancementSSIM0.932VESPCN
3D Human Pose EstimationMSU Video Upscalers: Quality EnhancementVMAF53.96VESPCN
3D Human Pose EstimationVid4 - 4x upscalingMOVIE5.82VESPCN
3D Human Pose EstimationVid4 - 4x upscalingPSNR25.35VESPCN
3D Human Pose EstimationVid4 - 4x upscalingSSIM0.7557VESPCN
3D Human Pose EstimationVid4 - 4x upscalingMOVIE9.31bicubic
3D Human Pose EstimationVid4 - 4x upscalingPSNR23.82bicubic
3D Human Pose EstimationVid4 - 4x upscalingSSIM0.6548bicubic
VideoMSU Video Upscalers: Quality EnhancementPSNR26.92VESPCN
VideoMSU Video Upscalers: Quality EnhancementSSIM0.932VESPCN
VideoMSU Video Upscalers: Quality EnhancementVMAF53.96VESPCN
VideoVid4 - 4x upscalingMOVIE5.82VESPCN
VideoVid4 - 4x upscalingPSNR25.35VESPCN
VideoVid4 - 4x upscalingSSIM0.7557VESPCN
VideoVid4 - 4x upscalingMOVIE9.31bicubic
VideoVid4 - 4x upscalingPSNR23.82bicubic
VideoVid4 - 4x upscalingSSIM0.6548bicubic
Pose EstimationMSU Video Upscalers: Quality EnhancementPSNR26.92VESPCN
Pose EstimationMSU Video Upscalers: Quality EnhancementSSIM0.932VESPCN
Pose EstimationMSU Video Upscalers: Quality EnhancementVMAF53.96VESPCN
Pose EstimationVid4 - 4x upscalingMOVIE5.82VESPCN
Pose EstimationVid4 - 4x upscalingPSNR25.35VESPCN
Pose EstimationVid4 - 4x upscalingSSIM0.7557VESPCN
Pose EstimationVid4 - 4x upscalingMOVIE9.31bicubic
Pose EstimationVid4 - 4x upscalingPSNR23.82bicubic
Pose EstimationVid4 - 4x upscalingSSIM0.6548bicubic
3DMSU Video Upscalers: Quality EnhancementPSNR26.92VESPCN
3DMSU Video Upscalers: Quality EnhancementSSIM0.932VESPCN
3DMSU Video Upscalers: Quality EnhancementVMAF53.96VESPCN
3DVid4 - 4x upscalingMOVIE5.82VESPCN
3DVid4 - 4x upscalingPSNR25.35VESPCN
3DVid4 - 4x upscalingSSIM0.7557VESPCN
3DVid4 - 4x upscalingMOVIE9.31bicubic
3DVid4 - 4x upscalingPSNR23.82bicubic
3DVid4 - 4x upscalingSSIM0.6548bicubic
3D Face AnimationMSU Video Upscalers: Quality EnhancementPSNR26.92VESPCN
3D Face AnimationMSU Video Upscalers: Quality EnhancementSSIM0.932VESPCN
3D Face AnimationMSU Video Upscalers: Quality EnhancementVMAF53.96VESPCN
3D Face AnimationVid4 - 4x upscalingMOVIE5.82VESPCN
3D Face AnimationVid4 - 4x upscalingPSNR25.35VESPCN
3D Face AnimationVid4 - 4x upscalingSSIM0.7557VESPCN
3D Face AnimationVid4 - 4x upscalingMOVIE9.31bicubic
3D Face AnimationVid4 - 4x upscalingPSNR23.82bicubic
3D Face AnimationVid4 - 4x upscalingSSIM0.6548bicubic
2D Human Pose EstimationMSU Video Upscalers: Quality EnhancementPSNR26.92VESPCN
2D Human Pose EstimationMSU Video Upscalers: Quality EnhancementSSIM0.932VESPCN
2D Human Pose EstimationMSU Video Upscalers: Quality EnhancementVMAF53.96VESPCN
2D Human Pose EstimationVid4 - 4x upscalingMOVIE5.82VESPCN
2D Human Pose EstimationVid4 - 4x upscalingPSNR25.35VESPCN
2D Human Pose EstimationVid4 - 4x upscalingSSIM0.7557VESPCN
2D Human Pose EstimationVid4 - 4x upscalingMOVIE9.31bicubic
2D Human Pose EstimationVid4 - 4x upscalingPSNR23.82bicubic
2D Human Pose EstimationVid4 - 4x upscalingSSIM0.6548bicubic
3D Absolute Human Pose EstimationMSU Video Upscalers: Quality EnhancementPSNR26.92VESPCN
3D Absolute Human Pose EstimationMSU Video Upscalers: Quality EnhancementSSIM0.932VESPCN
3D Absolute Human Pose EstimationMSU Video Upscalers: Quality EnhancementVMAF53.96VESPCN
3D Absolute Human Pose EstimationVid4 - 4x upscalingMOVIE5.82VESPCN
3D Absolute Human Pose EstimationVid4 - 4x upscalingPSNR25.35VESPCN
3D Absolute Human Pose EstimationVid4 - 4x upscalingSSIM0.7557VESPCN
3D Absolute Human Pose EstimationVid4 - 4x upscalingMOVIE9.31bicubic
3D Absolute Human Pose EstimationVid4 - 4x upscalingPSNR23.82bicubic
3D Absolute Human Pose EstimationVid4 - 4x upscalingSSIM0.6548bicubic
Video Super-ResolutionMSU Video Upscalers: Quality EnhancementPSNR26.92VESPCN
Video Super-ResolutionMSU Video Upscalers: Quality EnhancementSSIM0.932VESPCN
Video Super-ResolutionMSU Video Upscalers: Quality EnhancementVMAF53.96VESPCN
Video Super-ResolutionVid4 - 4x upscalingMOVIE5.82VESPCN
Video Super-ResolutionVid4 - 4x upscalingPSNR25.35VESPCN
Video Super-ResolutionVid4 - 4x upscalingSSIM0.7557VESPCN
Video Super-ResolutionVid4 - 4x upscalingMOVIE9.31bicubic
Video Super-ResolutionVid4 - 4x upscalingPSNR23.82bicubic
Video Super-ResolutionVid4 - 4x upscalingSSIM0.6548bicubic
3D Object Super-ResolutionMSU Video Upscalers: Quality EnhancementPSNR26.92VESPCN
3D Object Super-ResolutionMSU Video Upscalers: Quality EnhancementSSIM0.932VESPCN
3D Object Super-ResolutionMSU Video Upscalers: Quality EnhancementVMAF53.96VESPCN
3D Object Super-ResolutionVid4 - 4x upscalingMOVIE5.82VESPCN
3D Object Super-ResolutionVid4 - 4x upscalingPSNR25.35VESPCN
3D Object Super-ResolutionVid4 - 4x upscalingSSIM0.7557VESPCN
3D Object Super-ResolutionVid4 - 4x upscalingMOVIE9.31bicubic
3D Object Super-ResolutionVid4 - 4x upscalingPSNR23.82bicubic
3D Object Super-ResolutionVid4 - 4x upscalingSSIM0.6548bicubic
1 Image, 2*2 StitchiMSU Video Upscalers: Quality EnhancementPSNR26.92VESPCN
1 Image, 2*2 StitchiMSU Video Upscalers: Quality EnhancementSSIM0.932VESPCN
1 Image, 2*2 StitchiMSU Video Upscalers: Quality EnhancementVMAF53.96VESPCN
1 Image, 2*2 StitchiVid4 - 4x upscalingMOVIE5.82VESPCN
1 Image, 2*2 StitchiVid4 - 4x upscalingPSNR25.35VESPCN
1 Image, 2*2 StitchiVid4 - 4x upscalingSSIM0.7557VESPCN
1 Image, 2*2 StitchiVid4 - 4x upscalingMOVIE9.31bicubic
1 Image, 2*2 StitchiVid4 - 4x upscalingPSNR23.82bicubic
1 Image, 2*2 StitchiVid4 - 4x upscalingSSIM0.6548bicubic

Related Papers

SAM4D: Segment Anything in Camera and LiDAR Streams2025-06-26Compressed Video Super-Resolution based on Hierarchical Encoding2025-06-17ICME 2025 Grand Challenge on Video Super-Resolution for Video Conferencing2025-06-13FCA2: Frame Compression-Aware Autoencoder for Modular and Fast Compressed Video Super-Resolution2025-06-13LiftVSR: Lifting Image Diffusion to Video Super-Resolution via Hybrid Temporal Modeling with Only 4$\times$RTX 4090s2025-06-10A Systematic Investigation on Deep Learning-Based Omnidirectional Image and Video Super-Resolution2025-06-07DualX-VSR: Dual Axial Spatial$\times$Temporal Transformer for Real-World Video Super-Resolution without Motion Compensation2025-06-05A Survey of Deep Learning Video Super-Resolution2025-06-03