TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Learning Trajectory-Aware Transformer for Video Super-Reso...

Learning Trajectory-Aware Transformer for Video Super-Resolution

Chengxu Liu, Huan Yang, Jianlong Fu, Xueming Qian

2022-04-08CVPR 2022 1Super-ResolutionVideo Super-ResolutionVideo deraining
PaperPDFCode(official)

Abstract

Video super-resolution (VSR) aims to restore a sequence of high-resolution (HR) frames from their low-resolution (LR) counterparts. Although some progress has been made, there are grand challenges to effectively utilize temporal dependency in entire video sequences. Existing approaches usually align and aggregate video frames from limited adjacent frames (e.g., 5 or 7 frames), which prevents these approaches from satisfactory results. In this paper, we take one step further to enable effective spatio-temporal learning in videos. We propose a novel Trajectory-aware Transformer for Video Super-Resolution (TTVSR). In particular, we formulate video frames into several pre-aligned trajectories which consist of continuous visual tokens. For a query token, self-attention is only learned on relevant visual tokens along spatio-temporal trajectories. Compared with vanilla vision Transformers, such a design significantly reduces the computational cost and enables Transformers to model long-range features. We further propose a cross-scale feature tokenization module to overcome scale-changing problems that often occur in long-range videos. Experimental results demonstrate the superiority of the proposed TTVSR over state-of-the-art models, by extensive quantitative and qualitative evaluations in four widely-used video super-resolution benchmarks. Both code and pre-trained models can be downloaded at https://github.com/researchmm/TTVSR.

Results

TaskDatasetMetricValueModel
Super-ResolutionVid4 - 4x upscaling - BD degradationPSNR28.4TTVSR
Super-ResolutionVid4 - 4x upscaling - BD degradationSSIM0.8643TTVSR
Super-ResolutionUDM10 - 4x upscalingPSNR40.41TTVSR
Super-ResolutionUDM10 - 4x upscalingSSIM0.9712TTVSR
3D Human Pose EstimationVid4 - 4x upscaling - BD degradationPSNR28.4TTVSR
3D Human Pose EstimationVid4 - 4x upscaling - BD degradationSSIM0.8643TTVSR
3D Human Pose EstimationUDM10 - 4x upscalingPSNR40.41TTVSR
3D Human Pose EstimationUDM10 - 4x upscalingSSIM0.9712TTVSR
VideoVid4 - 4x upscaling - BD degradationPSNR28.4TTVSR
VideoVid4 - 4x upscaling - BD degradationSSIM0.8643TTVSR
VideoUDM10 - 4x upscalingPSNR40.41TTVSR
VideoUDM10 - 4x upscalingSSIM0.9712TTVSR
Pose EstimationVid4 - 4x upscaling - BD degradationPSNR28.4TTVSR
Pose EstimationVid4 - 4x upscaling - BD degradationSSIM0.8643TTVSR
Pose EstimationUDM10 - 4x upscalingPSNR40.41TTVSR
Pose EstimationUDM10 - 4x upscalingSSIM0.9712TTVSR
3DVid4 - 4x upscaling - BD degradationPSNR28.4TTVSR
3DVid4 - 4x upscaling - BD degradationSSIM0.8643TTVSR
3DUDM10 - 4x upscalingPSNR40.41TTVSR
3DUDM10 - 4x upscalingSSIM0.9712TTVSR
3D Face AnimationVid4 - 4x upscaling - BD degradationPSNR28.4TTVSR
3D Face AnimationVid4 - 4x upscaling - BD degradationSSIM0.8643TTVSR
3D Face AnimationUDM10 - 4x upscalingPSNR40.41TTVSR
3D Face AnimationUDM10 - 4x upscalingSSIM0.9712TTVSR
2D Human Pose EstimationVid4 - 4x upscaling - BD degradationPSNR28.4TTVSR
2D Human Pose EstimationVid4 - 4x upscaling - BD degradationSSIM0.8643TTVSR
2D Human Pose EstimationUDM10 - 4x upscalingPSNR40.41TTVSR
2D Human Pose EstimationUDM10 - 4x upscalingSSIM0.9712TTVSR
3D Absolute Human Pose EstimationVid4 - 4x upscaling - BD degradationPSNR28.4TTVSR
3D Absolute Human Pose EstimationVid4 - 4x upscaling - BD degradationSSIM0.8643TTVSR
3D Absolute Human Pose EstimationUDM10 - 4x upscalingPSNR40.41TTVSR
3D Absolute Human Pose EstimationUDM10 - 4x upscalingSSIM0.9712TTVSR
Video Super-ResolutionVid4 - 4x upscaling - BD degradationPSNR28.4TTVSR
Video Super-ResolutionVid4 - 4x upscaling - BD degradationSSIM0.8643TTVSR
Video Super-ResolutionUDM10 - 4x upscalingPSNR40.41TTVSR
Video Super-ResolutionUDM10 - 4x upscalingSSIM0.9712TTVSR
3D Object Super-ResolutionVid4 - 4x upscaling - BD degradationPSNR28.4TTVSR
3D Object Super-ResolutionVid4 - 4x upscaling - BD degradationSSIM0.8643TTVSR
3D Object Super-ResolutionUDM10 - 4x upscalingPSNR40.41TTVSR
3D Object Super-ResolutionUDM10 - 4x upscalingSSIM0.9712TTVSR
Video derainingVRDSPSNR28.05TTVSR
Video derainingVRDSSSIM0.8998TTVSR
1 Image, 2*2 StitchiVid4 - 4x upscaling - BD degradationPSNR28.4TTVSR
1 Image, 2*2 StitchiVid4 - 4x upscaling - BD degradationSSIM0.8643TTVSR
1 Image, 2*2 StitchiUDM10 - 4x upscalingPSNR40.41TTVSR
1 Image, 2*2 StitchiUDM10 - 4x upscalingSSIM0.9712TTVSR

Related Papers

SpectraLift: Physics-Guided Spectral-Inversion Network for Self-Supervised Hyperspectral Image Super-Resolution2025-07-17IM-LUT: Interpolation Mixing Look-Up Tables for Image Super-Resolution2025-07-14PanoDiff-SR: Synthesizing Dental Panoramic Radiographs using Diffusion and Super-resolution2025-07-12HNOSeg-XS: Extremely Small Hartley Neural Operator for Efficient and Resolution-Robust 3D Image Segmentation2025-07-104KAgent: Agentic Any Image to 4K Super-Resolution2025-07-09EAMamba: Efficient All-Around Vision State Space Model for Image Restoration2025-06-27Leveraging Vision-Language Models to Select Trustworthy Super-Resolution Samples Generated by Diffusion Models2025-06-25Unsupervised Image Super-Resolution Reconstruction Based on Real-World Degradation Patterns2025-06-20