TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Investigating Tradeoffs in Real-World Video Super-Resolution

Investigating Tradeoffs in Real-World Video Super-Resolution

Kelvin C. K. Chan, Shangchen Zhou, Xiangyu Xu, Chen Change Loy

2021-11-24CVPR 2022 1Super-ResolutionBenchmarkingVideo Super-Resolution
PaperPDFCode(official)

Abstract

The diversity and complexity of degradations in real-world video super-resolution (VSR) pose non-trivial challenges in inference and training. First, while long-term propagation leads to improved performance in cases of mild degradations, severe in-the-wild degradations could be exaggerated through propagation, impairing output quality. To balance the tradeoff between detail synthesis and artifact suppression, we found an image pre-cleaning stage indispensable to reduce noises and artifacts prior to propagation. Equipped with a carefully designed cleaning module, our RealBasicVSR outperforms existing methods in both quality and efficiency. Second, real-world VSR models are often trained with diverse degradations to improve generalizability, requiring increased batch size to produce a stable gradient. Inevitably, the increased computational burden results in various problems, including 1) speed-performance tradeoff and 2) batch-length tradeoff. To alleviate the first tradeoff, we propose a stochastic degradation scheme that reduces up to 40\% of training time without sacrificing performance. We then analyze different training settings and suggest that employing longer sequences rather than larger batches during training allows more effective uses of temporal information, leading to more stable performance during inference. To facilitate fair comparisons, we propose the new VideoLQ dataset, which contains a large variety of real-world low-quality video sequences containing rich textures and patterns. Our dataset can serve as a common ground for benchmarking. Code, models, and the dataset will be made publicly available.

Results

TaskDatasetMetricValueModel
Super-ResolutionMSU Video Upscalers: Quality EnhancementLPIPS0.201RealBasicVSR
Super-ResolutionMSU Video Upscalers: Quality EnhancementPSNR29.54RealBasicVSR
Super-ResolutionMSU Video Upscalers: Quality EnhancementSSIM0.838RealBasicVSR
3D Human Pose EstimationMSU Video Upscalers: Quality EnhancementLPIPS0.201RealBasicVSR
3D Human Pose EstimationMSU Video Upscalers: Quality EnhancementPSNR29.54RealBasicVSR
3D Human Pose EstimationMSU Video Upscalers: Quality EnhancementSSIM0.838RealBasicVSR
VideoMSU Video Upscalers: Quality EnhancementLPIPS0.201RealBasicVSR
VideoMSU Video Upscalers: Quality EnhancementPSNR29.54RealBasicVSR
VideoMSU Video Upscalers: Quality EnhancementSSIM0.838RealBasicVSR
Pose EstimationMSU Video Upscalers: Quality EnhancementLPIPS0.201RealBasicVSR
Pose EstimationMSU Video Upscalers: Quality EnhancementPSNR29.54RealBasicVSR
Pose EstimationMSU Video Upscalers: Quality EnhancementSSIM0.838RealBasicVSR
3DMSU Video Upscalers: Quality EnhancementLPIPS0.201RealBasicVSR
3DMSU Video Upscalers: Quality EnhancementPSNR29.54RealBasicVSR
3DMSU Video Upscalers: Quality EnhancementSSIM0.838RealBasicVSR
3D Face AnimationMSU Video Upscalers: Quality EnhancementLPIPS0.201RealBasicVSR
3D Face AnimationMSU Video Upscalers: Quality EnhancementPSNR29.54RealBasicVSR
3D Face AnimationMSU Video Upscalers: Quality EnhancementSSIM0.838RealBasicVSR
2D Human Pose EstimationMSU Video Upscalers: Quality EnhancementLPIPS0.201RealBasicVSR
2D Human Pose EstimationMSU Video Upscalers: Quality EnhancementPSNR29.54RealBasicVSR
2D Human Pose EstimationMSU Video Upscalers: Quality EnhancementSSIM0.838RealBasicVSR
3D Absolute Human Pose EstimationMSU Video Upscalers: Quality EnhancementLPIPS0.201RealBasicVSR
3D Absolute Human Pose EstimationMSU Video Upscalers: Quality EnhancementPSNR29.54RealBasicVSR
3D Absolute Human Pose EstimationMSU Video Upscalers: Quality EnhancementSSIM0.838RealBasicVSR
Video Super-ResolutionMSU Video Upscalers: Quality EnhancementLPIPS0.201RealBasicVSR
Video Super-ResolutionMSU Video Upscalers: Quality EnhancementPSNR29.54RealBasicVSR
Video Super-ResolutionMSU Video Upscalers: Quality EnhancementSSIM0.838RealBasicVSR
3D Object Super-ResolutionMSU Video Upscalers: Quality EnhancementLPIPS0.201RealBasicVSR
3D Object Super-ResolutionMSU Video Upscalers: Quality EnhancementPSNR29.54RealBasicVSR
3D Object Super-ResolutionMSU Video Upscalers: Quality EnhancementSSIM0.838RealBasicVSR
1 Image, 2*2 StitchiMSU Video Upscalers: Quality EnhancementLPIPS0.201RealBasicVSR
1 Image, 2*2 StitchiMSU Video Upscalers: Quality EnhancementPSNR29.54RealBasicVSR
1 Image, 2*2 StitchiMSU Video Upscalers: Quality EnhancementSSIM0.838RealBasicVSR

Related Papers

Visual Place Recognition for Large-Scale UAV Applications2025-07-20SpectraLift: Physics-Guided Spectral-Inversion Network for Self-Supervised Hyperspectral Image Super-Resolution2025-07-17Training Transformers with Enforced Lipschitz Constants2025-07-17Disentangling coincident cell events using deep transfer learning and compressive sensing2025-07-17MUPAX: Multidimensional Problem Agnostic eXplainable AI2025-07-17DVFL-Net: A Lightweight Distilled Video Focal Modulation Network for Spatio-Temporal Action Recognition2025-07-16DCR: Quantifying Data Contamination in LLMs Evaluation2025-07-15A Multi-View High-Resolution Foot-Ankle Complex Point Cloud Dataset During Gait for Occlusion-Robust 3D Completion2025-07-15