TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/VideoGigaGAN: Towards Detail-rich Video Super-Resolution

VideoGigaGAN: Towards Detail-rich Video Super-Resolution

Yiran Xu, Taesung Park, Richard Zhang, Yang Zhou, Eli Shechtman, Feng Liu, Jia-Bin Huang, Difan Liu

2024-04-18CVPR 2025 1Super-ResolutionVideo Super-Resolution
PaperPDF

Abstract

Video super-resolution (VSR) approaches have shown impressive temporal consistency in upsampled videos. However, these approaches tend to generate blurrier results than their image counterparts as they are limited in their generative capability. This raises a fundamental question: can we extend the success of a generative image upsampler to the VSR task while preserving the temporal consistency? We introduce VideoGigaGAN, a new generative VSR model that can produce videos with high-frequency details and temporal consistency. VideoGigaGAN builds upon a large-scale image upsampler -- GigaGAN. Simply inflating GigaGAN to a video model by adding temporal modules produces severe temporal flickering. We identify several key issues and propose techniques that significantly improve the temporal consistency of upsampled videos. Our experiments show that, unlike previous VSR methods, VideoGigaGAN generates temporally consistent videos with more fine-grained appearance details. We validate the effectiveness of VideoGigaGAN by comparing it with state-of-the-art VSR models on public datasets and showcasing video results with $8\times$ super-resolution.

Results

TaskDatasetMetricValueModel
Super-ResolutionVid4 - 4x upscalingPSNR27.04VideoGigaGAN
3D Human Pose EstimationVid4 - 4x upscalingPSNR27.04VideoGigaGAN
VideoVid4 - 4x upscalingPSNR27.04VideoGigaGAN
Pose EstimationVid4 - 4x upscalingPSNR27.04VideoGigaGAN
3DVid4 - 4x upscalingPSNR27.04VideoGigaGAN
3D Face AnimationVid4 - 4x upscalingPSNR27.04VideoGigaGAN
2D Human Pose EstimationVid4 - 4x upscalingPSNR27.04VideoGigaGAN
3D Absolute Human Pose EstimationVid4 - 4x upscalingPSNR27.04VideoGigaGAN
Video Super-ResolutionVid4 - 4x upscalingPSNR27.04VideoGigaGAN
3D Object Super-ResolutionVid4 - 4x upscalingPSNR27.04VideoGigaGAN
1 Image, 2*2 StitchiVid4 - 4x upscalingPSNR27.04VideoGigaGAN

Related Papers

SpectraLift: Physics-Guided Spectral-Inversion Network for Self-Supervised Hyperspectral Image Super-Resolution2025-07-17IM-LUT: Interpolation Mixing Look-Up Tables for Image Super-Resolution2025-07-14PanoDiff-SR: Synthesizing Dental Panoramic Radiographs using Diffusion and Super-resolution2025-07-12HNOSeg-XS: Extremely Small Hartley Neural Operator for Efficient and Resolution-Robust 3D Image Segmentation2025-07-104KAgent: Agentic Any Image to 4K Super-Resolution2025-07-09EAMamba: Efficient All-Around Vision State Space Model for Image Restoration2025-06-27Leveraging Vision-Language Models to Select Trustworthy Super-Resolution Samples Generated by Diffusion Models2025-06-25Unsupervised Image Super-Resolution Reconstruction Based on Real-World Degradation Patterns2025-06-20