TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Video Frame Interpolation with Transformer

Video Frame Interpolation with Transformer

Liying Lu, Ruizheng Wu, Huaijia Lin, Jiangbo Lu, Jiaya Jia

2022-05-15CVPR 2022 1Video Frame Interpolation
PaperPDFCode(official)

Abstract

Video frame interpolation (VFI), which aims to synthesize intermediate frames of a video, has made remarkable progress with development of deep convolutional networks over past years. Existing methods built upon convolutional networks generally face challenges of handling large motion due to the locality of convolution operations. To overcome this limitation, we introduce a novel framework, which takes advantage of Transformer to model long-range pixel correlation among video frames. Further, our network is equipped with a novel cross-scale window-based attention mechanism, where cross-scale windows interact with each other. This design effectively enlarges the receptive field and aggregates multi-scale information. Extensive quantitative and qualitative experiments demonstrate that our method achieves new state-of-the-art results on various benchmarks.

Results

TaskDatasetMetricValueModel
VideoMSU Video Frame InterpolationLPIPS0.044VFIformer
VideoMSU Video Frame InterpolationMS-SSIM0.942VFIformer
VideoMSU Video Frame InterpolationPSNR28.34VFIformer
VideoMSU Video Frame InterpolationSSIM0.917VFIformer
VideoMSU Video Frame InterpolationVMAF68.87VFIformer
Video Frame InterpolationMSU Video Frame InterpolationLPIPS0.044VFIformer
Video Frame InterpolationMSU Video Frame InterpolationMS-SSIM0.942VFIformer
Video Frame InterpolationMSU Video Frame InterpolationPSNR28.34VFIformer
Video Frame InterpolationMSU Video Frame InterpolationSSIM0.917VFIformer
Video Frame InterpolationMSU Video Frame InterpolationVMAF68.87VFIformer

Related Papers

TLB-VFI: Temporal-Aware Latent Brownian Bridge Diffusion for Video Frame Interpolation2025-07-07AceVFI: A Comprehensive Survey of Advances in Video Frame Interpolation2025-06-01PS4PRO: Pixel-to-pixel Supervision for Photorealistic Rendering and Optimization2025-05-28EventDiff: A Unified and Efficient Diffusion Model Framework for Event-based Video Frame Interpolation2025-05-13TimeTracker: Event-based Continuous Point Tracking for Video Frame Interpolation with Non-linear Motion2025-05-06Time-adaptive Video Frame Interpolation based on Residual Diffusion2025-04-07Coupled Video Frame Interpolation and Encoding with Hybrid Event Cameras for Low-Power High-Framerate Video2025-03-28Video Motion Graphs2025-03-26