TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Extracting Motion and Appearance via Inter-Frame Attention...

Extracting Motion and Appearance via Inter-Frame Attention for Efficient Video Frame Interpolation

Guozhen Zhang, Yuhan Zhu, Haonan Wang, Youxin Chen, Gangshan Wu, LiMin Wang

2023-03-01CVPR 2023 1Video Frame Interpolation
PaperPDFCode(official)

Abstract

Effectively extracting inter-frame motion and appearance information is important for video frame interpolation (VFI). Previous works either extract both types of information in a mixed way or elaborate separate modules for each type of information, which lead to representation ambiguity and low efficiency. In this paper, we propose a novel module to explicitly extract motion and appearance information via a unifying operation. Specifically, we rethink the information process in inter-frame attention and reuse its attention map for both appearance feature enhancement and motion information extraction. Furthermore, for efficient VFI, our proposed module could be seamlessly integrated into a hybrid CNN and Transformer architecture. This hybrid pipeline can alleviate the computational complexity of inter-frame attention as well as preserve detailed low-level structure information. Experimental results demonstrate that, for both fixed- and arbitrary-timestep interpolation, our method achieves state-of-the-art performance on various datasets. Meanwhile, our approach enjoys a lighter computation overhead over models with close performance. The source code and models are available at https://github.com/MCG-NJU/EMA-VFI.

Results

TaskDatasetMetricValueModel
VideoVimeo90KPSNR36.64EMA-VFI
VideoVimeo90KSSIM0.9819EMA-VFI
VideoXiph-2KPSNR36.9EMA-VFI
VideoXiph-2KSSIM0.945EMA-VFI
VideoSNU-FILM (medium)PSNR36.09EMA-VFI
VideoSNU-FILM (medium)SSIM0.9801EMA-VFI
VideoXiph-4kPSNR34.67EMA-VFI
VideoXiph-4kSSIM0.907EMA-VFI
VideoSNU-FILM (easy)PSNR39.98EMA-VFI
VideoSNU-FILM (easy)SSIM0.991EMA-VFI
VideoUCF101PSNR35.48EMA-VFI
VideoUCF101SSIM0.9701EMA-VFI
VideoSNU-FILM (extreme)PSNR25.69EMA-VFI
VideoSNU-FILM (extreme)SSIM0.8661EMA-VFI
VideoSNU-FILM (hard)PSNR30.94EMA-VFI
VideoSNU-FILM (hard)SSIM0.9392EMA-VFI
VideoMSU Video Frame InterpolationLPIPS0.022EMA-VFI
VideoMSU Video Frame InterpolationMS-SSIM0.965EMA-VFI
VideoMSU Video Frame InterpolationPSNR29.89EMA-VFI
VideoMSU Video Frame InterpolationSSIM0.953EMA-VFI
VideoMSU Video Frame InterpolationVMAF71.71EMA-VFI
VideoX4K1000FPSPSNR31.46EMA-VFI
VideoX4K1000FPS-2KPSNR32.85EMA-VFI
Video Frame InterpolationVimeo90KPSNR36.64EMA-VFI
Video Frame InterpolationVimeo90KSSIM0.9819EMA-VFI
Video Frame InterpolationXiph-2KPSNR36.9EMA-VFI
Video Frame InterpolationXiph-2KSSIM0.945EMA-VFI
Video Frame InterpolationSNU-FILM (medium)PSNR36.09EMA-VFI
Video Frame InterpolationSNU-FILM (medium)SSIM0.9801EMA-VFI
Video Frame InterpolationXiph-4kPSNR34.67EMA-VFI
Video Frame InterpolationXiph-4kSSIM0.907EMA-VFI
Video Frame InterpolationSNU-FILM (easy)PSNR39.98EMA-VFI
Video Frame InterpolationSNU-FILM (easy)SSIM0.991EMA-VFI
Video Frame InterpolationUCF101PSNR35.48EMA-VFI
Video Frame InterpolationUCF101SSIM0.9701EMA-VFI
Video Frame InterpolationSNU-FILM (extreme)PSNR25.69EMA-VFI
Video Frame InterpolationSNU-FILM (extreme)SSIM0.8661EMA-VFI
Video Frame InterpolationSNU-FILM (hard)PSNR30.94EMA-VFI
Video Frame InterpolationSNU-FILM (hard)SSIM0.9392EMA-VFI
Video Frame InterpolationMSU Video Frame InterpolationLPIPS0.022EMA-VFI
Video Frame InterpolationMSU Video Frame InterpolationMS-SSIM0.965EMA-VFI
Video Frame InterpolationMSU Video Frame InterpolationPSNR29.89EMA-VFI
Video Frame InterpolationMSU Video Frame InterpolationSSIM0.953EMA-VFI
Video Frame InterpolationMSU Video Frame InterpolationVMAF71.71EMA-VFI
Video Frame InterpolationX4K1000FPSPSNR31.46EMA-VFI
Video Frame InterpolationX4K1000FPS-2KPSNR32.85EMA-VFI

Related Papers

TLB-VFI: Temporal-Aware Latent Brownian Bridge Diffusion for Video Frame Interpolation2025-07-07AceVFI: A Comprehensive Survey of Advances in Video Frame Interpolation2025-06-01PS4PRO: Pixel-to-pixel Supervision for Photorealistic Rendering and Optimization2025-05-28EventDiff: A Unified and Efficient Diffusion Model Framework for Event-based Video Frame Interpolation2025-05-13TimeTracker: Event-based Continuous Point Tracking for Video Frame Interpolation with Non-linear Motion2025-05-06Time-adaptive Video Frame Interpolation based on Residual Diffusion2025-04-07Coupled Video Frame Interpolation and Encoding with Hybrid Event Cameras for Low-Power High-Framerate Video2025-03-28Video Motion Graphs2025-03-26