TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/XVFI: eXtreme Video Frame Interpolation

XVFI: eXtreme Video Frame Interpolation

Hyeonjun Sim, Jihyong Oh, Munchurl Kim

2021-03-30ICCV 2021 10Optical Flow EstimationeXtreme-Video-Frame-Interpolation4kVideo Frame Interpolation
PaperPDFCode(official)

Abstract

In this paper, we firstly present a dataset (X4K1000FPS) of 4K videos of 1000 fps with the extreme motion to the research community for video frame interpolation (VFI), and propose an extreme VFI network, called XVFI-Net, that first handles the VFI for 4K videos with large motion. The XVFI-Net is based on a recursive multi-scale shared structure that consists of two cascaded modules for bidirectional optical flow learning between two input frames (BiOF-I) and for bidirectional optical flow learning from target to input frames (BiOF-T). The optical flows are stably approximated by a complementary flow reversal (CFR) proposed in BiOF-T module. During inference, the BiOF-I module can start at any scale of input while the BiOF-T module only operates at the original input scale so that the inference can be accelerated while maintaining highly accurate VFI performance. Extensive experimental results show that our XVFI-Net can successfully capture the essential information of objects with extremely large motions and complex textures while the state-of-the-art methods exhibit poor performance. Furthermore, our XVFI-Net framework also performs comparably on the previous lower resolution benchmark dataset, which shows a robustness of our algorithm as well. All source codes, pre-trained models, and proposed X4K1000FPS datasets are publicly available at https://github.com/JihyongOh/XVFI.

Results

TaskDatasetMetricValueModel
VideoVimeo90KPSNR35.07XVFI
VideoVimeo90KSSIM0.976XVFI
VideoMSU Video Frame InterpolationFPS5.4XVFI (S_{tst}=3)
VideoMSU Video Frame InterpolationLPIPS0.061XVFI (S_{tst}=3)
VideoMSU Video Frame InterpolationMS-SSIM0.933XVFI (S_{tst}=3)
VideoMSU Video Frame InterpolationPSNR27.35XVFI (S_{tst}=3)
VideoMSU Video Frame InterpolationSSIM0.913XVFI (S_{tst}=3)
VideoMSU Video Frame InterpolationSubjective score1.38XVFI (S_{tst}=3)
VideoMSU Video Frame InterpolationVMAF63.47XVFI (S_{tst}=3)
VideoMSU Video Frame InterpolationLPIPS0.049XVFI (S_{tst}=5)
VideoMSU Video Frame InterpolationMS-SSIM0.955XVFI (S_{tst}=5)
VideoMSU Video Frame InterpolationPSNR27.86XVFI (S_{tst}=5)
VideoMSU Video Frame InterpolationSSIM0.921XVFI (S_{tst}=5)
VideoMSU Video Frame InterpolationVMAF67.25XVFI (S_{tst}=5)
VideoX4K1000FPSPSNR30.12XVFI-Net (S_{tst}=5)
VideoX4K1000FPSSSIM0.87XVFI-Net (S_{tst}=5)
VideoX4K1000FPStOF2.15XVFI-Net (S_{tst}=5)
VideoX4K1000FPSPSNR28.86XVFI-Net (S_{tst}=3)
VideoX4K1000FPSSSIM0.858XVFI-Net (S_{tst}=3)
VideoX4K1000FPStOF2.67XVFI-Net (S_{tst}=3)
Video Frame InterpolationVimeo90KPSNR35.07XVFI
Video Frame InterpolationVimeo90KSSIM0.976XVFI
Video Frame InterpolationMSU Video Frame InterpolationFPS5.4XVFI (S_{tst}=3)
Video Frame InterpolationMSU Video Frame InterpolationLPIPS0.061XVFI (S_{tst}=3)
Video Frame InterpolationMSU Video Frame InterpolationMS-SSIM0.933XVFI (S_{tst}=3)
Video Frame InterpolationMSU Video Frame InterpolationPSNR27.35XVFI (S_{tst}=3)
Video Frame InterpolationMSU Video Frame InterpolationSSIM0.913XVFI (S_{tst}=3)
Video Frame InterpolationMSU Video Frame InterpolationSubjective score1.38XVFI (S_{tst}=3)
Video Frame InterpolationMSU Video Frame InterpolationVMAF63.47XVFI (S_{tst}=3)
Video Frame InterpolationMSU Video Frame InterpolationLPIPS0.049XVFI (S_{tst}=5)
Video Frame InterpolationMSU Video Frame InterpolationMS-SSIM0.955XVFI (S_{tst}=5)
Video Frame InterpolationMSU Video Frame InterpolationPSNR27.86XVFI (S_{tst}=5)
Video Frame InterpolationMSU Video Frame InterpolationSSIM0.921XVFI (S_{tst}=5)
Video Frame InterpolationMSU Video Frame InterpolationVMAF67.25XVFI (S_{tst}=5)
Video Frame InterpolationX4K1000FPSPSNR30.12XVFI-Net (S_{tst}=5)
Video Frame InterpolationX4K1000FPSSSIM0.87XVFI-Net (S_{tst}=5)
Video Frame InterpolationX4K1000FPStOF2.15XVFI-Net (S_{tst}=5)
Video Frame InterpolationX4K1000FPSPSNR28.86XVFI-Net (S_{tst}=3)
Video Frame InterpolationX4K1000FPSSSIM0.858XVFI-Net (S_{tst}=3)
Video Frame InterpolationX4K1000FPStOF2.67XVFI-Net (S_{tst}=3)

Related Papers

Channel-wise Motion Features for Efficient Motion Segmentation2025-07-17An Efficient Approach for Muscle Segmentation and 3D Reconstruction Using Keypoint Tracking in MRI Scan2025-07-11Dynamic Parameter Memory: Temporary LoRA-Enhanced LLM for Long-Sequence Emotion Recognition in Conversation2025-07-114KAgent: Agentic Any Image to 4K Super-Resolution2025-07-09Learning to Track Any Points from Human Motion2025-07-08TLB-VFI: Temporal-Aware Latent Brownian Bridge Diffusion for Video Frame Interpolation2025-07-07MEMFOF: High-Resolution Training for Memory-Efficient Multi-Frame Optical Flow Estimation2025-06-29EndoFlow-SLAM: Real-Time Endoscopic SLAM with Flow-Constrained Gaussian Splatting2025-06-26