TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/ST-MFNet: A Spatio-Temporal Multi-Flow Network for Frame I...

ST-MFNet: A Spatio-Temporal Multi-Flow Network for Frame Interpolation

Duolikun Danier, Fan Zhang, David Bull

2021-11-30CVPR 2022 1Video Frame InterpolationTexture Synthesis
PaperPDFCode(official)CodeCode

Abstract

Video frame interpolation (VFI) is currently a very active research topic, with applications spanning computer vision, post production and video encoding. VFI can be extremely challenging, particularly in sequences containing large motions, occlusions or dynamic textures, where existing approaches fail to offer perceptually robust interpolation performance. In this context, we present a novel deep learning based VFI method, ST-MFNet, based on a Spatio-Temporal Multi-Flow architecture. ST-MFNet employs a new multi-scale multi-flow predictor to estimate many-to-one intermediate flows, which are combined with conventional one-to-one optical flows to capture both large and complex motions. In order to enhance interpolation performance for various textures, a 3D CNN is also employed to model the content dynamics over an extended temporal window. Moreover, ST-MFNet has been trained within an ST-GAN framework, which was originally developed for texture synthesis, with the aim of further improving perceptual interpolation quality. Our approach has been comprehensively evaluated -- compared with fourteen state-of-the-art VFI algorithms -- clearly demonstrating that ST-MFNet consistently outperforms these benchmarks on varied and representative test datasets, with significant gains up to 1.09dB in PSNR for cases including large motions and dynamic textures. Project page: https://danielism97.github.io/ST-MFNet.

Results

TaskDatasetMetricValueModel
VideoSNU-FILM (medium)PSNR37.111ST-MFNet
VideoVFITexPSNR29.175ST-MFNet
VideoDAVISPSNR28.287ST-MFNet
VideoDAVISSSIM0.895ST-MFNet
VideoSNU-FILM (easy)PSNR40.775ST-MFNet
VideoUCF101PSNR33.384ST-MFNet
VideoSNU-FILM (extreme)PSNR25.81ST-MFNet
VideoSNU-FILM (hard)PSNR31.698ST-MFNet
Video Frame InterpolationSNU-FILM (medium)PSNR37.111ST-MFNet
Video Frame InterpolationVFITexPSNR29.175ST-MFNet
Video Frame InterpolationDAVISPSNR28.287ST-MFNet
Video Frame InterpolationDAVISSSIM0.895ST-MFNet
Video Frame InterpolationSNU-FILM (easy)PSNR40.775ST-MFNet
Video Frame InterpolationUCF101PSNR33.384ST-MFNet
Video Frame InterpolationSNU-FILM (extreme)PSNR25.81ST-MFNet
Video Frame InterpolationSNU-FILM (hard)PSNR31.698ST-MFNet

Related Papers

TLB-VFI: Temporal-Aware Latent Brownian Bridge Diffusion for Video Frame Interpolation2025-07-07Consistent Zero-shot 3D Texture Synthesis Using Geometry-aware Diffusion and Temporal Video Models2025-06-26Hunyuan3D 2.5: Towards High-Fidelity 3D Assets Generation with Ultimate Details2025-06-19TexTailor: Customized Text-aligned Texturing via Effective Resampling2025-06-12EmbodiedGen: Towards a Generative 3D World Engine for Embodied Intelligence2025-06-12FlexPainter: Flexible and Multi-View Consistent Texture Generation2025-06-03AceVFI: A Comprehensive Survey of Advances in Video Frame Interpolation2025-06-01UniTEX: Universal High Fidelity Generative Texturing for 3D Shapes2025-05-29