ST-MFNet: A Spatio-Temporal Multi-Flow Network for Frame Interpolation

Duolikun Danier, Fan Zhang, David Bull

2021-11-30CVPR 2022 1Video Frame Interpolation Texture Synthesis

Abstract

Video frame interpolation (VFI) is currently a very active research topic, with applications spanning computer vision, post production and video encoding. VFI can be extremely challenging, particularly in sequences containing large motions, occlusions or dynamic textures, where existing approaches fail to offer perceptually robust interpolation performance. In this context, we present a novel deep learning based VFI method, ST-MFNet, based on a Spatio-Temporal Multi-Flow architecture. ST-MFNet employs a new multi-scale multi-flow predictor to estimate many-to-one intermediate flows, which are combined with conventional one-to-one optical flows to capture both large and complex motions. In order to enhance interpolation performance for various textures, a 3D CNN is also employed to model the content dynamics over an extended temporal window. Moreover, ST-MFNet has been trained within an ST-GAN framework, which was originally developed for texture synthesis, with the aim of further improving perceptual interpolation quality. Our approach has been comprehensively evaluated -- compared with fourteen state-of-the-art VFI algorithms -- clearly demonstrating that ST-MFNet consistently outperforms these benchmarks on varied and representative test datasets, with significant gains up to 1.09dB in PSNR for cases including large motions and dynamic textures. Project page: https://danielism97.github.io/ST-MFNet.

Results

Task	Dataset	Metric	Value	Model
Video	SNU-FILM (medium)	PSNR	37.111	ST-MFNet
Video	VFITex	PSNR	29.175	ST-MFNet
Video	DAVIS	PSNR	28.287	ST-MFNet
Video	DAVIS	SSIM	0.895	ST-MFNet
Video	SNU-FILM (easy)	PSNR	40.775	ST-MFNet
Video	UCF101	PSNR	33.384	ST-MFNet
Video	SNU-FILM (extreme)	PSNR	25.81	ST-MFNet
Video	SNU-FILM (hard)	PSNR	31.698	ST-MFNet
Video Frame Interpolation	SNU-FILM (medium)	PSNR	37.111	ST-MFNet
Video Frame Interpolation	VFITex	PSNR	29.175	ST-MFNet
Video Frame Interpolation	DAVIS	PSNR	28.287	ST-MFNet
Video Frame Interpolation	DAVIS	SSIM	0.895	ST-MFNet
Video Frame Interpolation	SNU-FILM (easy)	PSNR	40.775	ST-MFNet
Video Frame Interpolation	UCF101	PSNR	33.384	ST-MFNet
Video Frame Interpolation	SNU-FILM (extreme)	PSNR	25.81	ST-MFNet
Video Frame Interpolation	SNU-FILM (hard)	PSNR	31.698	ST-MFNet

ST-MFNet: A Spatio-Temporal Multi-Flow Network for Frame Interpolation

Abstract

Results

Related Papers

ST-MFNet: A Spatio-Temporal Multi-Flow Network for Frame Interpolation

Abstract

Results

Related Papers