Video Frame Synthesis using Deep Voxel Flow

Ziwei Liu, Raymond A. Yeh, Xiaoou Tang, Yiming Liu, Aseem Agarwala

2017-02-08 | ICCV 2017 10 | Optical Flow Estimation, Video Prediction

Abstract

We address the problem of synthesizing new video frames in an existing video, either in-between existing frames (interpolation), or subsequent to them (extrapolation). This problem is challenging because video appearance and motion can be highly complex. Traditional optical-flow-based solutions often fail where flow estimation is challenging, while newer neural-network-based methods that hallucinate pixel values directly often produce blurry results. We combine the advantages of these two methods by training a deep network that learns to synthesize video frames by flowing pixel values from existing ones, which we call deep voxel flow. Our method requires no human supervision, and any video can be used as training data by dropping, and then learning to predict, existing frames. The technique is efficient, and can be applied at any video resolution. We demonstrate that our method produces results that both quantitatively and qualitatively improve upon the state-of-the-art.
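The idea of synthesizing a frame by "flowing pixel values from existing ones" can be illustrated with a minimal NumPy sketch. The abstract does not give the exact formulation, so everything below is an assumption made for illustration: the flow field is taken to be a per-pixel triple (dx, dy, dt), the two input frames are sampled symmetrically around each target pixel with bilinear interpolation, and dt is used as the blend weight on the second frame. The function names (bilinear_sample, synthesize_frame, make_triplets) are hypothetical. In the method described above a network would predict dx, dy and dt; here they are simply passed in.

```python
import numpy as np


def bilinear_sample(frame, xs, ys):
    """Sample frame (H, W, C) at float coordinates xs, ys (each H x W).

    Coordinates outside the image are clamped to the border.
    """
    h, w = frame.shape[:2]
    xs = np.clip(xs, 0, w - 1)
    ys = np.clip(ys, 0, h - 1)
    x0 = np.floor(xs).astype(int)
    y0 = np.floor(ys).astype(int)
    x1 = np.minimum(x0 + 1, w - 1)
    y1 = np.minimum(y0 + 1, h - 1)
    wx = (xs - x0)[..., None]  # horizontal interpolation weight
    wy = (ys - y0)[..., None]  # vertical interpolation weight
    top = frame[y0, x0] * (1 - wx) + frame[y0, x1] * wx
    bottom = frame[y1, x0] * (1 - wx) + frame[y1, x1] * wx
    return top * (1 - wy) + bottom * wy


def synthesize_frame(frame0, frame1, dx, dy, dt):
    """Blend pixels copied from two frames along an assumed flow field.

    dx, dy: per-pixel spatial offsets (H x W), assumed symmetric in time.
    dt:     per-pixel blend weight in [0, 1]; 0 copies only from frame0.
    """
    h, w = frame0.shape[:2]
    ys, xs = np.mgrid[0:h, 0:w].astype(float)
    from0 = bilinear_sample(frame0, xs - dx, ys - dy)
    from1 = bilinear_sample(frame1, xs + dx, ys + dy)
    weight = dt[..., None]
    return (1 - weight) * from0 + weight * from1


def make_triplets(frames):
    """Yield (previous, next, target) by dropping each interior frame.

    The dropped frame serves as the training target, so no labels are needed.
    """
    for i in range(1, len(frames) - 1):
        yield frames[i - 1], frames[i + 1], frames[i]
```

Because the output is built only from copied and interpolated input pixels, it cannot contain values outside the range of the two source frames, which is one intuition for why such an approach might avoid the blur of direct pixel regression. The sampling code is independent of image size, in line with the claim that the technique can be applied at any resolution.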

Results

Task	Dataset	Metric	Value	Model
Video	Cityscapes	LPIPS	0.1737	DVF
Video	Cityscapes	MS-SSIM	0.835	DVF
Video	DAVIS 2017	LPIPS	0.2323	DVF
Video	DAVIS 2017	MS-SSIM	0.6861	DVF
Video	Vimeo90K	LPIPS	0.0773	DVF
Video	Vimeo90K	MS-SSIM	0.9211	DVF
Video	KITTI	LPIPS	0.3247	DVF
Video	KITTI	MS-SSIM	0.5393	DVF
Video Prediction	Cityscapes	LPIPS	0.1737	DVF
Video Prediction	Cityscapes	MS-SSIM	0.835	DVF
Video Prediction	DAVIS 2017	LPIPS	0.2323	DVF
Video Prediction	DAVIS 2017	MS-SSIM	0.6861	DVF
Video Prediction	Vimeo90K	LPIPS	0.0773	DVF
Video Prediction	Vimeo90K	MS-SSIM	0.9211	DVF
Video Prediction	KITTI	LPIPS	0.3247	DVF
Video Prediction	KITTI	MS-SSIM	0.5393	DVF
