TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Video Frame Synthesis using Deep Voxel Flow

Video Frame Synthesis using Deep Voxel Flow

Ziwei Liu, Raymond A. Yeh, Xiaoou Tang, Yiming Liu, Aseem Agarwala

2017-02-08ICCV 2017 10Optical Flow EstimationVideo Prediction
PaperPDFCodeCodeCode

Abstract

We address the problem of synthesizing new video frames in an existing video, either in-between existing frames (interpolation), or subsequent to them (extrapolation). This problem is challenging because video appearance and motion can be highly complex. Traditional optical-flow-based solutions often fail where flow estimation is challenging, while newer neural-network-based methods that hallucinate pixel values directly often produce blurry results. We combine the advantages of these two methods by training a deep network that learns to synthesize video frames by flowing pixel values from existing ones, which we call deep voxel flow. Our method requires no human supervision, and any video can be used as training data by dropping, and then learning to predict, existing frames. The technique is efficient, and can be applied at any video resolution. We demonstrate that our method produces results that both quantitatively and qualitatively improve upon the state-of-the-art.

Results

TaskDatasetMetricValueModel
VideoCityscapesLPIPS0.1737DVF
VideoCityscapesMS-SSIM0.835DVF
VideoDAVIS 2017LPIPS0.2323DVF
VideoDAVIS 2017MS-SSIM0.6861DVF
VideoVimeo90KLPIPS0.0773DVF
VideoVimeo90KMS-SSIM0.9211DVF
VideoKITTILPIPS0.3247DVF
VideoKITTIMS-SSIM0.5393DVF
Video PredictionCityscapesLPIPS0.1737DVF
Video PredictionCityscapesMS-SSIM0.835DVF
Video PredictionDAVIS 2017LPIPS0.2323DVF
Video PredictionDAVIS 2017MS-SSIM0.6861DVF
Video PredictionVimeo90KLPIPS0.0773DVF
Video PredictionVimeo90KMS-SSIM0.9211DVF
Video PredictionKITTILPIPS0.3247DVF
Video PredictionKITTIMS-SSIM0.5393DVF

Related Papers

Channel-wise Motion Features for Efficient Motion Segmentation2025-07-17An Efficient Approach for Muscle Segmentation and 3D Reconstruction Using Keypoint Tracking in MRI Scan2025-07-11Learning to Track Any Points from Human Motion2025-07-08TLB-VFI: Temporal-Aware Latent Brownian Bridge Diffusion for Video Frame Interpolation2025-07-07Epona: Autoregressive Diffusion World Model for Autonomous Driving2025-06-30MEMFOF: High-Resolution Training for Memory-Efficient Multi-Frame Optical Flow Estimation2025-06-29EndoFlow-SLAM: Real-Time Endoscopic SLAM with Flow-Constrained Gaussian Splatting2025-06-26WAFT: Warping-Alone Field Transforms for Optical Flow2025-06-26