We propose RIFE, a Real-time Intermediate Flow Estimation algorithm for Video Frame Interpolation (VFI). Many recent flow-based VFI methods first estimate bi-directional optical flows, then scale and reverse them to approximate the intermediate flows, which leads to artifacts on motion boundaries and to complex pipelines. RIFE uses a neural network named IFNet that directly estimates the intermediate flows in a coarse-to-fine manner and runs much faster. We design a privileged distillation scheme for training IFNet, which yields a large performance improvement. RIFE does not rely on pre-trained optical flow models and supports arbitrary-timestep frame interpolation by taking a temporal encoding as input. Experiments demonstrate that RIFE achieves state-of-the-art performance on several public benchmarks. Compared with the popular SuperSlomo and DAIN methods, RIFE is 4 to 27 times faster and produces better results. The code is available at https://github.com/hzwer/arXiv2020-RIFE.

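
The motion-boundary problem with scaled and reversed flows can be illustrated with a toy 1-D sketch. It is not taken from the RIFE code base: it assumes the simplest linear-motion approximation, in which the intermediate flow at a pixel is the frame-0 flow at the same pixel scaled by -t, and all function names and scene parameters are hypothetical. An object of 3 pixels moves 4 pixels to the right over a static background, and the approximate flow at t = 0.5 is compared with the true one.

```python
def flow_0_to_1(width, start, size, shift):
    """Flow on frame 0's grid: object pixels move by `shift`, background is static."""
    return [shift if start <= x < start + size else 0 for x in range(width)]


def true_flow_t_to_0(width, start, size, shift, t):
    """Ground-truth flow on the intermediate grid.

    At time t the object has already moved by t * shift, so the pixels it
    covers differ from the ones it covered in frame 0.
    """
    moved = start + t * shift
    return [-t * shift if moved <= x < moved + size else 0 for x in range(width)]


def scaled_reversed_flow(f01, t):
    """Assumed approximation: reuse the frame-0 flow at the same pixel, scaled by -t."""
    return [-t * f for f in f01]


# Hypothetical toy scene.
WIDTH, START, SIZE, SHIFT, T = 12, 2, 3, 4, 0.5

f01 = flow_0_to_1(WIDTH, START, SIZE, SHIFT)
approx = scaled_reversed_flow(f01, T)
truth = true_flow_t_to_0(WIDTH, START, SIZE, SHIFT, T)

# Pixels where the approximation disagrees with the true intermediate flow.
mismatched = [x for x in range(WIDTH) if approx[x] != truth[x]]
print(mismatched)  # [2, 3, 5, 6]
```

The errors sit exactly at the edges of the moving object: the approximation places the flow where the object was in frame 0 rather than where it is at time t. Estimating the flow directly on the intermediate frame's grid, as the abstract describes for IFNet, avoids this misalignment by construction.
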
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Video Frame Interpolation | MSU Video Frame Interpolation | FPS | 27.3 | RIFE |
| Video Frame Interpolation | MSU Video Frame Interpolation | LPIPS | 0.039 | RIFE |
| Video Frame Interpolation | MSU Video Frame Interpolation | MS-SSIM | 0.939 | RIFE |
| Video Frame Interpolation | MSU Video Frame Interpolation | PSNR | 27.15 | RIFE |
| Video Frame Interpolation | MSU Video Frame Interpolation | SSIM | 0.914 | RIFE |
| Video Frame Interpolation | MSU Video Frame Interpolation | Subjective score | 1.99 | RIFE |
| Video Frame Interpolation | MSU Video Frame Interpolation | VMAF | 66.33 | RIFE |
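
For readers unfamiliar with the PSNR row, the following is a minimal sketch of the standard definition, PSNR = 10 log10(peak^2 / MSE). It assumes 8-bit samples (peak value 255) given as flat sequences, and the function name is hypothetical. The benchmark's exact evaluation protocol (color space, per-frame averaging) is not stated here, so this should not be read as the procedure that produced the values in the table.

```python
import math


def psnr(reference, estimate, peak=255.0):
    """Peak signal-to-noise ratio in dB between two equal-length sample sequences."""
    if len(reference) != len(estimate) or len(reference) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    # Mean squared error over all samples.
    mse = sum((r - e) ** 2 for r, e in zip(reference, estimate)) / len(reference)
    if mse == 0:
        return math.inf  # identical signals
    return 10.0 * math.log10(peak ** 2 / mse)


# An interpolated frame that is off by one gray level everywhere (MSE = 1).
print(round(psnr([100, 100, 100, 100], [101, 99, 101, 99]), 2))  # 48.13
```

Higher values indicate a closer match to the ground-truth frame; identical frames give an infinite PSNR.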