Xin Tao, Hongyun Gao, Renjie Liao, Jue Wang, Jiaya Jia
Previous CNN-based video super-resolution approaches need to align multiple frames to the reference. In this paper, we show that proper frame alignment and motion compensation is crucial for achieving high quality results. We accordingly propose a `sub-pixel motion compensation' (SPMC) layer in a CNN framework. Analysis and experiments show the suitability of this layer in video SR. The final end-to-end, scalable CNN framework effectively incorporates the SPMC layer and fuses multiple frames to reveal image details. Our implementation can generate visually and quantitatively high-quality results, superior to current state-of-the-arts, without the need of parameter tuning.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Super-Resolution | Set14 - 4x upscaling | PSNR | 27.57 | SPMC |
| Super-Resolution | Set14 - 4x upscaling | SSIM | 0.76 | SPMC |
| Super-Resolution | MSU Video Upscalers: Quality Enhancement | PSNR | 26.99 | SPMC |
| Super-Resolution | MSU Video Upscalers: Quality Enhancement | SSIM | 0.933 | SPMC |
| Super-Resolution | MSU Video Upscalers: Quality Enhancement | VMAF | 51.96 | SPMC |
| Super-Resolution | Vid4 - 4x upscaling | PSNR | 25.88 | DRDVSR |
| Super-Resolution | Vid4 - 4x upscaling | SSIM | 0.774 | DRDVSR |
| 3D Human Pose Estimation | MSU Video Upscalers: Quality Enhancement | PSNR | 26.99 | SPMC |
| 3D Human Pose Estimation | MSU Video Upscalers: Quality Enhancement | SSIM | 0.933 | SPMC |
| 3D Human Pose Estimation | MSU Video Upscalers: Quality Enhancement | VMAF | 51.96 | SPMC |
| 3D Human Pose Estimation | Vid4 - 4x upscaling | PSNR | 25.88 | DRDVSR |
| 3D Human Pose Estimation | Vid4 - 4x upscaling | SSIM | 0.774 | DRDVSR |
| Video | MSU Video Upscalers: Quality Enhancement | PSNR | 26.99 | SPMC |
| Video | MSU Video Upscalers: Quality Enhancement | SSIM | 0.933 | SPMC |
| Video | MSU Video Upscalers: Quality Enhancement | VMAF | 51.96 | SPMC |
| Video | Vid4 - 4x upscaling | PSNR | 25.88 | DRDVSR |
| Video | Vid4 - 4x upscaling | SSIM | 0.774 | DRDVSR |
| Pose Estimation | MSU Video Upscalers: Quality Enhancement | PSNR | 26.99 | SPMC |
| Pose Estimation | MSU Video Upscalers: Quality Enhancement | SSIM | 0.933 | SPMC |
| Pose Estimation | MSU Video Upscalers: Quality Enhancement | VMAF | 51.96 | SPMC |
| Pose Estimation | Vid4 - 4x upscaling | PSNR | 25.88 | DRDVSR |
| Pose Estimation | Vid4 - 4x upscaling | SSIM | 0.774 | DRDVSR |
| 3D | MSU Video Upscalers: Quality Enhancement | PSNR | 26.99 | SPMC |
| 3D | MSU Video Upscalers: Quality Enhancement | SSIM | 0.933 | SPMC |
| 3D | MSU Video Upscalers: Quality Enhancement | VMAF | 51.96 | SPMC |
| 3D | Vid4 - 4x upscaling | PSNR | 25.88 | DRDVSR |
| 3D | Vid4 - 4x upscaling | SSIM | 0.774 | DRDVSR |
| 3D Face Animation | MSU Video Upscalers: Quality Enhancement | PSNR | 26.99 | SPMC |
| 3D Face Animation | MSU Video Upscalers: Quality Enhancement | SSIM | 0.933 | SPMC |
| 3D Face Animation | MSU Video Upscalers: Quality Enhancement | VMAF | 51.96 | SPMC |
| 3D Face Animation | Vid4 - 4x upscaling | PSNR | 25.88 | DRDVSR |
| 3D Face Animation | Vid4 - 4x upscaling | SSIM | 0.774 | DRDVSR |
| Image Super-Resolution | Set14 - 4x upscaling | PSNR | 27.57 | SPMC |
| Image Super-Resolution | Set14 - 4x upscaling | SSIM | 0.76 | SPMC |
| 2D Human Pose Estimation | MSU Video Upscalers: Quality Enhancement | PSNR | 26.99 | SPMC |
| 2D Human Pose Estimation | MSU Video Upscalers: Quality Enhancement | SSIM | 0.933 | SPMC |
| 2D Human Pose Estimation | MSU Video Upscalers: Quality Enhancement | VMAF | 51.96 | SPMC |
| 2D Human Pose Estimation | Vid4 - 4x upscaling | PSNR | 25.88 | DRDVSR |
| 2D Human Pose Estimation | Vid4 - 4x upscaling | SSIM | 0.774 | DRDVSR |
| 3D Absolute Human Pose Estimation | MSU Video Upscalers: Quality Enhancement | PSNR | 26.99 | SPMC |
| 3D Absolute Human Pose Estimation | MSU Video Upscalers: Quality Enhancement | SSIM | 0.933 | SPMC |
| 3D Absolute Human Pose Estimation | MSU Video Upscalers: Quality Enhancement | VMAF | 51.96 | SPMC |
| 3D Absolute Human Pose Estimation | Vid4 - 4x upscaling | PSNR | 25.88 | DRDVSR |
| 3D Absolute Human Pose Estimation | Vid4 - 4x upscaling | SSIM | 0.774 | DRDVSR |
| Video Super-Resolution | MSU Video Upscalers: Quality Enhancement | PSNR | 26.99 | SPMC |
| Video Super-Resolution | MSU Video Upscalers: Quality Enhancement | SSIM | 0.933 | SPMC |
| Video Super-Resolution | MSU Video Upscalers: Quality Enhancement | VMAF | 51.96 | SPMC |
| Video Super-Resolution | Vid4 - 4x upscaling | PSNR | 25.88 | DRDVSR |
| Video Super-Resolution | Vid4 - 4x upscaling | SSIM | 0.774 | DRDVSR |
| 3D Object Super-Resolution | Set14 - 4x upscaling | PSNR | 27.57 | SPMC |
| 3D Object Super-Resolution | Set14 - 4x upscaling | SSIM | 0.76 | SPMC |
| 3D Object Super-Resolution | MSU Video Upscalers: Quality Enhancement | PSNR | 26.99 | SPMC |
| 3D Object Super-Resolution | MSU Video Upscalers: Quality Enhancement | SSIM | 0.933 | SPMC |
| 3D Object Super-Resolution | MSU Video Upscalers: Quality Enhancement | VMAF | 51.96 | SPMC |
| 3D Object Super-Resolution | Vid4 - 4x upscaling | PSNR | 25.88 | DRDVSR |
| 3D Object Super-Resolution | Vid4 - 4x upscaling | SSIM | 0.774 | DRDVSR |
| 1 Image, 2*2 Stitchi | MSU Video Upscalers: Quality Enhancement | PSNR | 26.99 | SPMC |
| 1 Image, 2*2 Stitchi | MSU Video Upscalers: Quality Enhancement | SSIM | 0.933 | SPMC |
| 1 Image, 2*2 Stitchi | MSU Video Upscalers: Quality Enhancement | VMAF | 51.96 | SPMC |
| 1 Image, 2*2 Stitchi | Vid4 - 4x upscaling | PSNR | 25.88 | DRDVSR |
| 1 Image, 2*2 Stitchi | Vid4 - 4x upscaling | SSIM | 0.774 | DRDVSR |
| 16k | Set14 - 4x upscaling | PSNR | 27.57 | SPMC |
| 16k | Set14 - 4x upscaling | SSIM | 0.76 | SPMC |