Denys Rozumnyi, Martin R. Oswald, Vittorio Ferrari, Jiri Matas, Marc Pollefeys
Objects moving at high speed appear significantly blurred when captured with cameras. The blurry appearance is especially ambiguous when the object has complex shape or texture. In such cases, classical methods, or even humans, are unable to recover the object's appearance and motion. We propose a method that, given a single image with its estimated background, outputs the object's appearance and position in a series of sub-frames as if captured by a high-speed camera (i.e. temporal super-resolution). The proposed generative model embeds an image of the blurred object into a latent space representation, disentangles the background, and renders the sharp appearance. Inspired by the image formation model, we design novel self-supervised loss function terms that boost performance and show good generalization capabilities. The proposed DeFMO method is trained on a complex synthetic dataset, yet it performs well on real-world data from several datasets. DeFMO outperforms the state of the art and generates high-quality temporal super-resolution frames.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Super-Resolution | Falling Objects | PSNR | 26.83 | DeFMO |
| Super-Resolution | Falling Objects | SSIM | 0.753 | DeFMO |
| Super-Resolution | Falling Objects | TIoU | 0.684 | DeFMO |
| Super-Resolution | TbD-3D | PSNR | 26.23 | DeFMO |
| Super-Resolution | TbD-3D | SSIM | 0.699 | DeFMO |
| Super-Resolution | TbD-3D | TIoU | 0.879 | DeFMO |
| Super-Resolution | TbD | PSNR | 25.57 | DeFMO |
| Super-Resolution | TbD | SSIM | 0.602 | DeFMO |
| Super-Resolution | TbD | TIoU | 0.55 | DeFMO |
| 3D Human Pose Estimation | Falling Objects | PSNR | 26.83 | DeFMO |
| 3D Human Pose Estimation | Falling Objects | SSIM | 0.753 | DeFMO |
| 3D Human Pose Estimation | Falling Objects | TIoU | 0.684 | DeFMO |
| 3D Human Pose Estimation | TbD-3D | PSNR | 26.23 | DeFMO |
| 3D Human Pose Estimation | TbD-3D | SSIM | 0.699 | DeFMO |
| 3D Human Pose Estimation | TbD-3D | TIoU | 0.879 | DeFMO |
| 3D Human Pose Estimation | TbD | PSNR | 25.57 | DeFMO |
| 3D Human Pose Estimation | TbD | SSIM | 0.602 | DeFMO |
| 3D Human Pose Estimation | TbD | TIoU | 0.55 | DeFMO |
| Video | Falling Objects | PSNR | 26.83 | DeFMO |
| Video | Falling Objects | SSIM | 0.753 | DeFMO |
| Video | Falling Objects | TIoU | 0.684 | DeFMO |
| Video | TbD-3D | PSNR | 26.23 | DeFMO |
| Video | TbD-3D | SSIM | 0.699 | DeFMO |
| Video | TbD-3D | TIoU | 0.879 | DeFMO |
| Video | TbD | PSNR | 25.57 | DeFMO |
| Video | TbD | SSIM | 0.602 | DeFMO |
| Video | TbD | TIoU | 0.55 | DeFMO |
| Pose Estimation | Falling Objects | PSNR | 26.83 | DeFMO |
| Pose Estimation | Falling Objects | SSIM | 0.753 | DeFMO |
| Pose Estimation | Falling Objects | TIoU | 0.684 | DeFMO |
| Pose Estimation | TbD-3D | PSNR | 26.23 | DeFMO |
| Pose Estimation | TbD-3D | SSIM | 0.699 | DeFMO |
| Pose Estimation | TbD-3D | TIoU | 0.879 | DeFMO |
| Pose Estimation | TbD | PSNR | 25.57 | DeFMO |
| Pose Estimation | TbD | SSIM | 0.602 | DeFMO |
| Pose Estimation | TbD | TIoU | 0.55 | DeFMO |
| 3D | Falling Objects | PSNR | 26.83 | DeFMO |
| 3D | Falling Objects | SSIM | 0.753 | DeFMO |
| 3D | Falling Objects | TIoU | 0.684 | DeFMO |
| 3D | TbD-3D | PSNR | 26.23 | DeFMO |
| 3D | TbD-3D | SSIM | 0.699 | DeFMO |
| 3D | TbD-3D | TIoU | 0.879 | DeFMO |
| 3D | TbD | PSNR | 25.57 | DeFMO |
| 3D | TbD | SSIM | 0.602 | DeFMO |
| 3D | TbD | TIoU | 0.55 | DeFMO |
| 3D Face Animation | Falling Objects | PSNR | 26.83 | DeFMO |
| 3D Face Animation | Falling Objects | SSIM | 0.753 | DeFMO |
| 3D Face Animation | Falling Objects | TIoU | 0.684 | DeFMO |
| 3D Face Animation | TbD-3D | PSNR | 26.23 | DeFMO |
| 3D Face Animation | TbD-3D | SSIM | 0.699 | DeFMO |
| 3D Face Animation | TbD-3D | TIoU | 0.879 | DeFMO |
| 3D Face Animation | TbD | PSNR | 25.57 | DeFMO |
| 3D Face Animation | TbD | SSIM | 0.602 | DeFMO |
| 3D Face Animation | TbD | TIoU | 0.55 | DeFMO |
| 2D Human Pose Estimation | Falling Objects | PSNR | 26.83 | DeFMO |
| 2D Human Pose Estimation | Falling Objects | SSIM | 0.753 | DeFMO |
| 2D Human Pose Estimation | Falling Objects | TIoU | 0.684 | DeFMO |
| 2D Human Pose Estimation | TbD-3D | PSNR | 26.23 | DeFMO |
| 2D Human Pose Estimation | TbD-3D | SSIM | 0.699 | DeFMO |
| 2D Human Pose Estimation | TbD-3D | TIoU | 0.879 | DeFMO |
| 2D Human Pose Estimation | TbD | PSNR | 25.57 | DeFMO |
| 2D Human Pose Estimation | TbD | SSIM | 0.602 | DeFMO |
| 2D Human Pose Estimation | TbD | TIoU | 0.55 | DeFMO |
| 3D Absolute Human Pose Estimation | Falling Objects | PSNR | 26.83 | DeFMO |
| 3D Absolute Human Pose Estimation | Falling Objects | SSIM | 0.753 | DeFMO |
| 3D Absolute Human Pose Estimation | Falling Objects | TIoU | 0.684 | DeFMO |
| 3D Absolute Human Pose Estimation | TbD-3D | PSNR | 26.23 | DeFMO |
| 3D Absolute Human Pose Estimation | TbD-3D | SSIM | 0.699 | DeFMO |
| 3D Absolute Human Pose Estimation | TbD-3D | TIoU | 0.879 | DeFMO |
| 3D Absolute Human Pose Estimation | TbD | PSNR | 25.57 | DeFMO |
| 3D Absolute Human Pose Estimation | TbD | SSIM | 0.602 | DeFMO |
| 3D Absolute Human Pose Estimation | TbD | TIoU | 0.55 | DeFMO |
| Video Super-Resolution | Falling Objects | PSNR | 26.83 | DeFMO |
| Video Super-Resolution | Falling Objects | SSIM | 0.753 | DeFMO |
| Video Super-Resolution | Falling Objects | TIoU | 0.684 | DeFMO |
| Video Super-Resolution | TbD-3D | PSNR | 26.23 | DeFMO |
| Video Super-Resolution | TbD-3D | SSIM | 0.699 | DeFMO |
| Video Super-Resolution | TbD-3D | TIoU | 0.879 | DeFMO |
| Video Super-Resolution | TbD | PSNR | 25.57 | DeFMO |
| Video Super-Resolution | TbD | SSIM | 0.602 | DeFMO |
| Video Super-Resolution | TbD | TIoU | 0.55 | DeFMO |
| 3D Object Super-Resolution | Falling Objects | PSNR | 26.83 | DeFMO |
| 3D Object Super-Resolution | Falling Objects | SSIM | 0.753 | DeFMO |
| 3D Object Super-Resolution | Falling Objects | TIoU | 0.684 | DeFMO |
| 3D Object Super-Resolution | TbD-3D | PSNR | 26.23 | DeFMO |
| 3D Object Super-Resolution | TbD-3D | SSIM | 0.699 | DeFMO |
| 3D Object Super-Resolution | TbD-3D | TIoU | 0.879 | DeFMO |
| 3D Object Super-Resolution | TbD | PSNR | 25.57 | DeFMO |
| 3D Object Super-Resolution | TbD | SSIM | 0.602 | DeFMO |
| 3D Object Super-Resolution | TbD | TIoU | 0.55 | DeFMO |
| 1 Image, 2*2 Stitchi | Falling Objects | PSNR | 26.83 | DeFMO |
| 1 Image, 2*2 Stitchi | Falling Objects | SSIM | 0.753 | DeFMO |
| 1 Image, 2*2 Stitchi | Falling Objects | TIoU | 0.684 | DeFMO |
| 1 Image, 2*2 Stitchi | TbD-3D | PSNR | 26.23 | DeFMO |
| 1 Image, 2*2 Stitchi | TbD-3D | SSIM | 0.699 | DeFMO |
| 1 Image, 2*2 Stitchi | TbD-3D | TIoU | 0.879 | DeFMO |
| 1 Image, 2*2 Stitchi | TbD | PSNR | 25.57 | DeFMO |
| 1 Image, 2*2 Stitchi | TbD | SSIM | 0.602 | DeFMO |
| 1 Image, 2*2 Stitchi | TbD | TIoU | 0.55 | DeFMO |