Amirhosein Ghasemabadi, Muhammad Kamran Janjua, Mohammad Salameh, Di Niu
One key challenge in video restoration is modeling the transition dynamics of video frames governed by motion. In this work, we propose TURTLE to learn the truncated causal history model for efficient and high-performing video restoration. Unlike traditional methods that process a range of contextual frames in parallel, TURTLE enhances efficiency by storing and summarizing a truncated history of the input frame's latent representation into an evolving historical state. This is achieved through a similarity-based retrieval mechanism that implicitly accounts for inter-frame motion and alignment. The causal design in TURTLE enables recurrence in inference through state-memorized historical features while allowing parallel training by sampling truncated video clips. We report new state-of-the-art results on a multitude of video restoration benchmark tasks, including video desnowing, nighttime video deraining, video raindrop and rain streak removal, video super-resolution, real-world and synthetic video deblurring, and blind video denoising, while reducing the computational cost compared to the best existing contextual methods on all these tasks.
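The idea of a truncated causal history with similarity-based retrieval can be illustrated with a small NumPy sketch. This is not the TURTLE implementation: the function names `retrieve_from_history` and `restore_causally`, the parameters `history_len` and `top_k`, the use of cosine similarity, and the additive fusion step are all assumptions chosen for illustration. Each frame is represented as a set of patch features, and each patch retrieves its most similar patches from a bounded buffer of past frames, which loosely mimics alignment without explicit motion estimation.

```python
import numpy as np
from collections import deque


def retrieve_from_history(query, history, top_k=2):
    """Aggregate the top-k most similar past patches for every query patch.

    query:   (N, C) patch features of the current frame.
    history: list of (N, C) arrays, one per stored past frame.
    Returns an (N, C) array; all zeros when the history is empty.
    """
    if not history:
        return np.zeros_like(query)
    keys = np.concatenate(history, axis=0)  # (T*N, C)
    # Cosine similarity between current patches and all stored patches.
    q = query / (np.linalg.norm(query, axis=1, keepdims=True) + 1e-8)
    k = keys / (np.linalg.norm(keys, axis=1, keepdims=True) + 1e-8)
    sim = q @ k.T  # (N, T*N)
    k_eff = min(top_k, sim.shape[1])
    # Indices of the k_eff largest similarities in each row.
    idx = np.argpartition(-sim, k_eff - 1, axis=1)[:, :k_eff]
    top_sim = np.take_along_axis(sim, idx, axis=1)
    # Softmax over the retained similarities gives aggregation weights.
    w = np.exp(top_sim - top_sim.max(axis=1, keepdims=True))
    w /= w.sum(axis=1, keepdims=True)
    return np.einsum("nk,nkc->nc", w, keys[idx])


def restore_causally(frame_features, history_len=3, top_k=2):
    """Process frames one at a time using only a truncated past (hypothetical)."""
    history = deque(maxlen=history_len)  # oldest frame is dropped automatically
    outputs = []
    for feat in frame_features:
        context = retrieve_from_history(feat, list(history), top_k)
        outputs.append(feat + context)  # placeholder fusion, an assumption
        history.append(feat)
    return outputs
```

Because the buffer has a fixed maximum length, the per-frame cost in this sketch stays bounded regardless of video length, and the loop only ever looks backward, which is what makes frame-by-frame inference possible.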
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Deblurring | GoPro | PSNR | 34.5 | Turtle |
| Deblurring | GoPro | SSIM | 0.972 | Turtle |
| Deblurring | Beam-Splitter Deblurring (BSD) | PSNR | 33.58 | Turtle |
| Rain Removal | Nightrain | PSNR | 29.26 | Turtle |
| Video Denoising | Set8 (σ = 50) | PSNR | 30.29 | Turtle |
| Video deraining | VRDS | PSNR | 32.01 | Turtle |
| Video deraining | VRDS | SSIM | 0.959 | Turtle |