TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/A Simple Baseline for Video Restoration with Grouped Spati...

A Simple Baseline for Video Restoration with Grouped Spatial-temporal Shift

Dasong Li, Xiaoyu Shi, Yi Zhang, Ka Chun Cheung, Simon See, Xiaogang Wang, Hongwei Qin, Hongsheng Li

2022-06-22CVPR 2023 1DenoisingDeblurringOptical Flow EstimationVideo DenoisingVideo DeblurringVideo Restoration
PaperPDFCode(official)

Abstract

Video restoration, which aims to restore clear frames from degraded videos, has numerous important applications. The key to video restoration depends on utilizing inter-frame information. However, existing deep learning methods often rely on complicated network architectures, such as optical flow estimation, deformable convolution, and cross-frame self-attention layers, resulting in high computational costs. In this study, we propose a simple yet effective framework for video restoration. Our approach is based on grouped spatial-temporal shift, which is a lightweight and straightforward technique that can implicitly capture inter-frame correspondences for multi-frame aggregation. By introducing grouped spatial shift, we attain expansive effective receptive fields. Combined with basic 2D convolution, this simple framework can effectively aggregate inter-frame information. Extensive experiments demonstrate that our framework outperforms the previous state-of-the-art method, while using less than a quarter of its computational cost, on both video deblurring and video denoising tasks. These results indicate the potential for our approach to significantly reduce computational overhead while maintaining high-quality results. Code is avaliable at https://github.com/dasongli1/Shift-Net.

Results

TaskDatasetMetricValueModel
DeblurringDVDPSNR34.69GShift-Net
DeblurringDVDSSIM0.969GShift-Net
DeblurringGoProPSNR35.88GShift-Net
DeblurringGoProSSIM0.979GShift-Net
2D ClassificationDVDPSNR34.69GShift-Net
2D ClassificationDVDSSIM0.969GShift-Net
2D ClassificationGoProPSNR35.88GShift-Net
2D ClassificationGoProSSIM0.979GShift-Net
10-shot image generationDVDPSNR34.69GShift-Net
10-shot image generationDVDSSIM0.969GShift-Net
10-shot image generationGoProPSNR35.88GShift-Net
10-shot image generationGoProSSIM0.979GShift-Net
Blind Image DeblurringDVDPSNR34.69GShift-Net
Blind Image DeblurringDVDSSIM0.969GShift-Net
Blind Image DeblurringGoProPSNR35.88GShift-Net
Blind Image DeblurringGoProSSIM0.979GShift-Net

Related Papers

fastWDM3D: Fast and Accurate 3D Healthy Tissue Inpainting2025-07-17Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models2025-07-17Channel-wise Motion Features for Efficient Motion Segmentation2025-07-17Similarity-Guided Diffusion for Contrastive Sequential Recommendation2025-07-16HUG-VAS: A Hierarchical NURBS-Based Generative Model for Aortic Geometry Synthesis and Controllable Editing2025-07-15AirLLM: Diffusion Policy-based Adaptive LoRA for Remote Fine-Tuning of LLM over the Air2025-07-15Generative Latent Kernel Modeling for Blind Motion Deblurring2025-07-12An Efficient Approach for Muscle Segmentation and 3D Reconstruction Using Keypoint Tracking in MRI Scan2025-07-11