TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Flow-Guided Sparse Transformer for Video Deblurring

Flow-Guided Sparse Transformer for Video Deblurring

Jing Lin, Yuanhao Cai, Xiaowan Hu, Haoqian Wang, Youliang Yan, Xueyi Zou, Henghui Ding, Yulun Zhang, Radu Timofte, Luc van Gool

2022-01-06DeblurringOptical Flow EstimationVideo Deblurring
PaperPDFCode(official)

Abstract

Exploiting similar and sharper scene patches in spatio-temporal neighborhoods is critical for video deblurring. However, CNN-based methods show limitations in capturing long-range dependencies and modeling non-local self-similarity. In this paper, we propose a novel framework, Flow-Guided Sparse Transformer (FGST), for video deblurring. In FGST, we customize a self-attention module, Flow-Guided Sparse Window-based Multi-head Self-Attention (FGSW-MSA). For each $query$ element on the blurry reference frame, FGSW-MSA enjoys the guidance of the estimated optical flow to globally sample spatially sparse yet highly related $key$ elements corresponding to the same scene patch in neighboring frames. Besides, we present a Recurrent Embedding (RE) mechanism to transfer information from past frames and strengthen long-range temporal dependencies. Comprehensive experiments demonstrate that our proposed FGST outperforms state-of-the-art (SOTA) methods on both DVD and GOPRO datasets and even yields more visually pleasing results in real video deblurring. Code and pre-trained models are publicly available at https://github.com/linjing7/VR-Baseline

Results

TaskDatasetMetricValueModel
DeblurringDVDPSNR33.5FGST
DeblurringDVD PSNR33.03FGST
DeblurringGoProPSNR33.03FGST
DeblurringGoProSSIM0.964FGST
2D ClassificationDVDPSNR33.5FGST
2D ClassificationDVD PSNR33.03FGST
2D ClassificationGoProPSNR33.03FGST
2D ClassificationGoProSSIM0.964FGST
10-shot image generationDVDPSNR33.5FGST
10-shot image generationDVD PSNR33.03FGST
10-shot image generationGoProPSNR33.03FGST
10-shot image generationGoProSSIM0.964FGST
Blind Image DeblurringDVDPSNR33.5FGST
Blind Image DeblurringDVD PSNR33.03FGST
Blind Image DeblurringGoProPSNR33.03FGST
Blind Image DeblurringGoProSSIM0.964FGST

Related Papers

Channel-wise Motion Features for Efficient Motion Segmentation2025-07-17Generative Latent Kernel Modeling for Blind Motion Deblurring2025-07-12An Efficient Approach for Muscle Segmentation and 3D Reconstruction Using Keypoint Tracking in MRI Scan2025-07-11Learning to Track Any Points from Human Motion2025-07-08TLB-VFI: Temporal-Aware Latent Brownian Bridge Diffusion for Video Frame Interpolation2025-07-07MEMFOF: High-Resolution Training for Memory-Efficient Multi-Frame Optical Flow Estimation2025-06-29EAMamba: Efficient All-Around Vision State Space Model for Image Restoration2025-06-27EndoFlow-SLAM: Real-Time Endoscopic SLAM with Flow-Constrained Gaussian Splatting2025-06-26