Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


Blur-aware Spatio-temporal Sparse Transformer for Video Deblurring

Huicong Zhang, Haozhe Xie, Hongxun Yao

Published: 2024-06-11 (CVPR 2024)
Tasks: Deblurring, Optical Flow Estimation, Video Deblurring
Links: Paper, PDF, Code (official)

Abstract

Video deblurring relies on leveraging information from other frames in the video sequence to restore the blurred regions in the current frame. Mainstream approaches employ bidirectional feature propagation, spatio-temporal transformers, or a combination of both to extract information from the video sequence. However, limitations in memory and computational resources constrain the temporal window length of the spatio-temporal transformer, preventing the extraction of longer temporal contextual information from the video sequence. Additionally, bidirectional feature propagation is highly sensitive to inaccurate optical flow in blurry frames, leading to error accumulation during the propagation process. To address these issues, we propose BSSTNet, a Blur-aware Spatio-temporal Sparse Transformer Network. It introduces a blur map, which converts the originally dense attention into a sparse form, enabling more extensive utilization of information throughout the entire video sequence. Specifically, BSSTNet (1) uses a longer temporal window in the transformer, leveraging information from more distant frames to restore the blurry pixels in the current frame, and (2) introduces bidirectional feature propagation guided by blur maps, which reduces error accumulation caused by blurry frames. The experimental results demonstrate that the proposed BSSTNet outperforms state-of-the-art methods on the GoPro and DVD datasets.
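The abstract's key idea, sparsifying attention with a blur map, can be sketched in a toy form. This is not the paper's implementation: the function name, the top-k selection rule, and the choice of which tokens the blur map keeps are all illustrative assumptions.

```python
import numpy as np

def blur_masked_attention(q, k, v, blur_map, top_ratio=0.25):
    """Toy sketch of blur-map-guided sparse attention.

    Instead of attending over all key/value tokens (dense attention),
    keep only the subset of positions ranked highest by a per-token
    blur score, shrinking cost so a longer temporal window fits in
    the same budget. All names and the top-k rule are illustrative.
    """
    n = k.shape[0]
    keep = max(1, int(n * top_ratio))
    # indices of the `keep` tokens with the highest blur-map scores
    idx = np.argsort(blur_map)[-keep:]
    k_s, v_s = k[idx], v[idx]                      # sparse key/value subset
    scores = q @ k_s.T / np.sqrt(q.shape[-1])      # scaled dot-product
    w = np.exp(scores - scores.max(axis=-1, keepdims=True))
    w /= w.sum(axis=-1, keepdims=True)             # softmax over kept tokens
    return w @ v_s
```

The point of the sketch is only the cost argument: attention is computed against `keep` tokens rather than all `n`, which is what lets a sparse transformer cover a longer temporal window under the same memory budget.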

Results

| Task        | Dataset | Metric | Value  | Model   |
|-------------|---------|--------|--------|---------|
| Deblurring  | DVD     | PSNR   | 34.95  | BSSTNet |
| Deblurring  | DVD     | SSIM   | 0.9703 | BSSTNet |
| Deblurring  | GoPro   | PSNR   | 35.98  | BSSTNet |
| Deblurring  | GoPro   | SSIM   | 0.9792 | BSSTNet |
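The table reports PSNR in decibels and SSIM on a 0–1 scale. For reference, PSNR between a ground-truth frame and a restored frame follows the standard formula (this helper is illustrative, not part of the paper's code):

```python
import numpy as np

def psnr(ref, est, max_val=1.0):
    """Peak signal-to-noise ratio in dB for images scaled to [0, max_val].

    PSNR = 10 * log10(max_val^2 / MSE); higher is better, and a perfect
    reconstruction (zero MSE) is reported as infinity.
    """
    mse = np.mean((np.asarray(ref, float) - np.asarray(est, float)) ** 2)
    if mse == 0:
        return float("inf")
    return 10.0 * np.log10(max_val ** 2 / mse)
```

A gain of roughly 1 dB PSNR, as between competing video-deblurring methods on GoPro, corresponds to a noticeable reduction in mean squared error.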

Related Papers

- Channel-wise Motion Features for Efficient Motion Segmentation (2025-07-17)
- Generative Latent Kernel Modeling for Blind Motion Deblurring (2025-07-12)
- An Efficient Approach for Muscle Segmentation and 3D Reconstruction Using Keypoint Tracking in MRI Scan (2025-07-11)
- Learning to Track Any Points from Human Motion (2025-07-08)
- TLB-VFI: Temporal-Aware Latent Brownian Bridge Diffusion for Video Frame Interpolation (2025-07-07)
- MEMFOF: High-Resolution Training for Memory-Efficient Multi-Frame Optical Flow Estimation (2025-06-29)
- EAMamba: Efficient All-Around Vision State Space Model for Image Restoration (2025-06-27)
- EndoFlow-SLAM: Real-Time Endoscopic SLAM with Flow-Constrained Gaussian Splatting (2025-06-26)