ProPainter: Improving Propagation and Transformer for Video Inpainting

Shangchen Zhou, Chongyi Li, Kelvin C. K. Chan, Chen Change Loy

2023-09-07 | ICCV 2023 | Optical Flow Estimation, Video Inpainting

Abstract

Flow-based propagation and spatiotemporal Transformer are two mainstream mechanisms in video inpainting (VI). Despite the effectiveness of these components, they still suffer from some limitations that affect their performance. Previous propagation-based approaches are performed separately either in the image or feature domain. Global image propagation isolated from learning may cause spatial misalignment due to inaccurate optical flow. Moreover, memory or computational constraints limit the temporal range of feature propagation and video Transformer, preventing exploration of correspondence information from distant frames. To address these issues, we propose an improved framework, called ProPainter, which involves enhanced ProPagation and an efficient Transformer. Specifically, we introduce dual-domain propagation that combines the advantages of image and feature warping, exploiting global correspondences reliably. We also propose a mask-guided sparse video Transformer, which achieves high efficiency by discarding unnecessary and redundant tokens. With these components, ProPainter outperforms prior arts by a large margin of 1.46 dB in PSNR while maintaining appealing efficiency.
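To make the idea of discarding unnecessary tokens concrete, here is a minimal sketch of mask-guided token selection. It is not the authors' implementation: the abstract does not say how tokens are chosen, so the rule used here (keep only the patches that contain at least one masked pixel) is an assumption, and the function name `masked_patch_indices` and the `patch` parameter are hypothetical.

```python
def masked_patch_indices(mask, patch):
    """Return row-major indices of patch x patch tiles that contain a masked pixel.

    Assumption (not from the paper): a token is "necessary" only if its
    tile overlaps the hole region, i.e. the mask is nonzero somewhere in it.
    """
    height, width = len(mask), len(mask[0])
    if height % patch or width % patch:
        raise ValueError("mask dimensions must be divisible by the patch size")
    cols = width // patch  # number of tiles per row
    keep = []
    for top in range(0, height, patch):
        for left in range(0, width, patch):
            tile_has_hole = any(
                mask[y][x]
                for y in range(top, top + patch)
                for x in range(left, left + patch)
            )
            if tile_has_hole:
                keep.append((top // patch) * cols + left // patch)
    return keep


# 4x4 mask with two masked pixels, split into four 2x2 tiles.
mask = [
    [0, 0, 0, 0],
    [0, 1, 0, 0],
    [0, 0, 0, 0],
    [0, 0, 0, 1],
]
kept = masked_patch_indices(mask, patch=2)
print(kept)  # [0, 3]
```

In this toy case only two of the four tiles would be passed on to attention, which is the kind of saving a sparse scheme aims for when holes cover a small part of the frame.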

Results

Task	Dataset	Metric	Value	Model
Video Inpainting	YouTube-VOS 2018	PSNR	34.43	ProPainter
Video Inpainting	YouTube-VOS 2018	SSIM	0.9735	ProPainter
Video Inpainting	YouTube-VOS 2018	VFID	0.042	ProPainter
Video Inpainting	HQVI (240p)	LPIPS	0.0388	ProPainter
Video Inpainting	HQVI (240p)	PSNR	30.62	ProPainter
Video Inpainting	HQVI (240p)	SSIM	0.9413	ProPainter
Video Inpainting	HQVI (240p)	VFID	0.2128	ProPainter
Video Inpainting	HQVI (480p)	LPIPS	0.0457	ProPainter
Video Inpainting	HQVI (480p)	PSNR	30.69	ProPainter
Video Inpainting	HQVI (480p)	SSIM	0.9414	ProPainter
Video Inpainting	HQVI (480p)	VFID	0.0478	ProPainter
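
The PSNR values above are in decibels. As a reading aid, the sketch below computes PSNR from its standard definition, 10 * log10(MAX^2 / MSE), and converts a dB gain into the corresponding reduction in mean squared error. It assumes 8-bit pixel values stored in flat lists; the evaluation protocol behind the table (for example, how frames are averaged) is not stated here, so this is an illustration and not the benchmark code. The helper names `psnr` and `mse_ratio_for_gain` are hypothetical.

```python
import math


def psnr(reference, estimate, max_value=255.0):
    """Peak signal-to-noise ratio in dB between two equal-length pixel sequences."""
    if len(reference) != len(estimate) or not reference:
        raise ValueError("inputs must be non-empty and of equal length")
    mse = sum((r - e) ** 2 for r, e in zip(reference, estimate)) / len(reference)
    if mse == 0:
        return math.inf  # identical signals
    return 10.0 * math.log10(max_value ** 2 / mse)


def mse_ratio_for_gain(gain_db):
    """Factor by which MSE shrinks for a given PSNR gain in dB."""
    return 10.0 ** (gain_db / 10.0)


# The 1.46 dB margin quoted in the abstract corresponds to roughly 1.40x lower MSE.
print(round(mse_ratio_for_gain(1.46), 2))  # 1.4
```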