TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/ProPainter: Improving Propagation and Transformer for Vide...

ProPainter: Improving Propagation and Transformer for Video Inpainting

Shangchen Zhou, Chongyi Li, Kelvin C. K. Chan, Chen Change Loy

2023-09-07ICCV 2023 1Optical Flow EstimationVideo Inpainting
PaperPDFCode(official)CodeCode

Abstract

Flow-based propagation and spatiotemporal Transformer are two mainstream mechanisms in video inpainting (VI). Despite the effectiveness of these components, they still suffer from some limitations that affect their performance. Previous propagation-based approaches are performed separately either in the image or feature domain. Global image propagation isolated from learning may cause spatial misalignment due to inaccurate optical flow. Moreover, memory or computational constraints limit the temporal range of feature propagation and video Transformer, preventing exploration of correspondence information from distant frames. To address these issues, we propose an improved framework, called ProPainter, which involves enhanced ProPagation and an efficient Transformer. Specifically, we introduce dual-domain propagation that combines the advantages of image and feature warping, exploiting global correspondences reliably. We also propose a mask-guided sparse video Transformer, which achieves high efficiency by discarding unnecessary and redundant tokens. With these components, ProPainter outperforms prior arts by a large margin of 1.46 dB in PSNR while maintaining appealing efficiency.

Results

TaskDatasetMetricValueModel
3DYouTube-VOS 2018PSNR34.43ProPainter
3DYouTube-VOS 2018SSIM0.9735ProPainter
3DYouTube-VOS 2018VFID0.042ProPainter
3DHQVI (240p)LPIPS0.0388ProPainter
3DHQVI (240p)PSNR30.62ProPainter
3DHQVI (240p)SSIM0.9413ProPainter
3DHQVI (240p)VFID0.2128ProPainter
3DHQVI (480p)LPIPS0.0457ProPainter
3DHQVI (480p)PSNR30.69ProPainter
3DHQVI (480p)SSIM0.9414ProPainter
3DHQVI (480p)VFID0.0478ProPainter
Video InpaintingYouTube-VOS 2018PSNR34.43ProPainter
Video InpaintingYouTube-VOS 2018SSIM0.9735ProPainter
Video InpaintingYouTube-VOS 2018VFID0.042ProPainter
Video InpaintingHQVI (240p)LPIPS0.0388ProPainter
Video InpaintingHQVI (240p)PSNR30.62ProPainter
Video InpaintingHQVI (240p)SSIM0.9413ProPainter
Video InpaintingHQVI (240p)VFID0.2128ProPainter
Video InpaintingHQVI (480p)LPIPS0.0457ProPainter
Video InpaintingHQVI (480p)PSNR30.69ProPainter
Video InpaintingHQVI (480p)SSIM0.9414ProPainter
Video InpaintingHQVI (480p)VFID0.0478ProPainter

Related Papers

Channel-wise Motion Features for Efficient Motion Segmentation2025-07-17An Efficient Approach for Muscle Segmentation and 3D Reconstruction Using Keypoint Tracking in MRI Scan2025-07-11Learning to Track Any Points from Human Motion2025-07-08TLB-VFI: Temporal-Aware Latent Brownian Bridge Diffusion for Video Frame Interpolation2025-07-07MEMFOF: High-Resolution Training for Memory-Efficient Multi-Frame Optical Flow Estimation2025-06-29EndoFlow-SLAM: Real-Time Endoscopic SLAM with Flow-Constrained Gaussian Splatting2025-06-26WAFT: Warping-Alone Field Transforms for Optical Flow2025-06-26Video Virtual Try-on with Conditional Diffusion Transformer Inpainter2025-06-26