WavePaint: Resource-efficient Token-mixer for Self-supervised Inpainting

Pranav Jeevan, Dharshan Sampath Kumar, Amit Sethi

2023-07-01Image Inpainting

Abstract

Image inpainting, which refers to the synthesis of missing regions in an image, can help restore occluded or degraded areas and also serve as a precursor task for self-supervision. The current state-of-the-art models for image inpainting are computationally heavy as they are based on transformer or CNN backbones that are trained in adversarial or diffusion settings. This paper diverges from vision transformers by using a computationally-efficient WaveMix-based fully convolutional architecture -- WavePaint. It uses a 2D-discrete wavelet transform (DWT) for spatial and multi-resolution token-mixing along with convolutional layers. The proposed model outperforms the current state-of-the-art models for image inpainting on reconstruction quality while also using less than half the parameter count and considerably lower training and evaluation times. Our model even outperforms current GAN-based architectures in CelebA-HQ dataset without using an adversarially trainable discriminator. Our work suggests that neural architectures that are modeled after natural image priors require fewer parameters and computations to achieve generalization comparable to transformers.

Results

Task	Dataset	Metric	Value	Model
Image Generation	ImageNet	FID	3.21	WavePaint
Image Generation	CelebA-HQ	FID	5.53	WavePaint
Image Inpainting	ImageNet	FID	3.21	WavePaint
Image Inpainting	CelebA-HQ	FID	5.53	WavePaint

Related Papers

RePaintGS: Reference-Guided Gaussian Splatting for Realistic and View-Consistent 3D Scene Inpainting2025-07-11 MTADiffusion: Mask Text Alignment Diffusion Model for Object Inpainting2025-06-30 3DeepRep: 3D Deep Low-rank Tensor Representation for Hyperspectral Image Inpainting2025-06-20 Geological Field Restoration through the Lens of Image Inpainting2025-06-05 DreamDance: Animating Character Art via Inpainting Stable Gaussian Worlds2025-05-30 Structure Disruption: Subverting Malicious Diffusion-Based Inpainting via Self-Attention Query Perturbation2025-05-26 Unsupervised Raindrop Removal from a Single Image using Conditional Diffusion Models2025-05-13 CaRaFFusion: Improving 2D Semantic Segmentation with Camera-Radar Point Cloud Fusion and Zero-Shot Image Inpainting2025-05-06