Pengju Liu, Hongzhi Zhang, Kai Zhang, Liang Lin, WangMeng Zuo
The tradeoff between receptive field size and efficiency is a crucial issue in low level vision. Plain convolutional networks (CNNs) generally enlarge the receptive field at the expense of computational cost. Recently, dilated filtering has been adopted to address this issue. But it suffers from gridding effect, and the resulting receptive field is only a sparse sampling of input image with checkerboard patterns. In this paper, we present a novel multi-level wavelet CNN (MWCNN) model for better tradeoff between receptive field size and computational efficiency. With the modified U-Net architecture, wavelet transform is introduced to reduce the size of feature maps in the contracting subnetwork. Furthermore, another convolutional layer is further used to decrease the channels of feature maps. In the expanding subnetwork, inverse wavelet transform is then deployed to reconstruct the high resolution feature maps. Our MWCNN can also be explained as the generalization of dilated filtering and subsampling, and can be applied to many image restoration tasks. The experimental results clearly show the effectiveness of MWCNN for image denoising, single image super-resolution, and JPEG image artifacts removal.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Super-Resolution | BSD100 - 2x upscaling | PSNR | 32.23 | MWCNN |
| Super-Resolution | Set14 - 3x upscaling | PSNR | 30.16 | MWCNN |
| Super-Resolution | Set14 - 2x upscaling | PSNR | 33.7 | MWCNN |
| Super-Resolution | Set14 - 4x upscaling | PSNR | 28.41 | MWCNN |
| Super-Resolution | Set14 - 4x upscaling | SSIM | 0.7816 | MWCNN |
| Super-Resolution | Set5 - 3x upscaling | PSNR | 34.17 | MWCNN |
| Super-Resolution | Urban100 - 2x upscaling | PSNR | 32.3 | MWCNN |
| Super-Resolution | Set5 - 2x upscaling | PSNR | 37.91 | MWCNN |
| Super-Resolution | Urban100 - 4x upscaling | PSNR | 26.27 | MWCNN |
| Super-Resolution | Urban100 - 4x upscaling | SSIM | 0.789 | MWCNN |
| Super-Resolution | Urban100 - 3x upscaling | PSNR | 28.13 | MWCNN |
| Super-Resolution | BSD100 - 4x upscaling | PSNR | 27.62 | MWCNN |
| Super-Resolution | BSD100 - 4x upscaling | SSIM | 0.7355 | MWCNN |
| Super-Resolution | BSD100 - 3x upscaling | PSNR | 29.12 | MWCNN |
| Image Restoration | Live1 (Quality 10 Grayscale) | PSNR | 29.69 | MWCNN |
| Image Restoration | Live1 (Quality 10 Grayscale) | PSNR-B | 29.39 | MWCNN |
| Image Restoration | Live1 (Quality 10 Grayscale) | SSIM | 0.8357 | MWCNN |
| Image Restoration | Classic5 (Quality 30 Grayscale) | PSNR | 33.43 | MWCNN |
| Image Restoration | LIVE1 (Quality 40 Grayscale) | PSNR | 34.45 | MWCNN |
| Image Restoration | ICB (Quality 20 Color) | PSNR | 32.79 | MWCNN |
| Image Restoration | ICB (Quality 20 Color) | PSNR-B | 33.32 | MWCNN |
| Image Restoration | ICB (Quality 20 Color) | SSIM | 0.812 | MWCNN |
| Image Restoration | LIVE1 (Quality 10 Color) | PSNR | 27.45 | MWCNN |
| Image Restoration | LIVE1 (Quality 10 Color) | PSNR-B | 27.44 | MWCNN |
| Image Restoration | LIVE1 (Quality 10 Color) | SSIM | 0.808 | MWCNN |
| Image Restoration | Classic5 (Quality 40 Grayscale) | PSNR | 34.27 | MWCNN |
| Image Restoration | Classic5 (Quality 20 Grayscale) | PSNR | 32.16 | MWCNN |
| Image Restoration | ICB (Quality 30 Color) | PSNR | 34.11 | MWCNN |
| Image Restoration | ICB (Quality 30 Color) | PSNR-B | 34.69 | MWCNN |
| Image Restoration | ICB (Quality 30 Color) | SSIM | 0.845 | MWCNN |
| Image Restoration | LIVE1 (Quality 30 Grayscale) | PSNR | 33.45 | MWCNN |
| Image Restoration | LIVE1 (Quality 20 Color) | PSNR | 29.8 | MWCNN |
| Image Restoration | LIVE1 (Quality 20 Color) | PSNR-B | 29.78 | MWCNN |
| Image Restoration | LIVE1 (Quality 20 Color) | SSIM | 0.877 | MWCNN |
| Image Restoration | ICB (Quality 20 Grayscale) | PSNR | 36.56 | MWCNN |
| Image Restoration | ICB (Quality 20 Grayscale) | PSNR-B | 36.44 | MWCNN |
| Image Restoration | ICB (Quality 20 Grayscale) | SSIM | 0.902 | MWCNN |
| Image Restoration | ICB (Quality 10 Grayscale) | PSNR | 34.12 | MWCNN |
| Image Restoration | ICB (Quality 10 Grayscale) | PSNR-B | 34.06 | MWCNN |
| Image Restoration | ICB (Quality 10 Grayscale) | SSIM | 0.884 | MWCNN |
| Image Restoration | ICB (Quality 10 Color) | PSNR | 30.76 | MWCNN |
| Image Restoration | ICB (Quality 10 Color) | PSNR-B | 31.21 | MWCNN |
| Image Restoration | ICB (Quality 10 Color) | SSIM | 0.779 | MWCNN |
| Image Restoration | Classic5 (Quality 10 Grayscale) | PSNR | 30.01 | MWCNN |
| Image Restoration | LIVE1 (Quality 20 Grayscale) | PSNR | 32.04 | MWCNN |
| Image Restoration | LIVE1 (Quality 20 Grayscale) | PSNR-B | 31.83 | MWCNN |
| Image Restoration | LIVE1 (Quality 20 Grayscale) | SSIM | 0.8989 | MWCNN |
| Denoising | Urban100 sigma25 | PSNR | 30.66 | MWCNN |
| Denoising | Urban100 sigma15 | PSNR | 33.17 | MWCNN |
| Denoising | Set12 sigma50 | PSNR | 27.74 | MWCNN |
| Denoising | Urban100 sigma50 | PSNR | 27.42 | MWCNN |
| Denoising | Set12 sigma15 | PSNR | 33.15 | MWCNN |
| Denoising | BSD68 sigma15 | PSNR | 31.86 | MWCNN |
| Denoising | BSD68 sigma25 | PSNR | 29.41 | MWCNN |
| Denoising | Set12 sigma25 | PSNR | 30.79 | MWCNN |
| Denoising | BSD68 sigma50 | PSNR | 26.53 | MWCNN |
| Image Super-Resolution | BSD100 - 2x upscaling | PSNR | 32.23 | MWCNN |
| Image Super-Resolution | Set14 - 3x upscaling | PSNR | 30.16 | MWCNN |
| Image Super-Resolution | Set14 - 2x upscaling | PSNR | 33.7 | MWCNN |
| Image Super-Resolution | Set14 - 4x upscaling | PSNR | 28.41 | MWCNN |
| Image Super-Resolution | Set14 - 4x upscaling | SSIM | 0.7816 | MWCNN |
| Image Super-Resolution | Set5 - 3x upscaling | PSNR | 34.17 | MWCNN |
| Image Super-Resolution | Urban100 - 2x upscaling | PSNR | 32.3 | MWCNN |
| Image Super-Resolution | Set5 - 2x upscaling | PSNR | 37.91 | MWCNN |
| Image Super-Resolution | Urban100 - 4x upscaling | PSNR | 26.27 | MWCNN |
| Image Super-Resolution | Urban100 - 4x upscaling | SSIM | 0.789 | MWCNN |
| Image Super-Resolution | Urban100 - 3x upscaling | PSNR | 28.13 | MWCNN |
| Image Super-Resolution | BSD100 - 4x upscaling | PSNR | 27.62 | MWCNN |
| Image Super-Resolution | BSD100 - 4x upscaling | SSIM | 0.7355 | MWCNN |
| Image Super-Resolution | BSD100 - 3x upscaling | PSNR | 29.12 | MWCNN |
| 3D Architecture | Urban100 sigma25 | PSNR | 30.66 | MWCNN |
| 3D Architecture | Urban100 sigma15 | PSNR | 33.17 | MWCNN |
| 3D Architecture | Set12 sigma50 | PSNR | 27.74 | MWCNN |
| 3D Architecture | Urban100 sigma50 | PSNR | 27.42 | MWCNN |
| 3D Architecture | Set12 sigma15 | PSNR | 33.15 | MWCNN |
| 3D Architecture | BSD68 sigma15 | PSNR | 31.86 | MWCNN |
| 3D Architecture | BSD68 sigma25 | PSNR | 29.41 | MWCNN |
| 3D Architecture | Set12 sigma25 | PSNR | 30.79 | MWCNN |
| 3D Architecture | BSD68 sigma50 | PSNR | 26.53 | MWCNN |
| 10-shot image generation | Live1 (Quality 10 Grayscale) | PSNR | 29.69 | MWCNN |
| 10-shot image generation | Live1 (Quality 10 Grayscale) | PSNR-B | 29.39 | MWCNN |
| 10-shot image generation | Live1 (Quality 10 Grayscale) | SSIM | 0.8357 | MWCNN |
| 10-shot image generation | Classic5 (Quality 30 Grayscale) | PSNR | 33.43 | MWCNN |
| 10-shot image generation | LIVE1 (Quality 40 Grayscale) | PSNR | 34.45 | MWCNN |
| 10-shot image generation | ICB (Quality 20 Color) | PSNR | 32.79 | MWCNN |
| 10-shot image generation | ICB (Quality 20 Color) | PSNR-B | 33.32 | MWCNN |
| 10-shot image generation | ICB (Quality 20 Color) | SSIM | 0.812 | MWCNN |
| 10-shot image generation | LIVE1 (Quality 10 Color) | PSNR | 27.45 | MWCNN |
| 10-shot image generation | LIVE1 (Quality 10 Color) | PSNR-B | 27.44 | MWCNN |
| 10-shot image generation | LIVE1 (Quality 10 Color) | SSIM | 0.808 | MWCNN |
| 10-shot image generation | Classic5 (Quality 40 Grayscale) | PSNR | 34.27 | MWCNN |
| 10-shot image generation | Classic5 (Quality 20 Grayscale) | PSNR | 32.16 | MWCNN |
| 10-shot image generation | ICB (Quality 30 Color) | PSNR | 34.11 | MWCNN |
| 10-shot image generation | ICB (Quality 30 Color) | PSNR-B | 34.69 | MWCNN |
| 10-shot image generation | ICB (Quality 30 Color) | SSIM | 0.845 | MWCNN |
| 10-shot image generation | LIVE1 (Quality 30 Grayscale) | PSNR | 33.45 | MWCNN |
| 10-shot image generation | LIVE1 (Quality 20 Color) | PSNR | 29.8 | MWCNN |
| 10-shot image generation | LIVE1 (Quality 20 Color) | PSNR-B | 29.78 | MWCNN |
| 10-shot image generation | LIVE1 (Quality 20 Color) | SSIM | 0.877 | MWCNN |
| 10-shot image generation | ICB (Quality 20 Grayscale) | PSNR | 36.56 | MWCNN |
| 10-shot image generation | ICB (Quality 20 Grayscale) | PSNR-B | 36.44 | MWCNN |
| 10-shot image generation | ICB (Quality 20 Grayscale) | SSIM | 0.902 | MWCNN |
| 10-shot image generation | ICB (Quality 10 Grayscale) | PSNR | 34.12 | MWCNN |
| 10-shot image generation | ICB (Quality 10 Grayscale) | PSNR-B | 34.06 | MWCNN |
| 10-shot image generation | ICB (Quality 10 Grayscale) | SSIM | 0.884 | MWCNN |
| 10-shot image generation | ICB (Quality 10 Color) | PSNR | 30.76 | MWCNN |
| 10-shot image generation | ICB (Quality 10 Color) | PSNR-B | 31.21 | MWCNN |
| 10-shot image generation | ICB (Quality 10 Color) | SSIM | 0.779 | MWCNN |
| 10-shot image generation | Classic5 (Quality 10 Grayscale) | PSNR | 30.01 | MWCNN |
| 10-shot image generation | LIVE1 (Quality 20 Grayscale) | PSNR | 32.04 | MWCNN |
| 10-shot image generation | LIVE1 (Quality 20 Grayscale) | PSNR-B | 31.83 | MWCNN |
| 10-shot image generation | LIVE1 (Quality 20 Grayscale) | SSIM | 0.8989 | MWCNN |
| 3D Object Super-Resolution | BSD100 - 2x upscaling | PSNR | 32.23 | MWCNN |
| 3D Object Super-Resolution | Set14 - 3x upscaling | PSNR | 30.16 | MWCNN |
| 3D Object Super-Resolution | Set14 - 2x upscaling | PSNR | 33.7 | MWCNN |
| 3D Object Super-Resolution | Set14 - 4x upscaling | PSNR | 28.41 | MWCNN |
| 3D Object Super-Resolution | Set14 - 4x upscaling | SSIM | 0.7816 | MWCNN |
| 3D Object Super-Resolution | Set5 - 3x upscaling | PSNR | 34.17 | MWCNN |
| 3D Object Super-Resolution | Urban100 - 2x upscaling | PSNR | 32.3 | MWCNN |
| 3D Object Super-Resolution | Set5 - 2x upscaling | PSNR | 37.91 | MWCNN |
| 3D Object Super-Resolution | Urban100 - 4x upscaling | PSNR | 26.27 | MWCNN |
| 3D Object Super-Resolution | Urban100 - 4x upscaling | SSIM | 0.789 | MWCNN |
| 3D Object Super-Resolution | Urban100 - 3x upscaling | PSNR | 28.13 | MWCNN |
| 3D Object Super-Resolution | BSD100 - 4x upscaling | PSNR | 27.62 | MWCNN |
| 3D Object Super-Resolution | BSD100 - 4x upscaling | SSIM | 0.7355 | MWCNN |
| 3D Object Super-Resolution | BSD100 - 3x upscaling | PSNR | 29.12 | MWCNN |
| 16k | BSD100 - 2x upscaling | PSNR | 32.23 | MWCNN |
| 16k | Set14 - 3x upscaling | PSNR | 30.16 | MWCNN |
| 16k | Set14 - 2x upscaling | PSNR | 33.7 | MWCNN |
| 16k | Set14 - 4x upscaling | PSNR | 28.41 | MWCNN |
| 16k | Set14 - 4x upscaling | SSIM | 0.7816 | MWCNN |
| 16k | Set5 - 3x upscaling | PSNR | 34.17 | MWCNN |
| 16k | Urban100 - 2x upscaling | PSNR | 32.3 | MWCNN |
| 16k | Set5 - 2x upscaling | PSNR | 37.91 | MWCNN |
| 16k | Urban100 - 4x upscaling | PSNR | 26.27 | MWCNN |
| 16k | Urban100 - 4x upscaling | SSIM | 0.789 | MWCNN |
| 16k | Urban100 - 3x upscaling | PSNR | 28.13 | MWCNN |
| 16k | BSD100 - 4x upscaling | PSNR | 27.62 | MWCNN |
| 16k | BSD100 - 4x upscaling | SSIM | 0.7355 | MWCNN |
| 16k | BSD100 - 3x upscaling | PSNR | 29.12 | MWCNN |