Sung-Jin Cho, Seo-won Ji, Jun-Pyo Hong, Seung-Won Jung, Sung-Jea Ko
Coarse-to-fine strategies have been extensively used for the architecture design of single image deblurring networks. Conventional methods typically stack sub-networks with multi-scale input images and gradually improve sharpness of images from the bottom sub-network to the top sub-network, yielding inevitably high computational costs. Toward a fast and accurate deblurring network design, we revisit the coarse-to-fine strategy and present a multi-input multi-output U-net (MIMO-UNet). The MIMO-UNet has three distinct features. First, the single encoder of the MIMO-UNet takes multi-scale input images to ease the difficulty of training. Second, the single decoder of the MIMO-UNet outputs multiple deblurred images with different scales to mimic multi-cascaded U-nets using a single U-shaped network. Last, asymmetric feature fusion is introduced to merge multi-scale features in an efficient manner. Extensive experiments on the GoPro and RealBlur datasets demonstrate that the proposed network outperforms the state-of-the-art methods in terms of both accuracy and computational complexity. Source code is available for research purposes at https://github.com/chosj95/MIMO-UNet.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Deblurring | RealBlur-J | PSNR (sRGB) | 32.05 | MIMO-UNet++ |
| Deblurring | RealBlur-J | Params(M) | 16.1 | MIMO-UNet++ |
| Deblurring | RealBlur-J | SSIM (sRGB) | 0.921 | MIMO-UNet++ |
| Deblurring | GoPro | PSNR | 32.68 | MIMO-UNet++ |
| Deblurring | GoPro | SSIM | 0.959 | MIMO-UNet++ |
| Deblurring | RSBlur | Average PSNR | 33.37 | MIMO-UNet+ |
| Deblurring | RSBlur | Average PSNR | 32.73 | MIMO-UNet |
| 2D Classification | RealBlur-J | PSNR (sRGB) | 32.05 | MIMO-UNet++ |
| 2D Classification | RealBlur-J | Params(M) | 16.1 | MIMO-UNet++ |
| 2D Classification | RealBlur-J | SSIM (sRGB) | 0.921 | MIMO-UNet++ |
| 2D Classification | GoPro | PSNR | 32.68 | MIMO-UNet++ |
| 2D Classification | GoPro | SSIM | 0.959 | MIMO-UNet++ |
| 2D Classification | RSBlur | Average PSNR | 33.37 | MIMO-UNet+ |
| 2D Classification | RSBlur | Average PSNR | 32.73 | MIMO-UNet |
| Image Deblurring | GoPro | PSNR | 32.68 | MIMO-UNet++ |
| Image Deblurring | GoPro | Params (M) | 16.1 | MIMO-UNet++ |
| Image Deblurring | GoPro | SSIM | 0.959 | MIMO-UNet++ |
| 10-shot image generation | RealBlur-J | PSNR (sRGB) | 32.05 | MIMO-UNet++ |
| 10-shot image generation | RealBlur-J | Params(M) | 16.1 | MIMO-UNet++ |
| 10-shot image generation | RealBlur-J | SSIM (sRGB) | 0.921 | MIMO-UNet++ |
| 10-shot image generation | GoPro | PSNR | 32.68 | MIMO-UNet++ |
| 10-shot image generation | GoPro | SSIM | 0.959 | MIMO-UNet++ |
| 10-shot image generation | RSBlur | Average PSNR | 33.37 | MIMO-UNet+ |
| 10-shot image generation | RSBlur | Average PSNR | 32.73 | MIMO-UNet |
| 10-shot image generation | GoPro | PSNR | 32.68 | MIMO-UNet++ |
| 10-shot image generation | GoPro | Params (M) | 16.1 | MIMO-UNet++ |
| 10-shot image generation | GoPro | SSIM | 0.959 | MIMO-UNet++ |
| 1 Image, 2*2 Stitchi | GoPro | PSNR | 32.68 | MIMO-UNet++ |
| 1 Image, 2*2 Stitchi | GoPro | Params (M) | 16.1 | MIMO-UNet++ |
| 1 Image, 2*2 Stitchi | GoPro | SSIM | 0.959 | MIMO-UNet++ |
| 16k | GoPro | PSNR | 32.68 | MIMO-UNet++ |
| 16k | GoPro | Params (M) | 16.1 | MIMO-UNet++ |
| 16k | GoPro | SSIM | 0.959 | MIMO-UNet++ |
| Blind Image Deblurring | RealBlur-J | PSNR (sRGB) | 32.05 | MIMO-UNet++ |
| Blind Image Deblurring | RealBlur-J | Params(M) | 16.1 | MIMO-UNet++ |
| Blind Image Deblurring | RealBlur-J | SSIM (sRGB) | 0.921 | MIMO-UNet++ |
| Blind Image Deblurring | GoPro | PSNR | 32.68 | MIMO-UNet++ |
| Blind Image Deblurring | GoPro | SSIM | 0.959 | MIMO-UNet++ |
| Blind Image Deblurring | RSBlur | Average PSNR | 33.37 | MIMO-UNet+ |
| Blind Image Deblurring | RSBlur | Average PSNR | 32.73 | MIMO-UNet |