Pranav Jeevan, Neeraj Nixon, Amit Sethi
Recent advancements in single image super-resolution have been predominantly driven by token mixers and transformer architectures. WaveMixSR utilized the WaveMix architecture, employing a two-dimensional discrete wavelet transform for spatial token mixing, and achieved superior performance in super-resolution tasks with remarkable resource efficiency. In this work, we present an enhanced version of the WaveMixSR architecture by (1) replacing the traditional transposed convolution layer with a pixel shuffle operation and (2) implementing a multistage design for higher resolution tasks ($4\times$). Our experiments demonstrate that our enhanced model, WaveMixSR-V2, outperforms other architectures in multiple super-resolution tasks, achieving state-of-the-art results on the BSD100 dataset while also consuming fewer resources and exhibiting higher parameter efficiency, lower latency, and higher throughput. Our code is available at https://github.com/pranavphoenix/WaveMixSR.
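
To make the pixel shuffle operation mentioned above concrete, the following is a minimal pure-Python sketch of the rearrangement it performs: a feature map with `C*r*r` channels of size `H x W` becomes `C` channels of size `H*r x W*r`, with no learned weights in the upsampling step itself. The function name `pixel_shuffle` and the nested-list layout are assumptions made for illustration and are not taken from the WaveMixSR-V2 code. Chaining two $2\times$ rearrangements, as in the last lines, only hints at how a multistage $4\times$ path changes shapes; the actual model presumably places learned layers between stages, which are omitted here.

```python
def pixel_shuffle(x, r):
    """Rearrange a (C*r*r, H, W) nested list into a (C, H*r, W*r) nested list.

    Illustrative sketch only; not the authors' implementation.
    """
    channels_in = len(x)
    if channels_in % (r * r) != 0:
        raise ValueError("channel count must be divisible by r*r")
    c_out = channels_in // (r * r)
    h, w = len(x[0]), len(x[0][0])
    out = [[[0] * (w * r) for _ in range(h * r)] for _ in range(c_out)]
    for c in range(c_out):
        for i in range(r):          # row offset inside each r x r output cell
            for j in range(r):      # column offset inside each r x r output cell
                src = x[c * r * r + i * r + j]
                for y in range(h):
                    for z in range(w):
                        out[c][y * r + i][z * r + j] = src[y][z]
    return out


# Four 1x2 channels become one 2x4 channel (2x upscaling).
features = [[[1, 2]], [[3, 4]], [[5, 6]], [[7, 8]]]
upscaled = pixel_shuffle(features, 2)

# Shape-only illustration of two chained 2x stages (4x overall).
x16 = [[[k]] for k in range(16)]               # 16 channels, 1x1 each
stage1 = pixel_shuffle(x16, 2)                 # 4 channels, 2x2 each
stage2 = pixel_shuffle(stage1, 2)              # 1 channel, 4x4
```

Because the operation is a pure reindexing, the channel count of the preceding layer determines the upscaling factor; a transposed convolution would instead learn a separate upsampling kernel.
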
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Super-Resolution | BSD100 - 2x upscaling | PSNR | 33.12 | WaveMixSR-V2 |
| Super-Resolution | BSD100 - 2x upscaling | SSIM | 0.9326 | WaveMixSR-V2 |
| Super-Resolution | BSD100 - 4x upscaling | PSNR | 27.87 | WaveMixSR-V2 |
| Super-Resolution | BSD100 - 4x upscaling | SSIM | 0.764 | WaveMixSR-V2 |
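
For readers unfamiliar with the PSNR values reported in the table, the sketch below shows the standard definition, $10 \log_{10}(\mathrm{MAX}^2 / \mathrm{MSE})$, in decibels. It assumes single-channel images stored as nested lists with an 8-bit peak value of 255; the exact evaluation protocol behind the reported numbers (color space, border cropping) is not stated here, so this is an illustration of the metric rather than a reproduction of the benchmark. The function name `psnr` is hypothetical.

```python
import math


def psnr(reference, estimate, max_value=255.0):
    """Peak signal-to-noise ratio in dB between two equally sized 2-D images."""
    flat_ref = [p for row in reference for p in row]
    flat_est = [p for row in estimate for p in row]
    if len(flat_ref) != len(flat_est):
        raise ValueError("images must have the same number of pixels")
    mse = sum((a - b) ** 2 for a, b in zip(flat_ref, flat_est)) / len(flat_ref)
    if mse == 0:
        return math.inf  # identical images: PSNR is unbounded
    return 10.0 * math.log10(max_value ** 2 / mse)


# Maximal error on every pixel gives 0 dB; identical images give infinity.
worst = psnr([[0, 0]], [[255, 255]])
best = psnr([[10, 20]], [[10, 20]])
```

Higher PSNR means lower mean squared error against the ground-truth image, so the $2\times$ results are expected to score higher than the harder $4\times$ task.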