Yuanhao Cai, Hao Bian, Jing Lin, Haoqian Wang, Radu Timofte, Yulun Zhang
When enhancing low-light images, many deep learning algorithms are based on the Retinex theory. However, the Retinex model does not consider the corruptions hidden in the dark or introduced by the light-up process. Besides, these methods usually require a tedious multi-stage training pipeline and rely on convolutional neural networks, showing limitations in capturing long-range dependencies. In this paper, we formulate a simple yet principled One-stage Retinex-based Framework (ORF). ORF first estimates the illumination information to light up the low-light image and then restores the corruption to produce the enhanced image. We design an Illumination-Guided Transformer (IGT) that utilizes illumination representations to direct the modeling of non-local interactions of regions with different lighting conditions. By plugging IGT into ORF, we obtain our algorithm, Retinexformer. Comprehensive quantitative and qualitative experiments demonstrate that our Retinexformer significantly outperforms state-of-the-art methods on thirteen benchmarks. The user study and application on low-light object detection also reveal the latent practical values of our method. Code, models, and results are available at https://github.com/caiyuanhao1998/Retinexformer
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Image Enhancement | MIT-Adobe 5k | PSNR on proRGB | 25.98 | Retinexformer |
| Image Enhancement | MIT-Adobe 5k | PSNR on sRGB | 24.94 | Retinexformer |
| Image Enhancement | MIT-Adobe 5k | SSIM on proRGB | 0.957 | Retinexformer |
| Image Enhancement | MIT-Adobe 5k | SSIM on sRGB | 0.907 | Retinexformer |
| Image Enhancement | LOL | Average PSNR | 27.18 | Retinexformer_ |
| Image Enhancement | LOL | FLOPS (G) | 15.57 | Retinexformer_ |
| Image Enhancement | LOL | Params (M) | 1.61 | Retinexformer_ |
| Image Enhancement | LOL | SSIM | 0.85 | Retinexformer_ |
| Image Enhancement | LOL | Average PSNR | 25.16 | Retinexformer |
| Image Enhancement | LOL | FLOPS (G) | 15.57 | Retinexformer |
| Image Enhancement | LOL | Params (M) | 1.61 | Retinexformer |
| Image Enhancement | LOL | SSIM | 0.845 | Retinexformer |
| Image Enhancement | SDSD-indoor | PSNR | 29.77 | Retinexformer |
| Image Enhancement | LOL-v2 | Average PSNR | 22.8 | Retinexformer |
| Image Enhancement | LOL-v2 | SSIM | 0.84 | Retinexformer |
| Image Enhancement | LOLv2 | Average PSNR | 27.71 | Retinexformer |
| Image Enhancement | LOLv2 | SSIM | 0.856 | Retinexformer |
| Image Enhancement | DICM | User Study Score | 3.71 | Retinexformer |
| Image Enhancement | VV | User Study Score | 3.61 | Rextinexformer |
| Image Enhancement | NPE | User Study Score | 4.17 | Retinexformer |
| Image Enhancement | SMID | PSNR | 29.15 | Retinexformer |
| Image Enhancement | LOLv2-synthetic | Average PSNR | 29.04 | Retinexformer |
| Image Enhancement | LOLv2-synthetic | SSIM | 0.939 | Retinexformer |
| Image Enhancement | LIME | User Study Score | 4.3 | Rextinexformer |
| Image Enhancement | LOL-v2-synthetic | PSNR | 25.67 | Retinexformer |
| Image Enhancement | LOL-v2-synthetic | SSIM | 0.939 | Retinexformer |
| Image Enhancement | MEF | User Study Score | 3.91 | Retinexformer |
| Image Enhancement | SDSD-outdoor | PSNR | 29.84 | Retinexformer |
| Image Enhancement | MIT-Adobe FiveK | PSNR | 24.94 | Retinexformer |
| Image Enhancement | MIT-Adobe FiveK | SSIM | 0.907 | Retinexformer |
| Image Enhancement | SID | PSNR | 24.44 | Retinexformer |
| Image Enhancement | SID | SSIM | 0.68 | Retinexformer |
| Photo Retouching | MIT-Adobe 5k | PSNR | 24.94 | Retinexformer |
| Photo Retouching | MIT-Adobe 5k | SSIM | 0.907 | Retinexformer |
| Image Deblurring | LOL-Blur | Average PSNR | 22.904 | RetinexFormer |
| Image Deblurring | LOL-Blur | LPIPS | 0.236 | RetinexFormer |
| Image Deblurring | LOL-Blur | SSIM | 0.824 | RetinexFormer |
| 10-shot image generation | LOL-Blur | Average PSNR | 22.904 | RetinexFormer |
| 10-shot image generation | LOL-Blur | LPIPS | 0.236 | RetinexFormer |
| 10-shot image generation | LOL-Blur | SSIM | 0.824 | RetinexFormer |
| 1 Image, 2*2 Stitchi | LOL-Blur | Average PSNR | 22.904 | RetinexFormer |
| 1 Image, 2*2 Stitchi | LOL-Blur | LPIPS | 0.236 | RetinexFormer |
| 1 Image, 2*2 Stitchi | LOL-Blur | SSIM | 0.824 | RetinexFormer |
| 16k | LOL-Blur | Average PSNR | 22.904 | RetinexFormer |
| 16k | LOL-Blur | LPIPS | 0.236 | RetinexFormer |
| 16k | LOL-Blur | SSIM | 0.824 | RetinexFormer |