Lingbo Yang, Chang Liu, Pan Wang, Shanshe Wang, Peiran Ren, Siwei Ma, Wen Gao
Existing face restoration research typically relies on either a degradation prior or explicit guidance labels for training, which often limits generalization to real-world images with heterogeneous degradations and rich background content. In this paper, we investigate the more challenging and practical "dual-blind" version of the problem by lifting the requirements on both types of prior, termed "Face Renovation" (FR). Specifically, we formulate FR as a semantic-guided generation problem and tackle it with a collaborative suppression and replenishment (CSR) approach. This leads to HiFaceGAN, a multi-stage framework containing several nested CSR units that progressively replenish facial details based on hierarchical semantic guidance extracted from front-end content-adaptive suppression modules. Extensive experiments on both synthetic and real face images verify the superior performance of HiFaceGAN over a wide range of challenging restoration subtasks, demonstrating its versatility, robustness, and generalization ability for real-world face processing applications.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Super-Resolution | FFHQ 256 x 256 - 4x upscaling | FID | 5.36 | HiFaceGAN |
| Super-Resolution | FFHQ 256 x 256 - 4x upscaling | MS-SSIM | 0.971 | HiFaceGAN |
| Super-Resolution | FFHQ 256 x 256 - 4x upscaling | PSNR | 28.65 | HiFaceGAN |
| Super-Resolution | FFHQ 256 x 256 - 4x upscaling | SSIM | 0.816 | HiFaceGAN |
| Super-Resolution | FFHQ 512 x 512 - 4x upscaling | FED | 0.0716 | HiFaceGAN |
| Super-Resolution | FFHQ 512 x 512 - 4x upscaling | FID | 1.898 | HiFaceGAN |
| Super-Resolution | FFHQ 512 x 512 - 4x upscaling | LLE | 2.071 | HiFaceGAN |
| Super-Resolution | FFHQ 512 x 512 - 4x upscaling | LPIPS | 0.0723 | HiFaceGAN |
| Super-Resolution | FFHQ 512 x 512 - 4x upscaling | MS-SSIM | 0.971 | HiFaceGAN |
| Super-Resolution | FFHQ 512 x 512 - 4x upscaling | NIQE | 6.961 | HiFaceGAN |
| Super-Resolution | FFHQ 512 x 512 - 4x upscaling | PSNR | 30.824 | HiFaceGAN |
| Super-Resolution | FFHQ 512 x 512 - 4x upscaling | SSIM | 0.838 | HiFaceGAN |
| Super-Resolution | FFHQ 1024 x 1024 - 4x upscaling | FID | 1.978 | HiFaceGAN |
| Super-Resolution | FFHQ 1024 x 1024 - 4x upscaling | MS-SSIM | 0.975 | HiFaceGAN |
| Super-Resolution | FFHQ 1024 x 1024 - 4x upscaling | PSNR | 33.04 | HiFaceGAN |
| Super-Resolution | FFHQ 1024 x 1024 - 4x upscaling | SSIM | 0.875 | HiFaceGAN |
| Facial Recognition and Modelling | FFHQ 512 x 512 - 16x upscaling | FID | 11.389 | HiFaceGAN |
| Facial Recognition and Modelling | FFHQ 512 x 512 - 16x upscaling | LPIPS | 0.2449 | HiFaceGAN |
| Facial Recognition and Modelling | FFHQ 512 x 512 - 16x upscaling | NIQE | 6.767 | HiFaceGAN |
| Face Reconstruction | FFHQ 512 x 512 - 16x upscaling | FID | 11.389 | HiFaceGAN |
| Face Reconstruction | FFHQ 512 x 512 - 16x upscaling | LPIPS | 0.2449 | HiFaceGAN |
| Face Reconstruction | FFHQ 512 x 512 - 16x upscaling | NIQE | 6.767 | HiFaceGAN |
| 3D | FFHQ 512 x 512 - 16x upscaling | FID | 11.389 | HiFaceGAN |
| 3D | FFHQ 512 x 512 - 16x upscaling | LPIPS | 0.2449 | HiFaceGAN |
| 3D | FFHQ 512 x 512 - 16x upscaling | NIQE | 6.767 | HiFaceGAN |
| 3D Face Modelling | FFHQ 512 x 512 - 16x upscaling | FID | 11.389 | HiFaceGAN |
| 3D Face Modelling | FFHQ 512 x 512 - 16x upscaling | LPIPS | 0.2449 | HiFaceGAN |
| 3D Face Modelling | FFHQ 512 x 512 - 16x upscaling | NIQE | 6.767 | HiFaceGAN |
| Image Super-Resolution | FFHQ 256 x 256 - 4x upscaling | FID | 5.36 | HiFaceGAN |
| Image Super-Resolution | FFHQ 256 x 256 - 4x upscaling | MS-SSIM | 0.971 | HiFaceGAN |
| Image Super-Resolution | FFHQ 256 x 256 - 4x upscaling | PSNR | 28.65 | HiFaceGAN |
| Image Super-Resolution | FFHQ 256 x 256 - 4x upscaling | SSIM | 0.816 | HiFaceGAN |
| Image Super-Resolution | FFHQ 512 x 512 - 4x upscaling | FED | 0.0716 | HiFaceGAN |
| Image Super-Resolution | FFHQ 512 x 512 - 4x upscaling | FID | 1.898 | HiFaceGAN |
| Image Super-Resolution | FFHQ 512 x 512 - 4x upscaling | LLE | 2.071 | HiFaceGAN |
| Image Super-Resolution | FFHQ 512 x 512 - 4x upscaling | LPIPS | 0.0723 | HiFaceGAN |
| Image Super-Resolution | FFHQ 512 x 512 - 4x upscaling | MS-SSIM | 0.971 | HiFaceGAN |
| Image Super-Resolution | FFHQ 512 x 512 - 4x upscaling | NIQE | 6.961 | HiFaceGAN |
| Image Super-Resolution | FFHQ 512 x 512 - 4x upscaling | PSNR | 30.824 | HiFaceGAN |
| Image Super-Resolution | FFHQ 512 x 512 - 4x upscaling | SSIM | 0.838 | HiFaceGAN |
| Image Super-Resolution | FFHQ 1024 x 1024 - 4x upscaling | FID | 1.978 | HiFaceGAN |
| Image Super-Resolution | FFHQ 1024 x 1024 - 4x upscaling | MS-SSIM | 0.975 | HiFaceGAN |
| Image Super-Resolution | FFHQ 1024 x 1024 - 4x upscaling | PSNR | 33.04 | HiFaceGAN |
| Image Super-Resolution | FFHQ 1024 x 1024 - 4x upscaling | SSIM | 0.875 | HiFaceGAN |
| 3D Face Reconstruction | FFHQ 512 x 512 - 16x upscaling | FID | 11.389 | HiFaceGAN |
| 3D Face Reconstruction | FFHQ 512 x 512 - 16x upscaling | LPIPS | 0.2449 | HiFaceGAN |
| 3D Face Reconstruction | FFHQ 512 x 512 - 16x upscaling | NIQE | 6.767 | HiFaceGAN |
| Blind Face Restoration | CelebA-Test | Deg. | 42.18 | HiFaceGAN |
| Blind Face Restoration | CelebA-Test | FID | 66.09 | HiFaceGAN |
| Blind Face Restoration | CelebA-Test | LPIPS | 47.7 | HiFaceGAN |
| Blind Face Restoration | CelebA-Test | NIQE | 4.916 | HiFaceGAN |
| Blind Face Restoration | CelebA-Test | PSNR | 24.92 | HiFaceGAN |
| Blind Face Restoration | CelebA-Test | SSIM | 0.6195 | HiFaceGAN |
| 3D Object Super-Resolution | FFHQ 256 x 256 - 4x upscaling | FID | 5.36 | HiFaceGAN |
| 3D Object Super-Resolution | FFHQ 256 x 256 - 4x upscaling | MS-SSIM | 0.971 | HiFaceGAN |
| 3D Object Super-Resolution | FFHQ 256 x 256 - 4x upscaling | PSNR | 28.65 | HiFaceGAN |
| 3D Object Super-Resolution | FFHQ 256 x 256 - 4x upscaling | SSIM | 0.816 | HiFaceGAN |
| 3D Object Super-Resolution | FFHQ 512 x 512 - 4x upscaling | FED | 0.0716 | HiFaceGAN |
| 3D Object Super-Resolution | FFHQ 512 x 512 - 4x upscaling | FID | 1.898 | HiFaceGAN |
| 3D Object Super-Resolution | FFHQ 512 x 512 - 4x upscaling | LLE | 2.071 | HiFaceGAN |
| 3D Object Super-Resolution | FFHQ 512 x 512 - 4x upscaling | LPIPS | 0.0723 | HiFaceGAN |
| 3D Object Super-Resolution | FFHQ 512 x 512 - 4x upscaling | MS-SSIM | 0.971 | HiFaceGAN |
| 3D Object Super-Resolution | FFHQ 512 x 512 - 4x upscaling | NIQE | 6.961 | HiFaceGAN |
| 3D Object Super-Resolution | FFHQ 512 x 512 - 4x upscaling | PSNR | 30.824 | HiFaceGAN |
| 3D Object Super-Resolution | FFHQ 512 x 512 - 4x upscaling | SSIM | 0.838 | HiFaceGAN |
| 3D Object Super-Resolution | FFHQ 1024 x 1024 - 4x upscaling | FID | 1.978 | HiFaceGAN |
| 3D Object Super-Resolution | FFHQ 1024 x 1024 - 4x upscaling | MS-SSIM | 0.975 | HiFaceGAN |
| 3D Object Super-Resolution | FFHQ 1024 x 1024 - 4x upscaling | PSNR | 33.04 | HiFaceGAN |
| 3D Object Super-Resolution | FFHQ 1024 x 1024 - 4x upscaling | SSIM | 0.875 | HiFaceGAN |
| 16k | FFHQ 256 x 256 - 4x upscaling | FID | 5.36 | HiFaceGAN |
| 16k | FFHQ 256 x 256 - 4x upscaling | MS-SSIM | 0.971 | HiFaceGAN |
| 16k | FFHQ 256 x 256 - 4x upscaling | PSNR | 28.65 | HiFaceGAN |
| 16k | FFHQ 256 x 256 - 4x upscaling | SSIM | 0.816 | HiFaceGAN |
| 16k | FFHQ 512 x 512 - 4x upscaling | FED | 0.0716 | HiFaceGAN |
| 16k | FFHQ 512 x 512 - 4x upscaling | FID | 1.898 | HiFaceGAN |
| 16k | FFHQ 512 x 512 - 4x upscaling | LLE | 2.071 | HiFaceGAN |
| 16k | FFHQ 512 x 512 - 4x upscaling | LPIPS | 0.0723 | HiFaceGAN |
| 16k | FFHQ 512 x 512 - 4x upscaling | MS-SSIM | 0.971 | HiFaceGAN |
| 16k | FFHQ 512 x 512 - 4x upscaling | NIQE | 6.961 | HiFaceGAN |
| 16k | FFHQ 512 x 512 - 4x upscaling | PSNR | 30.824 | HiFaceGAN |
| 16k | FFHQ 512 x 512 - 4x upscaling | SSIM | 0.838 | HiFaceGAN |
| 16k | FFHQ 1024 x 1024 - 4x upscaling | FID | 1.978 | HiFaceGAN |
| 16k | FFHQ 1024 x 1024 - 4x upscaling | MS-SSIM | 0.975 | HiFaceGAN |
| 16k | FFHQ 1024 x 1024 - 4x upscaling | PSNR | 33.04 | HiFaceGAN |
| 16k | FFHQ 1024 x 1024 - 4x upscaling | SSIM | 0.875 | HiFaceGAN |
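
For readers unfamiliar with the PSNR values reported in the table, the following is a minimal sketch of the standard definition, PSNR = 10 · log10(MAX² / MSE), computed over flattened pixel values. The function name `psnr`, the 8-bit peak value of 255, and the flat-list input are illustrative assumptions; the exact evaluation protocol behind the numbers above (color space, cropping, averaging) is not stated here and may differ.

```python
import math
from typing import Sequence


def psnr(reference: Sequence[float], restored: Sequence[float],
         max_value: float = 255.0) -> float:
    """Peak signal-to-noise ratio in dB between two flattened images.

    Assumes both inputs hold pixel values on the same scale, with
    max_value as the largest possible value (255 for 8-bit images).
    """
    if len(reference) != len(restored):
        raise ValueError("inputs must have the same number of pixels")
    if len(reference) == 0:
        raise ValueError("inputs must not be empty")

    # Mean squared error over all pixels.
    mse = sum((r - o) ** 2 for r, o in zip(reference, restored)) / len(reference)

    # Identical images have zero error, so PSNR is unbounded.
    if mse == 0:
        return math.inf
    return 10.0 * math.log10(max_value ** 2 / mse)


# Toy example: every pixel is off by exactly 1, so MSE = 1.
reference_pixels = [10, 20, 30, 40]
restored_pixels = [11, 21, 31, 41]
print(round(psnr(reference_pixels, restored_pixels), 2))
```

Higher PSNR means lower pixel-wise error. It does not measure perceptual quality, which is why the table also lists metrics such as LPIPS, FID, and NIQE.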