Eric R. Chan, Marco Monteiro, Petr Kellnhofer, Jiajun Wu, Gordon Wetzstein
We have witnessed rapid progress on 3D-aware image synthesis, leveraging recent advances in generative visual models and neural rendering. Existing approaches however fall short in two ways: first, they may lack an underlying 3D representation or rely on view-inconsistent rendering, hence synthesizing images that are not multi-view consistent; second, they often depend upon representation network architectures that are not expressive enough, and their results thus lack in image quality. We propose a novel generative model, named Periodic Implicit Generative Adversarial Networks ($\pi$-GAN or pi-GAN), for high-quality 3D-aware image synthesis. $\pi$-GAN leverages neural representations with periodic activation functions and volumetric rendering to represent scenes as view-consistent 3D representations with fine detail. The proposed approach obtains state-of-the-art results for 3D-aware image synthesis with multiple real and synthetic datasets.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Scene Generation | Replica | FID | 166.55 | pi-GAN |
| Scene Generation | Replica | SwAV-FID | 13.17 | pi-GAN |
| Scene Generation | AVD | FID | 98.76 | pi-GAN |
| Scene Generation | AVD | SwAV-FID | 9.54 | pi-GAN |
| Scene Generation | VizDoom | FID | 143.55 | pi-GAN |
| Scene Generation | VizDoom | SwAV-FID | 15.26 | pi-GAN |
| 16k | Replica | FID | 166.55 | pi-GAN |
| 16k | Replica | SwAV-FID | 13.17 | pi-GAN |
| 16k | AVD | FID | 98.76 | pi-GAN |
| 16k | AVD | SwAV-FID | 9.54 | pi-GAN |
| 16k | VizDoom | FID | 143.55 | pi-GAN |
| 16k | VizDoom | SwAV-FID | 15.26 | pi-GAN |