Benjamin Attal, Jia-Bin Huang, Christian Richardt, Michael Zollhoefer, Johannes Kopf, Matthew O'Toole, Changil Kim
Volumetric scene representations enable photorealistic view synthesis for static scenes and form the basis of several existing 6-DoF video techniques. However, the volume rendering procedures that drive these representations necessitate careful trade-offs in terms of quality, rendering speed, and memory efficiency. In particular, existing methods fail to simultaneously achieve real-time performance, small memory footprint, and high-quality rendering for challenging real-world scenes. To address these issues, we present HyperReel -- a novel 6-DoF video representation. The two core components of HyperReel are: (1) a ray-conditioned sample prediction network that enables high-fidelity, high frame rate rendering at high resolutions and (2) a compact and memory-efficient dynamic volume representation. Our 6-DoF video pipeline achieves the best performance compared to prior and contemporary approaches in terms of visual quality with small memory requirements, while also rendering at up to 18 frames-per-second at megapixel resolution without any custom CUDA code.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Novel View Synthesis | DONeRF: Evaluation Dataset | PSNR | 35.1 | HyperReel |
| Novel View Synthesis | DONeRF: Evaluation Dataset | PSNR | 33.1 | Instant NGP |
| Novel View Synthesis | DONeRF: Evaluation Dataset | PSNR | 30.9 | NeRF |
| Novel View Synthesis | DONeRF: Evaluation Dataset | PSNR | 30.9 | AdaNeRF |
| Novel View Synthesis | DONeRF: Evaluation Dataset | PSNR | 30.8 | DoNeRF |
| Novel View Synthesis | DONeRF: Evaluation Dataset | PSNR | 29.8 | TermiNeRF |
| Novel View Synthesis | LLFF | PSNR | 26.2 | HyperReel |
| Novel View Synthesis | LLFF | PSNR | 25.7 | AdaNeRF |
| Novel View Synthesis | LLFF | PSNR | 25.6 | Instant NGP |
| Novel View Synthesis | LLFF | PSNR | 23.6 | TermiNeRF |
| Novel View Synthesis | LLFF | PSNR | 22.9 | DoNeRF |