Gaussian Splatting Decoder for 3D-aware Generative Adversarial Networks (2404.10625v2)
Abstract: NeRF-based 3D-aware Generative Adversarial Networks (GANs) such as EG3D or GIRAFFE have shown very high rendering quality combined with large representational variety. However, rendering with Neural Radiance Fields poses challenges for 3D applications. First, the significant computational demands of NeRF rendering preclude its use on low-power devices, such as mobile phones and VR/AR headsets. Second, implicit representations based on neural networks are difficult to incorporate into explicit 3D scenes, such as VR environments or video games. 3D Gaussian Splatting (3DGS) overcomes these limitations by providing an explicit 3D representation that can be rendered efficiently at high frame rates. In this work, we present a novel approach that combines the high rendering quality of NeRF-based 3D-aware GANs with the flexibility and computational advantages of 3DGS. By training a decoder that maps implicit NeRF representations to explicit 3D Gaussian Splatting attributes, we can integrate the representational diversity and quality of 3D GANs into the ecosystem of 3D Gaussian Splatting for the first time. Additionally, our approach allows for high-resolution GAN inversion and real-time GAN editing with 3D Gaussian Splatting scenes. Project page: florian-barthel.github.io/gaussian_decoder
Authors: Florian Barthel, Arian Beckmann, Wieland Morgenstern, Anna Hilsmann, Peter Eisert