Compact 3D Scene Representation via Self-Organizing Gaussian Grids (2312.13299v2)
Abstract: 3D Gaussian Splatting has recently emerged as a highly promising technique for modeling of static 3D scenes. In contrast to Neural Radiance Fields, it utilizes efficient rasterization allowing for very fast rendering at high-quality. However, the storage size is significantly higher, which hinders practical deployment, e.g. on resource constrained devices. In this paper, we introduce a compact scene representation organizing the parameters of 3D Gaussian Splatting (3DGS) into a 2D grid with local homogeneity, ensuring a drastic reduction in storage requirements without compromising visual quality during rendering. Central to our idea is the explicit exploitation of perceptual redundancies present in natural scenes. In essence, the inherent nature of a scene allows for numerous permutations of Gaussian parameters to equivalently represent it. To this end, we propose a novel highly parallel algorithm that regularly arranges the high-dimensional Gaussian parameters into a 2D grid while preserving their neighborhood structure. During training, we further enforce local smoothness between the sorted parameters in the grid. The uncompressed Gaussians use the same structure as 3DGS, ensuring a seamless integration with established renderers. Our method achieves a reduction factor of 17x to 42x in size for complex scenes with no increase in training time, marking a substantial leap forward in the domain of 3D scene distribution and consumption. Additional information can be found on our project page: https://fraunhoferhhi.github.io/Self-Organizing-Gaussians/
- Mip-NeRF 360: Unbounded anti-aliased neural radiance fields. Proc. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 5460–5469, 2022.
- Improved evaluation and generation of grid layouts using distance preservation quality and linear assignment sorting. Computer Graphics Forum, 42(1):261–276, 2023.
- Depth synthesis and local warps for plausible image-based navigation. ACM Trans. on Graphics, 32(3), 2013.
- TensoRF: Tensorial radiance fields. In Proc. European Conference on Computer Vision (ECCV), 2022.
- Structure from motion without correspondence. In Proc. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 557–564, 2000.
- Compressing explicit voxel grid representations: Fast NeRFs become also small. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), pages 1236–1245, 2023.
- Plenoxels: Radiance fields without neural networks. In Proc. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 5491–5500, 2022.
- FastNeRF: High-fidelity neural rendering at 200fps. In Proc. IEEE/CVF International Conference on Computer Vision (ICCV), pages 14326–14335, 2021.
- Multi-view stereo for community photo collections. In Proc. IEEE/CVF International Conference on Computer Vision (ICCV), pages 1–8, 2007.
- The lumigraph. In Proc. of the 23rd Annual Conference on Computer Graphics and Interactive Techniques (SIGGRAPH), page 43–54, 1996.
- An overview of ongoing point cloud compression standardization activities: video-based (v-pcc) and geometry-based (g-pcc). APSIPA Trans. on Signal and Information Processing, 9, 2020.
- Baking neural radiance fields for real-time view synthesis. In Proc. IEEE/CVF International Conference on Computer Vision (ICCV), pages 5855–5864, 2021.
- ReLU Fields: The little non-linearity that could. ACM Trans. on Graphics, 2022.
- 3d gaussian splatting for real-time radiance field rendering. ACM Trans. on Graphics, 42(4), 2023.
- Teuvo Kohonen. Self-Organized Formation of Topologically Correct Feature Maps. Biological Cybernetics, 43:59–69, 1982.
- Teuvo Kohonen. Essentials of the self-organizing map. Neural Networks, 37:52–65, 2013.
- Point-based neural rendering with per-view optimization. Computer Graphics Forum (Proc. of the Eurographics Symposium on Rendering), 40(4), 2021.
- Mf-nerf: Memory efficient nerf with mixed-feature hash table. ArXiv, abs/2304.12587, 2023.
- Light field rendering. In Proc. of the 23rd Annual Conference on Computer Graphics and Interactive Techniques (SIGGRAPH), page 31–42, 1996.
- Compressing volumetric radiance fields to 1 mb. In Proc. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 4222–4231, Los Alamitos, CA, USA, 2023. IEEE Computer Society.
- Neural sparse voxel fields. NeurIPS, 2020.
- Dynamic 3d gaussians: Tracking by persistent dynamic view synthesis. In Proc. International Conference on 3D Vision (3DV), 2024.
- NeRF in the Wild: Neural Radiance Fields for Unconstrained Photo Collections. In Proc. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021.
- Nerf: Representing scenes as neural radiance fields for view synthesis. In Proc. European Conference on Computer Vision (ECCV), 2020.
- Instant neural graphics primitives with a multiresolution hash encoding. ACM Trans. on Graphics, 41(4):102:1–102:15, 2022.
- Survey on deep learning-based point cloud compression. Frontiers in Signal Processing, 2022.
- MERF: Memory-efficient radiance fields for real-time view synthesis in unbounded scenes. ACM Trans. on Graphics, 42(4), 2023.
- ADOP: Approximate differentiable one-pixel point rendering. ACM Trans. on Graphics, 41(4), 2022.
- Photorealistic scene reconstruction by voxel coloring. International Journal of Computer Vision, 35(2), 1999.
- Deepvoxels: Learning persistent 3d feature embeddings. In Proc. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2019.
- Photo tourism: Exploring photo collections in 3d. ACM Trans. on Graphics, 25(3):835–846, 2006.
- Self-sorting map: An efficient algorithm for presenting multimedia data in structured layouts. IEEE Trans. Multim., 16(4):1045–1058, 2014.
- Organizing visual data in structured layout by maximizing similarity-proximity correlation. In ISVC (2), pages 703–713. Springer, 2013.
- Direct voxel grid optimization: Super-fast convergence for radiance fields reconstruction. In Proc. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022.
- Variable bitrate neural fields. ACM Trans. on Graphics, 2022.
- Advances in Neural Rendering. Computer Graphics Forum (EG STAR 2022), 2022.
- Plenoctrees for real-time rendering of neural radiance fields. In Proc. IEEE/CVF International Conference on Computer Vision (ICCV), pages 5732–5741, 2021.
- Digging into radiance grid for real-time view synthesis with detail preservation. In Proc. European Conference on Computer Vision (ECCV), page 724–740, 2022.
- VQ-NeRF: Neural reflectance decomposition and editing with vector quantization, 2023.