Compressed 3D Gaussian Splatting for Accelerated Novel View Synthesis (2401.02436v2)
Abstract: Recently, high-fidelity scene reconstruction with an optimized 3D Gaussian splat representation has been introduced for novel view synthesis from sparse image sets. Making such representations suitable for applications like network streaming and rendering on low-power devices requires significantly reduced memory consumption as well as improved rendering efficiency. We propose a compressed 3D Gaussian splat representation that utilizes sensitivity-aware vector clustering with quantization-aware training to compress directional colors and Gaussian parameters. The learned codebooks have low bitrates and achieve a compression rate of up to $31\times$ on real-world scenes with only minimal degradation of visual quality. We demonstrate that the compressed splat representation can be efficiently rendered with hardware rasterization on lightweight GPUs at up to $4\times$ higher framerates than reported via an optimized GPU compute pipeline. Extensive experiments across multiple datasets demonstrate the robustness and rendering speed of the proposed approach.
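As a rough illustration of what "sensitivity-aware vector clustering" could look like, the sketch below runs a k-means variant in which each parameter vector carries a non-negative sensitivity weight, and each codebook entry is updated to the weighted mean of its members. The abstract does not say how sensitivity is computed or how assignments are made, so the function name `sensitivity_weighted_kmeans`, the `sensitivity` input, and the plain Euclidean assignment step are all assumptions made for illustration, not the authors' implementation. Quantization-aware training is not covered here.

```python
import numpy as np

def sensitivity_weighted_kmeans(vectors, sensitivity, num_codes, iters=10, seed=0):
    """Hypothetical sketch: build a codebook with sensitivity-weighted k-means.

    vectors:     (N, D) array of parameter vectors (e.g. colors or shape parameters).
    sensitivity: (N,) non-negative weights; assumed to be given, since the
                 abstract does not define how they are obtained.
    Returns (codebook of shape (num_codes, D), indices of shape (N,)).
    """
    vectors = np.asarray(vectors, dtype=np.float64)
    sensitivity = np.asarray(sensitivity, dtype=np.float64)
    rng = np.random.default_rng(seed)
    n = vectors.shape[0]

    # Initialise the codebook from randomly chosen distinct input vectors.
    codebook = vectors[rng.choice(n, size=num_codes, replace=False)].copy()

    def assign(cb):
        # Nearest codebook entry per vector under squared Euclidean distance.
        d2 = ((vectors[:, None, :] - cb[None, :, :]) ** 2).sum(axis=-1)
        return d2.argmin(axis=1)

    for _ in range(iters):
        indices = assign(codebook)
        for k in range(num_codes):
            mask = indices == k
            w = sensitivity[mask]
            # Weighted mean: highly sensitive vectors pull the entry towards them.
            # Entries with no members (or zero total weight) are left unchanged.
            if w.sum() > 0:
                codebook[k] = (w[:, None] * vectors[mask]).sum(axis=0) / w.sum()

    # Recompute assignments so the indices match the final codebook.
    return codebook, assign(codebook)
```

After clustering, each vector would be stored as a small integer index into the codebook instead of its full floating-point values, which is where the memory saving of a codebook representation comes from.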