Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
133 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
46 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Compressed 3D Gaussian Splatting for Accelerated Novel View Synthesis (2401.02436v2)

Published 17 Nov 2023 in cs.CV and cs.GR

Abstract: Recently, high-fidelity scene reconstruction with an optimized 3D Gaussian splat representation has been introduced for novel view synthesis from sparse image sets. Making such representations suitable for applications like network streaming and rendering on low-power devices requires significantly reduced memory consumption as well as improved rendering efficiency. We propose a compressed 3D Gaussian splat representation that utilizes sensitivity-aware vector clustering with quantization-aware training to compress directional colors and Gaussian parameters. The learned codebooks have low bitrates and achieve a compression rate of up to $31\times$ on real-world scenes with only minimal degradation of visual quality. We demonstrate that the compressed splat representation can be efficiently rendered with hardware rasterization on lightweight GPUs at up to $4\times$ higher framerates than reported via an optimized GPU compute pipeline. Extensive experiments across multiple datasets demonstrate the robustness and rendering speed of the proposed approach.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (34)
  1. Onesweep: A Faster Least Significant Digit Radix Sort for GPUs. arXiv preprint arXiv:2206.01784, 2022.
  2. Mip-NeRF 360: Unbounded Anti-Aliased Neural Radiance Fields. In 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 5460–5469, New Orleans, LA, USA, 2022. IEEE.
  3. TensoRF: Tensorial Radiance Fields. In Computer Vision – ECCV 2022, pages 333–350, Cham, 2022. Springer Nature Switzerland.
  4. Compressing Explicit Voxel Grid Representations: fast NeRFs become also small. In 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), pages 1236–1245, Waikoloa, HI, USA, 2023. IEEE.
  5. L. Peter Deutsch. DEFLATE Compressed Data Format Specification version 1.3, 1996. Issue: 1951 Num Pages: 17 Series: Request for Comments Published: RFC 1951.
  6. Plenoxels: Radiance Fields without Neural Networks. In 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 5491–5500, New Orleans, LA, USA, 2022. IEEE.
  7. SurfelNeRF: Neural Surfel Radiance Fields for Online Photorealistic Reconstruction of Indoor Scenes. In 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 108–118, Vancouver, BC, Canada, 2023. IEEE.
  8. On Quantizing Implicit Neural Representations. In 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), pages 341–350, Waikoloa, HI, USA, 2023. IEEE.
  9. Deep Blending for Free-viewpoint Image-based Rendering. ACM Transactions on Graphics (Proc. SIGGRAPH Asia), 37(6):257:1–257:15, 2018. Publisher: ACM.
  10. Baking neural radiance fields for real-time view synthesis. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 5875–5884, 2021.
  11. Quantization and training of neural networks for efficient integer-arithmetic-only inference. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 2704–2713, 2018.
  12. TernaryNeRF: Quantizing Voxel Grid-based NeRF Models. In 2022 IEEE International Workshop on Rapid System Prototyping (RSP), pages 8–14, Shanghai, China, 2022. IEEE.
  13. 3D Gaussian Splatting for Real-Time Radiance Field Rendering. ACM Trans. Graph., 42(4), 2023. Place: New York, NY, USA Publisher: Association for Computing Machinery.
  14. Tanks and temples: Benchmarking large-scale scene reconstruction. ACM Transactions on Graphics (ToG), 36(4):1–13, 2017. Publisher: ACM New York, NY, USA.
  15. Compressing Volumetric Radiance Fields to 1 MB. In 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 4222–4231, Vancouver, BC, Canada, 2023. IEEE.
  16. Nerf: Representing scenes as neural radiance fields for view synthesis. Communications of the ACM, 65(1):99–106, 2021. Publisher: ACM New York, NY, USA.
  17. Instant neural graphics primitives with a multiresolution hash encoding. ACM Transactions on Graphics (ToG), 41(4):1–15, 2022. Publisher: ACM New York, NY, USA.
  18. XNOR-Net: ImageNet Classification Using Binary Convolutional Neural Networks. In Computer Vision – ECCV 2016, pages 525–542, Cham, 2016. Springer International Publishing.
  19. Merf: Memory-efficient radiance fields for real-time view synthesis in unbounded scenes. ACM Transactions on Graphics (TOG), 42(4):1–12, 2023. Publisher: ACM New York, NY, USA.
  20. Masked Wavelet Representation for Compact Neural Radiance Fields. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 20680–20690, 2023.
  21. ADOP: approximate differentiable one-pixel point rendering. ACM Trans. Graph., 41(4):1–14, 2022.
  22. Handbook of data compression. Springer Science & Business Media, 2010.
  23. D. Sculley. Web-scale k-means clustering. In Proceedings of the 19th international conference on World wide web, pages 1177–1178, Raleigh North Carolina USA, 2010. ACM.
  24. Direct Voxel Grid Optimization: Super-fast Convergence for Radiance Fields Reconstruction. In 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 5449–5459, New Orleans, LA, USA, 2022. IEEE.
  25. Variable Bitrate Neural Fields. In ACM SIGGRAPH 2022 Conference Proceedings, New York, NY, USA, 2022. Association for Computing Machinery. event-place: Vancouver, BC, Canada.
  26. Compressible-composable NeRF via Rank-residual Decomposition. In Advances in Neural Information Processing Systems, pages 14798–14809. Curran Associates, Inc., 2022.
  27. SqueezeNeRF: Further factorized FastNeRF for memory-efficient inference. In 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pages 2716–2724, New Orleans, LA, USA, 2022. IEEE.
  28. Differentiable Direct Volume Rendering. In IEEE Transactions on Visualization and Computer Graphics, pages 562–572, 2022. Issue: 1.
  29. Point-nerf: Point-based neural radiance fields. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 5438–5448, 2022.
  30. PlenOctrees for Real-time Rendering of Neural Radiance Fields. In 2021 IEEE/CVF International Conference on Computer Vision (ICCV), pages 5732–5741, Montreal, QC, Canada, 2021. IEEE.
  31. The Unreasonable Effectiveness of Deep Features as a Perceptual Metric. In 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 586–595, Salt Lake City, UT, 2018. IEEE.
  32. TinyNeRF: Towards 100 x Compression of Voxel Radiance Fields. Proceedings of the AAAI Conference on Artificial Intelligence, 37(3):3588–3596, 2023. Number: 3.
  33. A universal algorithm for sequential data compression. IEEE Trans. Inform. Theory, 23(3):337–343, 1977.
  34. EWA volume splatting. In Proceedings Visualization, 2001. VIS ’01., pages 29–538, San Diego, CA, USA, 2001. IEEE.
Citations (67)

Summary

  • The paper introduces a compressed 3D Gaussian splatting method that reduces data size by up to 31x while maintaining visual quality.
  • It employs sensitivity-aware clustering, quantization-aware fine-tuning, and entropy encoding to optimize memory and training efficiency.
  • Improved GPU rasterization boosts rendering speed by up to 4x, making the method ideal for real-time novel view synthesis.

Introduction

A novel approach to 3D scene reconstruction is presented in this paper, where optimized 3D Gaussian splatting plays a vital role. The main focus is on enhancing the practicality of view synthesis from sparse image data sets for various applications. This is achieved through a new compressed 3D Gaussian splat representation that promises significant reductions in memory requirements and substantial improvements in rendering performance, even on less powerful GPUs.

Scene Compression Methodology

The compression strategy is thorough, incorporating several innovative steps:

  1. Sensitivity-aware clustering: A pivotal first step where the color information and Gaussian parameters are inspected for their influence on training images. This allows for a precise and tailored compression of these elements into compact codebooks.
  2. Quantization-aware fine-tuning: Following the compression, the system undergoes fine-tuning. This stage recovers details that were lost during clustering, with an emphasis on maintaining a low bit-rate for the training phase.
  3. Entropy encoding: This final compression step leverages the spatial coherence of 3D Gaussian parameters. Algorithms such as entropy and run-length encoding are utilized to efficiently compact the data.

These steps collectively result in up to 31 times reduction of data size with nominal impact on visual quality.

Rendering Performance

Rendering is a critical aspect of novel view synthesis, particularly for real-time applications. The proposed method showcases impressive rendering capabilities. By employing GPU rasterization, rendering is accomplished with up to four times higher frame rates compared to previous techniques, without compromising the visual fidelity of the synthesized views.

Empirical Validation

Extensive testing confirms the robustness and speed of the proposed method. Benchmarks across multiple data sets, including various real-world scenes, demonstrate that the compression retains high image quality with dramatic reductions in required memory. A set of ablation studies further corroborates the contribution of each individual stage within the compression pipeline.

Conclusion and Future Work

The paper concludes by highlighting the substantial compression rates and rendering speed improvements made possible by the new method. The resulting solution is well-suited for applications that involve network streaming or operate on devices with limited processing capabilities. Looking ahead, the authors intend to explore additional compression techniques, especially for spatial positioning of Gaussians, to broaden the scope of applications for their 3D Gaussian splatting based reconstruction and rendering methods.

Youtube Logo Streamline Icon: https://streamlinehq.com