
LightGaussian: Unbounded 3D Gaussian Compression with 15x Reduction and 200+ FPS (2311.17245v6)

Published 28 Nov 2023 in cs.CV

Abstract: Recent advances in real-time neural rendering using point-based techniques have enabled broader adoption of 3D representations. However, foundational approaches like 3D Gaussian Splatting impose substantial storage overhead, as Structure-from-Motion (SfM) points can grow to millions, often requiring gigabyte-level disk space for a single unbounded scene. This growth presents scalability challenges and hinders splatting efficiency. To address this, we introduce LightGaussian, a method for transforming 3D Gaussians into a more compact format. Inspired by Network Pruning, LightGaussian identifies Gaussians with minimal global significance on scene reconstruction, and applies a pruning and recovery process to reduce redundancy while preserving visual quality. Knowledge distillation and pseudo-view augmentation then transfer spherical harmonic coefficients to a lower degree, yielding compact representations. Gaussian Vector Quantization, based on each Gaussian's global significance, further lowers bitwidth with minimal accuracy loss. LightGaussian achieves an average 15x compression rate while boosting FPS from 144 to 237 within the 3D-GS framework, enabling efficient complex scene representation on the Mip-NeRF 360 and Tank & Temple datasets. The proposed Gaussian pruning approach is also adaptable to other 3D representations (e.g., Scaffold-GS), demonstrating strong generalization capabilities.

Authors (6)
  1. Zhiwen Fan (52 papers)
  2. Kevin Wang (41 papers)
  3. Kairun Wen (4 papers)
  4. Zehao Zhu (9 papers)
  5. Dejia Xu (37 papers)
  6. Zhangyang Wang (375 papers)
Citations (93)

Summary

LightGaussian: Unbounded 3D Gaussian Compression with 15x Reduction and 200+ FPS

The paper introduces LightGaussian, a method for efficiently compressing the 3D Gaussian representations used in real-time neural rendering. It addresses the substantial storage and scalability costs of foundational techniques such as 3D Gaussian Splatting, achieving an average 15x reduction in storage while raising frames per second (FPS) from 144 to 237.

Key Contributions and Methodology

The primary innovation of LightGaussian is its application of network pruning concepts to 3D Gaussians. By identifying and eliminating Gaussians deemed insignificant for scene reconstruction, the method significantly reduces redundancy without compromising visual fidelity. This pruning uses a rigorous criterion based on global significance scores, which take into account Gaussian opacity and volume.
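The significance-ranked pruning step can be sketched as follows. This is a hypothetical, simplified stand-in: the score here is just opacity times a volume proxy derived from the per-axis scales, whereas the paper's actual global significance score also accumulates each Gaussian's contribution across training-view rays. The function name and the 60% prune ratio are illustrative assumptions, not the paper's API.

```python
import numpy as np

def prune_gaussians(opacity, scales, prune_ratio=0.6):
    """Rank Gaussians by a simplified global significance score
    (opacity times an ellipsoid-volume proxy) and keep only the
    most significant fraction; returns sorted indices to keep."""
    volume = np.prod(scales, axis=1)           # proxy for 3D extent
    significance = opacity * volume            # simplified score
    n_keep = int(len(opacity) * (1.0 - prune_ratio))
    keep = np.argsort(significance)[-n_keep:]  # top scorers survive
    return np.sort(keep)

# toy example: 10 Gaussians with random attributes
rng = np.random.default_rng(0)
opacity = rng.uniform(0.0, 1.0, size=10)
scales = rng.uniform(0.01, 0.1, size=(10, 3))
kept = prune_gaussians(opacity, scales, prune_ratio=0.6)
print(len(kept))  # 4 Gaussians survive
```

In the paper, pruning is followed by a short recovery (fine-tuning) phase so the surviving Gaussians compensate for the removed ones.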

Subsequently, the method employs a novel approach to compress the Spherical Harmonics (SH) representation. Through a distillation process augmented by pseudo-view synthesis, it efficiently transfers knowledge from high- to lower-degree SHs, maintaining the visual effects crucial for scene appearance.
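Because a Gaussian's view-dependent color is linear in its SH coefficients, the distillation objective can be illustrated with a least-squares fit over sampled view directions. This is a simplified stand-in: the basis matrices below are random placeholders for real SH basis evaluations, and the paper optimizes a photometric loss on rendered images (with pseudo-view augmentation) rather than a direct coefficient fit.

```python
import numpy as np

def distill_sh(teacher_coeffs, basis_full, basis_low):
    """Fit low-degree SH coefficients so colors at a set of
    (pseudo-)view directions match the high-degree teacher.
    Colors are linear in the coefficients, so this reduces to
    least squares in this simplified setting."""
    target = basis_full @ teacher_coeffs                      # (V, 3) teacher colors
    student, *_ = np.linalg.lstsq(basis_low, target, rcond=None)
    return student                                            # low-degree coefficients

# stand-in basis matrices over 100 pseudo-view directions; a real
# implementation would evaluate the SH basis functions here
rng = np.random.default_rng(0)
basis_full = rng.normal(size=(100, 16))  # degree 3: (3+1)^2 = 16 functions
basis_low = basis_full[:, :9]            # degree 2 subset: 9 functions
teacher = rng.normal(size=(16, 3))       # per-channel RGB coefficients
student = distill_sh(teacher, basis_full, basis_low)
print(student.shape)  # (9, 3)
```

Note that fitting beats naive truncation: the fitted student matches the teacher's colors at least as well as simply dropping the high-degree coefficients, while storing 9 instead of 16 coefficients per channel.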

Additionally, a hybrid VecTree Quantization scheme is introduced. This technique quantizes the remaining Gaussian attributes to lower bitwidths while preserving accuracy, further reducing storage.
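The core idea of vector quantization can be sketched with a toy k-means codebook: each Gaussian's attribute vector is replaced by a small integer index into a shared codebook. This is an illustrative assumption-laden sketch, not the paper's VecTree scheme, which additionally uses the significance scores to leave the most important Gaussians unquantized.

```python
import numpy as np

def vector_quantize(attrs, n_codes=16, iters=10, seed=0):
    """Toy k-means vector quantization: learn a small codebook and
    store one index per Gaussian instead of the full float vector."""
    rng = np.random.default_rng(seed)
    codebook = attrs[rng.choice(len(attrs), n_codes, replace=False)]
    for _ in range(iters):
        # assign each attribute vector to its nearest code
        d = np.linalg.norm(attrs[:, None] - codebook[None], axis=2)
        idx = d.argmin(axis=1)
        # move each code to the mean of its cluster (skip empty ones)
        for k in range(n_codes):
            if np.any(idx == k):
                codebook[k] = attrs[idx == k].mean(axis=0)
    return codebook, idx

rng = np.random.default_rng(1)
attrs = rng.normal(size=(200, 9))     # e.g. degree-2 SH per Gaussian
codebook, idx = vector_quantize(attrs, n_codes=16)
decoded = codebook[idx]               # reconstruction via codebook lookup
```

Storage drops from one 9-float vector per Gaussian to one small index per Gaussian plus the shared 16-entry codebook.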

Numerical Results

The numerical results underscore the efficacy of LightGaussian. In the experimental evaluations on the Mip-NeRF 360 and Tanks & Temples datasets, the method achieves substantial improvements in storage efficiency and rendering speed. Specifically, the model size shrinks from 727 MB to 42 MB, and rendering speed increases to over 200 FPS with only a minimal decrease in SSIM (0.013).
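As a quick sanity check on the reported figures, the per-configuration size reduction works out slightly above the paper's quoted 15x average:

```python
# Reported model sizes from the evaluation above (in MB)
size_full, size_compact = 727.0, 42.0
ratio = size_full / size_compact
print(round(ratio, 1))  # 17.3
```

The 15x figure is the average across scenes, so individual configurations like this one can exceed it.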

Implications and Future Directions

The implications of this research are both practical and theoretical. Practically, LightGaussian enables the efficient deployment of large-scale 3D scenes in applications like virtual and augmented reality, autonomous driving, and digital twins. Theoretically, it presents a compelling case for adopting pruning and knowledge distillation techniques in 3D neural representations.

Future work could explore extending these techniques to other forms of 3D representations or further optimizing the hybrid quantization strategies. Enhancements in compression technologies and learning-based compression algorithms could also be areas of fruitful investigation.

Conclusion

The development of LightGaussian offers a significant contribution to the field of neural rendering, providing a scalable and efficient solution to the challenges posed by traditional 3D Gaussian methods. Its balanced approach to compression without substantial loss of quality sets a new benchmark for future research endeavors in this area.
