- The paper introduces a multi-teacher knowledge distillation framework to compress 3D Gaussian Splatting models while improving novel view synthesis quality.
- It employs diverse teacher models and spatial distribution loss to achieve up to 0.62 dB PSNR improvement and an 86–89% reduction in Gaussian primitives.
- The approach enables real-time, memory-efficient rendering with high fidelity, making it well suited to VR/AR and mobile applications.
Distilled-3DGS: Multi-Teacher Knowledge Distillation for Efficient 3D Gaussian Splatting
Introduction
Distilled-3DGS introduces a multi-teacher knowledge distillation framework for 3D Gaussian Splatting (3DGS), targeting the dual objectives of high-fidelity novel view synthesis and significant reduction in memory/storage requirements. The method leverages ensembles of teacher models—standard, noise-perturbed, and dropout-regularized 3DGS variants—to supervise a compact student model. The framework addresses the unique challenges posed by the explicit, unstructured nature of 3DGS representations, including unordered Gaussian primitives and scene-dependent distributions, by proposing geometry-aware distillation strategies and a spatial distribution consistency loss.
Multi-Teacher Distillation Framework
The core architecture of Distilled-3DGS consists of two stages: (1) training diverse teacher models and (2) distilling their knowledge into a lightweight student model.
Figure 1: The architecture of multi-teacher knowledge distillation framework for 3DGS, showing the two-stage process and the integration of spatial distribution distillation.
Teacher Model Diversity
Three teacher models are trained independently:
- Standard 3DGS (Gstd): Optimized with photometric and structural similarity losses.
- Perturbed 3DGS (Gperb): Gaussian parameters are randomly perturbed during training, enhancing robustness to input variations.
- Dropout 3DGS (Gdrop): Gaussian primitives are randomly deactivated during training, promoting distributed scene representation and generalization (a minimal sketch of both schemes follows below).
Each teacher model produces high-quality, dense point clouds, which are subsequently fused to generate pseudo-labels for student supervision.
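A minimal sketch of how the perturbation and dropout teachers could be implemented is shown below, using PyTorch-style tensors. The noise magnitude, dropout rate, and the choice of which parameters to perturb are illustrative assumptions, not the paper's exact settings.

```python
import torch

def perturb_gaussians(means, scales, noise_std=0.01):
    # Noise-perturbed teacher (Gperb): jitter positions and scales with small
    # Gaussian noise during training. noise_std is an illustrative assumption,
    # not the paper's exact schedule.
    noisy_means = means + torch.randn_like(means) * noise_std
    noisy_scales = scales + torch.randn_like(scales) * noise_std
    return noisy_means, noisy_scales

def dropout_gaussians(opacities, drop_prob=0.1):
    # Dropout teacher (Gdrop): randomly deactivate a fraction of primitives for
    # one iteration by zeroing their opacity. Assumes opacities has shape (N, 1);
    # drop_prob is an illustrative assumption.
    keep = (torch.rand(opacities.shape[0], 1, device=opacities.device) > drop_prob)
    return opacities * keep.float()
```

Applying this randomness only during teacher optimization diversifies the fused pseudo-labels without changing the rendering pipeline itself.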
Student Model Training
The student model is pruned aggressively using importance scores (e.g., Mini-Splatting criteria), resulting in a much sparser set of Gaussians. Supervision is provided via the fused teacher pseudo-labels together with the spatial distribution distillation loss described in the next subsection.
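As a rough illustration, importance-based pruning can be sketched as a top-k selection over per-Gaussian scores. The scoring (accumulated rendering contribution, as in Mini-Splatting) and the keep ratio below are assumptions; the ratio is chosen only to mirror the reported 86–89% reduction.

```python
import torch

def prune_by_importance(importance, params, keep_ratio=0.12):
    # Keep only the highest-scoring Gaussians. 'importance' is a (N,) tensor of
    # per-Gaussian scores (e.g., accumulated rendering contribution as in
    # Mini-Splatting); the exact scoring and keep_ratio here are assumptions.
    n_keep = max(1, int(importance.numel() * keep_ratio))
    keep_idx = torch.topk(importance, n_keep).indices
    # 'params' is a dict of per-Gaussian tensors (means, scales, rotations,
    # opacities, SH coefficients), each with first dimension N.
    return {name: p[keep_idx] for name, p in params.items()}
```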
Spatial Distribution Distillation
Direct point-wise correspondence between teacher and student Gaussians is infeasible because the primitives are unordered and their counts differ between models. Instead, the method discretizes the 3D space into voxels and computes occupancy histograms for both teacher and student point clouds. The cosine similarity between the normalized histograms serves as the structural loss, capturing global and local geometric consistency regardless of point density or sampling noise.
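The sketch below illustrates the voxel occupancy histogram and cosine-similarity structural loss described above. The grid resolution, the bounds handling, and the use of Gaussian centers as the point cloud are assumptions; the paper's exact discretization may differ.

```python
import torch
import torch.nn.functional as F

def occupancy_histogram(points, bounds_min, bounds_max, resolution=32):
    # Map each point (e.g., a Gaussian center) to a voxel index and count
    # occupancy per voxel; return an L2-normalized, flattened histogram.
    norm = (points - bounds_min) / (bounds_max - bounds_min + 1e-8)
    idx = (norm.clamp(0.0, 1.0 - 1e-6) * resolution).long()
    flat = (idx[:, 0] * resolution + idx[:, 1]) * resolution + idx[:, 2]
    hist = torch.zeros(resolution ** 3, device=points.device)
    hist.scatter_add_(0, flat, torch.ones_like(flat, dtype=hist.dtype))
    return F.normalize(hist, dim=0)

def spatial_distribution_loss(student_pts, teacher_pts, bounds_min, bounds_max, resolution=32):
    # Structural loss: 1 - cosine similarity of the two normalized occupancy
    # histograms. With unit-norm histograms, the dot product is the cosine.
    h_s = occupancy_histogram(student_pts, bounds_min, bounds_max, resolution)
    h_t = occupancy_histogram(teacher_pts, bounds_min, bounds_max, resolution)
    return 1.0 - torch.dot(h_s, h_t)
```

Because the histograms are normalized, the loss is insensitive to the absolute number of primitives, which is what allows a heavily pruned student to match the teachers' spatial distribution. Note that the hard binning shown here is not differentiable with respect to point positions; a softer assignment would be needed if gradients must flow through the positions themselves.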
Experimental Results
Distilled-3DGS is evaluated on Mip-NeRF360, Tanks and Temples, and Deep Blending datasets. The method consistently achieves superior rendering quality and storage efficiency compared to both NeRF-based and 3DGS-based baselines.
Figure 3: Visualized comparison on the Bicycle, Garden, and Kitchen scenes, demonstrating superior detail preservation by Distilled-3DGS.
Key quantitative results:
- PSNR improvements: +0.55 dB (Mip-NeRF360), +0.62 dB (Tanks and Temples), +0.46 dB (Deep Blending) over vanilla 3DGS.
- Gaussian reduction: 86–89% fewer Gaussians than vanilla 3DGS, with comparable or better fidelity.
- SSIM and LPIPS: Consistently higher SSIM and lower LPIPS than vanilla 3DGS, indicating improved structural and perceptual quality.
Ablation studies confirm the complementary benefits of teacher diversity and the critical role of spatial distribution distillation. Removal of either the perturbation or dropout teacher, or the structural loss, leads to measurable drops in PSNR and perceptual quality.
Figure 4: Visual comparison with different teacher models, highlighting the degradation in rendering quality when teacher diversity is reduced.
Implementation Considerations
- Training Overhead: Multi-teacher distillation requires pre-training several high-capacity teacher models, increasing training compute and memory roughly in proportion to the number of teachers.
- Student Model Efficiency: The student model achieves real-time rendering and is suitable for deployment on resource-constrained devices due to its compactness.
- Structural Loss Scalability: Voxel grid resolution trades memory and computation against structural fidelity; higher resolutions capture finer geometry at increased cost (see the back-of-envelope sketch after this list).
- Generalization: The framework is robust to scene complexity and point density variations, making it applicable to a wide range of real-world scenarios.
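As a back-of-envelope illustration of the resolution trade-off, a dense float32 occupancy histogram grows cubically with grid resolution. The figures below are simple arithmetic, not measurements from the paper.

```python
# Dense float32 occupancy histogram: 4 bytes per voxel, resolution^3 voxels.
for r in (64, 128, 256):
    print(f"resolution {r}: {4 * r**3 / 2**20:.1f} MiB")
# resolution 64: 1.0 MiB
# resolution 128: 8.0 MiB
# resolution 256: 64.0 MiB
```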
Theoretical and Practical Implications
Distilled-3DGS demonstrates that knowledge distillation can be effectively adapted to explicit, unstructured 3D representations, overcoming the limitations of conventional KD approaches reliant on latent feature spaces. The spatial distribution distillation strategy provides a generalizable mechanism for structure-aware supervision in point-based models. Practically, the method enables high-quality view synthesis with minimal storage, facilitating deployment in VR/AR, robotics, and mobile applications.
Future Directions
Potential avenues for further research include:
- End-to-End Distillation: Integrating teacher and student training into a unified pipeline to reduce training overhead.
- Adaptive Pruning: Dynamic adjustment of Gaussian counts during training for optimal trade-off between quality and efficiency.
- Generalization to Other Representations: Extending spatial distribution distillation to other explicit 3D representations (e.g., meshes, voxels).
Conclusion
Distilled-3DGS establishes a robust framework for compressing 3DGS models via multi-teacher knowledge distillation and spatial distribution consistency. The approach achieves state-of-the-art rendering quality with drastically reduced memory requirements, validated across diverse datasets and scene types. The method's scalability and efficiency position it as a practical solution for real-time, memory-constrained 3D scene synthesis, with promising directions for further optimization and generalization.