
FastGS: Accelerated 3D Gaussian Splatting

Updated 9 November 2025
  • FastGS is an acceleration framework for 3D Gaussian Splatting that adaptively adjusts Gaussian counts based on multi-view consistency to significantly reduce training time.
  • It integrates multi-view consistent densification and pruning modules, optimizing Gaussian placements during training to maintain or improve PSNR and SSIM.
  • The framework achieves 2–15× speedup across static, dynamic, and SLAM applications, demonstrating broad applicability and high-quality rendering.

FastGS is an acceleration framework for 3D Gaussian Splatting (3DGS) that adaptively regulates the number and placement of Gaussians during training based on multi-view consistency metrics, thereby achieving a substantial reduction in training time without compromising final rendering quality. Unlike prior approaches that rely on fixed budgeting or simplistic heuristics for densification and pruning, FastGS introduces a dynamic method that utilizes multi-view photometric error for both adding and removing Gaussian primitives. This framework achieves up to 15.45× acceleration over baseline 3DGS, delivers equivalent or improved PSNR and SSIM scores, and is broadly applicable across static reconstruction, dynamic scenes, surface modeling, large-scale geometry, and SLAM.

1. Methodological Foundations

FastGS operates within the standard 3DGS workflow but introduces two principal modules: multi-view consistent densification (VCD) and multi-view consistent pruning (VCP). The process can be summarized as follows:

  1. Initialization: Multi-view images and an SfM point cloud (e.g., from COLMAP) are used to instantiate an initial set of anisotropic 3D Gaussians. Each primitive $\mathcal{G}_i$ is parameterized by a mean $\mu_i \in \mathbb{R}^3$, a rotation $r_i$ (quaternion), a scale $s_i \in \mathbb{R}^3$ (defining the covariance), an opacity $\sigma_i \in [0,1)$, and SH color coefficients $c_i \in \mathbb{R}^{16 \times 3}$.
  2. Iterative Training: For $T = 30{,}000$ steps:
    • The 3DGS rasterizer produces rendered images $R$, which are compared to the ground-truth images $G$ to compute the loss $L = (1-\lambda)\,L_1(R, G) + \lambda\,(1 - \mathrm{SSIM}(R, G))$; a minimal code sketch of this loss follows the list.
    • Densification (VCD) is executed every $D_+ = 500$ steps (until $t \leq 15{,}000$), while pruning (VCP) is performed on a complementary or continuous schedule.
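The following is a minimal sketch of the photometric loss above, assuming PyTorch tensors and an externally supplied SSIM implementation (e.g., from torchmetrics or the reference 3DGS codebase); the default λ = 0.2 is the conventional 3DGS value and is an assumption here, as the paper's exact setting is not restated.

import torch

def photometric_loss(render, gt, ssim_fn, lam=0.2):
    """(1 - lam) * L1 + lam * (1 - SSIM), as in standard 3DGS training.

    render, gt : image tensors of shape (C, H, W) in [0, 1]
    ssim_fn    : any SSIM callable returning a scalar tensor (supplied externally)
    lam        : loss mixing weight; 0.2 is the usual 3DGS default (assumed)
    """
    l1 = (render - gt).abs().mean()
    return (1.0 - lam) * l1 + lam * (1.0 - ssim_fn(render, gt))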

Multi-View Consistency Densification (VCD)

VCD identifies 3D Gaussians that persistently correspond to high-error regions across multiple training views:

  • For view $v^j$ at pixel $(u,v)$:

$$e_{u,v}^j = \frac{1}{C}\sum_{c=1}^{C} \left| r_{u,v}^{j,c} - g_{u,v}^{j,c} \right|$$

(where $C$ is the number of color channels).

  • Errors are normalized to $[0,1]$, and a binary mask $M^j_{\text{mask}}$ is computed by thresholding the normalized error $\hat{e}_{u,v}^j$.
  • Each Gaussian is projected into all $K$ sampled training views; its densification score $s_i^+$ is the mean count of mask-activated pixels within its projected footprint $\Omega_i$:

$$s_i^+ = \frac{1}{K} \sum_{j=1}^{K} \sum_{p \in \Omega_i} \mathbb{I}\!\left[ M^j_{\text{mask}}(p) = 1 \right]$$

  • Gaussians with $s_i^+ > \tau_+$ are split, using the local principal axes for geometry-aware cloning; a sketch of the score computation follows this list.
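A minimal sketch of the VCD scoring under stated assumptions: the error masks are computed per view as above, per-Gaussian pixel footprints (gaussian id, u, v) are assumed to be exported by the rasterizer, and the mask threshold value is a placeholder; none of the function names below are from the paper.

import torch

def view_error_mask(render, gt, thresh=0.5):
    # Channel-mean absolute error e_{u,v}, min-max normalised to [0, 1],
    # then thresholded into a binary high-error mask of shape (H, W).
    # The threshold value is illustrative, not the paper's setting.
    err = (render - gt).abs().mean(dim=0)
    err = (err - err.min()) / (err.max() - err.min() + 1e-8)
    return (err > thresh).float()

def densification_scores(masks, footprints, num_gaussians):
    # masks      : K binary (H, W) error masks, one per sampled view
    # footprints : K LongTensors of shape (P_j, 3) with rows (gaussian_id, u, v)
    #              listing every pixel a Gaussian covers in view j (assumed to
    #              be exported by the rasterizer; not part of any public API)
    # Returns s_i^+ : mean, over K views, of mask-activated pixels in Omega_i.
    scores = torch.zeros(num_gaussians)
    for mask, fp in zip(masks, footprints):
        hits = mask[fp[:, 2], fp[:, 1]]          # mask value at pixel (u, v)
        scores.index_add_(0, fp[:, 0], hits)     # accumulate per Gaussian
    return scores / len(masks)

Gaussians whose score exceeds $\tau_+$ would then be split along their local principal axes, as described above.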

Multi-View Consistency Pruning (VCP)

VCP removes Gaussians whose contribution is superficial or redundant:

  • For each view $j$, the global photometric loss is $E_{\text{photo}}^j = (1-\lambda)\,L_1^j + \lambda\,(1 - \mathrm{SSIM}^j)$.
  • The pruning score is

$$s_i^- = \mathrm{Norm}\!\left( \sum_{j=1}^{K} \left( \sum_{p \in \Omega_i} \mathbb{I}\!\left[ M^j_{\text{mask}}(p) = 1 \right] \right) E_{\text{photo}}^j \right)$$

where $\mathrm{Norm}(\cdot)$ denotes min-max normalization over all $i$.

  • When $s_i^- > \tau_-$, the corresponding primitive is pruned; a code sketch follows this list.
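Continuing the sketch above, the VCP score weights each view's masked pixel count by that view's global photometric loss and then min-max normalizes across Gaussians. The footprint format and function names are the same hypothetical ones used in the VCD sketch.

import torch

def pruning_scores(masks, footprints, photo_losses, num_gaussians):
    # photo_losses : K per-view scalars E_photo^j = (1-lam)*L1^j + lam*(1-SSIM^j)
    # masks, footprints : as in densification_scores above
    # Returns s_i^- : min-max normalised, loss-weighted masked-pixel counts.
    scores = torch.zeros(num_gaussians)
    for mask, fp, e_photo in zip(masks, footprints, photo_losses):
        hits = mask[fp[:, 2], fp[:, 1]]
        scores.index_add_(0, fp[:, 0], hits * float(e_photo))
    lo, hi = scores.min(), scores.max()
    return (scores - lo) / (hi - lo + 1e-8)      # Norm(.)

Primitives whose score exceeds $\tau_-$ are removed before the next optimization step.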

High-level Pseudocode

Input: G (Gaussians), {I_j} (ground-truth images), {v_j} (cameras),
       K, τ₊, τ₋, D₊, D₋, T

for t in 1..T:
    sample a training view v_j
    R ← render(G, v_j)
    loss ← compute_photometric_loss(R, I_j)    # (1 − λ)·L1 + λ·(1 − SSIM)
    optimize G w.r.t. loss

    if t ≤ 15000 and t % D₊ == 0:              # VCD
        for each Gaussian 𝒢ᵢ:
            if densify_score(𝒢ᵢ) > τ₊:
                split(𝒢ᵢ)

    if t % D₋ == 0:                            # VCP
        for each Gaussian 𝒢ᵢ:
            if prune_score(𝒢ᵢ) > τ₋:
                prune(𝒢ᵢ)

2. Theoretical Properties

The principal theoretical claim is that training time $T$ scales proportionally to the number of active Gaussians $N$, since both the forward and backward 3DGS passes depend on $N$. By enforcing tight multi-view consistency in densification, FastGS ensures that $N_{\text{FastGS}} \ll N_{\text{vanilla}}$ throughout training; consequently,

$$T_{\text{FastGS}} \ll T_{\text{vanilla}}$$

This empirical scaling is supported by the observed 2–15× training-time reductions. The pruning regime is conservative: only Gaussians associated with demonstrable multi-view photometric error are removed, preserving rendering quality to within less than 0.2 dB PSNR.
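As an illustrative back-of-the-envelope check (not from the paper), treat per-iteration cost as proportional to the active Gaussian count and use the final counts from the Mip-NeRF 360 row in Section 3 (3DGS-accel vs. FastGS) as a rough stand-in for the time-averaged counts:

$$\frac{T_{\text{baseline}}}{T_{\text{FastGS}}} \approx \frac{N_{\text{baseline}}}{N_{\text{FastGS}}} \approx \frac{2.63\,\mathrm{M}}{0.38\,\mathrm{M}} \approx 6.9$$

which is of the same order as the reported 5.7× speedup on that dataset; the remaining gap is plausibly due to per-iteration costs that do not scale with $N$.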

No closed-form convergence proof is provided; the schedule is justified empirically by ablation studies on diverse datasets.

3. Performance Evaluation and Empirical Results

FastGS is validated on Mip-NeRF 360, Tanks & Temples, Deep Blending, and additional dynamic and SLAM tasks. Metrics include PSNR, SSIM, and LPIPS for visual quality, along with total training time, final Gaussian count, and inference throughput (FPS).

Summary of Quantitative Results

Dataset           Method         Training Time (min)   PSNR (dB)   # Gaussians (M)   Speedup
Mip-NeRF 360      3DGS-accel     10.94                 27.52       2.63              –
Mip-NeRF 360      DashGaussian    6.38                 27.73       2.40              1.8×
Mip-NeRF 360      FastGS          1.92                 27.54       0.38              5.7×
Deep Blending     3DGS-accel      8.87                 29.74       –                 –
Deep Blending     3DGS           19.77                 –           –                 –
Deep Blending     FastGS          1.28                 30.03       –                 15.45×
Tanks & Temples   3DGS-accel      6.96                 23.85       –                 –
Tanks & Temples   FastGS          1.32                 24.15       –                 5.3×

Ablation studies further show that VCD alone yields a 3.0× reduction in training time, VCP alone a 1.95× reduction, and the combined method fully realizes the observed speedups with minimal effect on visual metrics.

4. Generality and Applicability

FastGS functions as a plug-in algorithm, compatible with any 3DGS-based pipeline. The observed accelerations are not limited to static reconstruction but extend to:

  • Static pipelines: 3DGS-accel, Mip-Splatting, Scaffold-GS.
  • Dynamic scene modeling: Deformable-3DGS on NeRF-DS, Neu3D.
  • Surface mesh reconstruction: Plugged into PGSR on Tanks & Temples and Mip-NeRF 360, preserving F1 scores at 2–6× reduced training time.
  • Sparse-view and large-scale scenes: DropGaussian (sparse-view), Octree-GS (urban/large indoor), with 3–4× acceleration.
  • Simultaneous localization and mapping: Photo-SLAM, running 5× faster at the same localization accuracy.

These results indicate broad transferability of the core criteria based on view-consistent error occupancy.

5. Implementation Insights

FastGS maintains compatibility with standard 3DGS rasterizers (tile-based splatting, $\alpha$-blending), leveraging the photometric loss ($L_1$, SSIM), SH-based color, and Adam optimization throughout. The only modifications are the VCD/VCP routines, which operate outside the per-ray rendering kernel.

Densification and pruning are scheduled at configurable iteration intervals. Optimal settings ($K=10$, $\tau_+$ and $\tau_-$ tuned on held-out data, $D_+=500$) are task-agnostic and require minimal adjustment across application domains.
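For concreteness, the reported settings can be collected into a small configuration object. The field names below are illustrative rather than the paper's actual API, and the two threshold defaults are placeholders, since only their tuning procedure is reported.

from dataclasses import dataclass

@dataclass
class FastGSConfig:
    # Values taken from the text where reported; names and the tau defaults
    # are placeholders (the paper tunes tau_+ / tau_- on held-out data).
    num_sampled_views: int = 10      # K
    densify_interval: int = 500      # D_+
    densify_until_iter: int = 15_000
    total_iters: int = 30_000        # T
    lambda_ssim: float = 0.2         # loss mixing weight (standard 3DGS value, assumed)
    tau_densify: float = 0.5         # tau_+ (placeholder)
    tau_prune: float = 0.5           # tau_- (placeholder)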

Final rendering proceeds as in standard 3DGS, freezing the learned Gaussian cloud. Memory and compute usage are directly proportional to the intermediate Gaussian count $N$, leading to a reduction in both hardware and run-time demands.

6. Comparative Perspective

Unlike prior methods that employ fixed-budgeting, arbitrary splitting heuristics, or manual Gaussian culling, FastGS links all densification and pruning to explicit, aggregated photometric error maps across training views. This effectively eliminates both unnecessary densification (over-segmentation) and suboptimal pruning (removal of beneficial primitives). The method dispenses with auxiliary scheduling mechanisms, relying solely on multi-view error statistics. Ablation reveals that the combined multi-view strategy dominates other optimization and culling heuristics in efficiency.

7. Limitations and Extensions

FastGS does not provide a theoretical convergence guarantee. The VCD and VCP thresholds are tuned empirically and may need adjustment for niche modalities (e.g., non-Lambertian surfaces or highly sparse camera trajectories). The reliance on photometric loss maps presumes reasonable multi-view calibration and geometric alignment.

Potential future directions, suggested by the architecture, include coupling view-consistent error-based scheduling with adaptive learning rates, further generalization to other non-splat primitive sets, and possible extension to in-situ (online) scene refinement for robotics and mapping applications.


In summary, FastGS demonstrates that tightly integrating multi-view photometric error analysis with adaptive primitive scheduling delivers order-of-magnitude gains in 3DGS training speed, substantial memory and compute savings, and cross-domain generality, all while maintaining or exceeding established standards of rendering fidelity (Ren et al., 6 Nov 2025).
