Papers
Topics
Authors
Recent
Search
2000 character limit reached

Deformable 3D Gaussian Splatting

Updated 16 April 2026
  • Deformable 3D Gaussian Splatting is a technique employing anisotropic 3D Gaussian primitives to efficiently render dynamic scenes and facilitate novel view synthesis.
  • It combines cage-based deformation methods and neural deformation fields to maintain high-frequency texture details during both global and local adjustments.
  • Recent advances achieve state-of-the-art performance, with over 35 dB PSNR and real-time rendering speeds exceeding 300 FPS by integrating structured embeddings and adaptive temporal models.

Deformable 3D Gaussian Splatting is a volumetric scene representation and deformation technique that synthesizes novel views of complex, potentially dynamic or edited 3D geometry through explicit, anisotropic 3D Gaussian primitives. While static 3D Gaussian Splatting provides highly efficient, real-time novel-view rendering for fixed geometry, extension to deformable settings requires precise alignment of geometry and preservation of high-frequency appearance during global or local deformations. Recent developments—including cage-based control, neural deformation fields, rigid-body rigging, and embedding-based temporal models—enable the seamless adaptation of existing 3DGS models to arbitrary target shapes or time-varying scenarios with little loss of fidelity.

1. Mathematical Foundations of Deformable 3D Gaussian Splatting

A deformable 3DGS scene is represented as a collection of N explicit, parameterized 3D Gaussians: gi=(μi,Σi,ci,αi)g_i = (\mu_i, \Sigma_i, c_i, \alpha_i) where μiR3\mu_i \in \mathbb{R}^3 is the center, ΣiR3×3\Sigma_i \in \mathbb{R}^{3 \times 3} the symmetric positive-definite covariance (typically factored as Σi=RiSiSiRi\Sigma_i = R_i S_i S_i^\top R_i^\top for rotation RiR_i and axis-aligned scale SiS_i), cic_i the (possibly view-dependent) color, and αi\alpha_i the opacity or density. Rendering projects each Gaussian to the screen, computes per-pixel weights via the analytic projection of ellipsoids, sorts by depth, and composites using alpha-blending:

C(p)=i=1Nciαij<i(1αj)C(p) = \sum_{i=1}^N c_i \, \alpha_i \prod_{j<i} (1-\alpha_j)

Deformation is modeled as a mapping from the canonical Gaussians (typically at t=0t=0 or a reference shape) to a deformed configuration through one of several parameterizations:

  • Cage-based deformation: associates each μiR3\mu_i \in \mathbb{R}^30 to a surrounding cage by mean-value/harmonic coordinates μiR3\mu_i \in \mathbb{R}^31; as the cage deforms, each μiR3\mu_i \in \mathbb{R}^32 is updated via

μiR3\mu_i \in \mathbb{R}^33

where μiR3\mu_i \in \mathbb{R}^34 are new cage vertex positions. Covariances transform via the local deformation Jacobian μiR3\mu_i \in \mathbb{R}^35:

μiR3\mu_i \in \mathbb{R}^36

(Tong et al., 17 Apr 2025, Xie et al., 2024)

  • Neural deformation fields: learn per-Gaussian deformations as functions of (potentially fused) geometric and temporal embeddings, e.g.,

μiR3\mu_i \in \mathbb{R}^37

with μiR3\mu_i \in \mathbb{R}^38 a per-Gaussian embedding, μiR3\mu_i \in \mathbb{R}^39 a temporal embedding (e.g., positional encoding or grid), and ΣiR3×3\Sigma_i \in \mathbb{R}^{3 \times 3}0 an MLP (Bae et al., 2024, Jiao et al., 21 Mar 2026, Jiao et al., 27 Aug 2025).

The color and opacity are typically kept fixed under deformation to preserve high-frequency texture unless explicitly handled for dynamic appearance or view/time dependence.

2. Cage-Based and Hybrid Deformation Control

Cage-based deformation schemes place the 3DGS point cloud inside a "cage" mesh whose deformation induces a volumetric mapping propagated deterministically, thereby enabling:

  • Intuitive global and semiglobal control (structural shape editing)
  • Preservation of high-frequency textural detail via Jacobian-corrected covariance propagation, avoiding splat stretching or blurring (Tong et al., 17 Apr 2025, Xie et al., 2024)
  • Flexible alignment to diverse target modalities: target shapes may be specified as text prompts (via proxy 3DGS), single images (AtlasNet, mesh proxy), point clouds, meshes, or 3DGS models, with shape alignment driven by PointNet-AtlasNet pipelines optimized for physical plausibility, coordinate positivity, and normal consistency.

Integration with neural Jacobian fields provides precise, per-face control of local differential mappings, increasing smoothness and regularity, particularly in user-guided or sketch-based editing scenarios (Xie et al., 2024). However, cage-induced deformations cannot guarantee the preservation of strict invariants such as straight lines or sharp features for CAD-like geometries, and local manipulations may require alternative parameterizations (e.g., direct handle dragging).

3. Neural Deformation Fields and Per-Gaussian Embedding

Most dynamic or temporal 3DGS representations employ neural deformation fields, where each Gaussian's position and shape is predicted by MLPs taking as input per-Gaussian and temporal embeddings. Prominent designs include:

  • Joint per-Gaussian and per-frame embeddings for accurate local deformation, decomposed into slow/fast (coarse/fine) pathways for modeling both gradual and abrupt motion (Bae et al., 2024, Jiao et al., 21 Mar 2026, Jiao et al., 27 Aug 2025).
  • Geometry-aware deformation, in which features derived from local 3D neighborhoods (via sparse 3D U-Nets or PointNet) enrich prediction and enforce spatial consistency (Lu et al., 2024).
  • Partitioning schemes, such as MAPo, that decouple highly dynamic Gaussians from quasi-static components via motion scoring, recursively specializing deformation networks for different spatiotemporal regimes (Jiao et al., 27 Aug 2025).
  • Physics-based or rigid-body deformation, e.g., deformable linear objects (ropes/cables) modeled with PBD solvers coupled with GS-based observation filtering (Dinkel et al., 13 May 2025), or mesh-rigged splats for intraoperative navigation with joint photometric and geometric supervision (Fehrentz et al., 23 Sep 2025).

Ablations consistently show that neglecting per-Gaussian adaptation (e.g., using global or purely coordinate-based deformation) produces contamination across static/dynamic regions and loss of geometric detail (Jiao et al., 27 Aug 2025, Bae et al., 2024).

4. Advanced Deformation Modeling: Temporal, View, and Hierarchical

Emerging techniques further refine deformable 3DGS models by addressing dynamic scene complexities:

  • Temporal basis or RBF expansions for per-Gaussian attribute curves, supporting both reversible (elastic) and irreversible (shearing, topological change) deformations (Shan et al., 2 Jan 2025, Nam et al., 25 May 2025).
  • Adaptive hierarchical motion masks, distinguishing static from deformable regions by evaluating deformation magnitude or rendering-loss differences, restricting neural inference and deformation updates to relevant Gaussians and maximizing rendering speed (Shan et al., 2 Jan 2025).
  • View-dependent deformable fields: augment deformation conditioning with view/camera direction, enabling modeling of subtle appearance and emission changes, notably in domains with challenging radiometric phenomena (e.g., thermal-infrared) (Nam et al., 25 May 2025).
  • Group trajectory analysis and rigid-group factorization (e.g., GroupFlow in SpeeDe3DGS), which clusters Gaussians by similar motion, replacing redundant per-Gaussian neural inference with compact per-group rigid transforms and dramatically accelerating rendering and reducing memory (Tu et al., 9 Jun 2025).

Quantitative evaluations indicate that these strategies yield state-of-the-art quality (e.g., PSNR ΣiR3×3\Sigma_i \in \mathbb{R}^{3 \times 3}1 35 dB, real-time rendering ΣiR3×3\Sigma_i \in \mathbb{R}^{3 \times 3}2 300 FPS) across diverse real and synthetic benchmarks while addressing scalability (Shan et al., 2 Jan 2025, Tu et al., 9 Jun 2025).

5. Applications: Editing, Medical Imaging, Tracking, and Animation

Deformable 3D Gaussian Splatting underpins several lines of application:

  • Interactive 3D geometric editing and animation: cage-based, sketch-driven workflows enable scene-level deformation or animation through explicit or diffusion-prior guided systems, supporting modalities ranging from text and images to arbitrary sketch keyframes (Tong et al., 17 Apr 2025, Xie et al., 2024).
  • Dynamic scene synthesis: per-Gaussian embedding fields, temporal decomposition, and deformation hierarchy enable photorealistic, real-time reconstruction and free-viewpoint rendering in scenes with large, non-rigid motion (e.g., articulated avatars, human-body performance capture, dynamic objects) (Jung et al., 2023, Qian et al., 2023, Bae et al., 2024, Jiao et al., 27 Aug 2025).
  • Real-time 3D medical reconstruction and navigation: explicit and flexible deformation modeling of tissue and organs, combined with rapid point-cloud initialization and segregation of static/deformable components, addresses intraoperative speed and fidelity requirements in endoscopic and surgical applications (Chen et al., 2024, Zhu et al., 2024, Yang et al., 2024, Shan et al., 2 Jan 2025, Fehrentz et al., 23 Sep 2025).
  • Physics-guided and object-specific tracking: hybrid pipelines combine physics-based dynamics (e.g., PBD) with differentiable GS rendering, enabling robust 3D tracking under occlusion in robotics or manipulation scenarios (Dinkel et al., 13 May 2025).
  • Dynamic transient and occlusion handling: deformable transient fields and memory-efficient masking allow high-quality synthesis in settings with occlusions, dynamic distractors, or uncontrolled capture (Park et al., 8 Mar 2025).

6. Quantitative Evaluation, Limitations, and Future Directions

Benchmarks systematically report geometry and appearance fidelity (Chamfer distance, PSNR, SSIM, LPIPS), perceptual metrics (user study votes, DINO cosine), and practical speed/memory (FPS, training time, parameter count). For example, CAGE-GS achieves leading Chamfer and DINO scores on ShapeNet, and quantitative PSNR increases of 1–2 dB have been attributed to geometry-aware or embedding-based deformation compared to earlier coordinate-only or global models (Tong et al., 17 Apr 2025, Lu et al., 2024, Jiao et al., 27 Aug 2025, Jiao et al., 21 Mar 2026).

Key ablation findings:

  • Omission of Jacobian-based covariance update produces severe blurring or texture drift (Tong et al., 17 Apr 2025).
  • Single deformation network for the entire sequence induces over-averaging and blurry renderings in fast dynamic regions unless partitioned (MAPo, GroupFlow) (Jiao et al., 27 Aug 2025, Tu et al., 9 Jun 2025).
  • Explicit per-Gaussian embedding is essential to prevent leakage of fast motion into neighboring static geometry (Bae et al., 2024).

Limitations include:

  • Incomplete preservation of strict geometric invariants (lines, planes) under global cage-based methods (Tong et al., 17 Apr 2025).
  • Trade-off between per-Gaussian neural inference overhead and real-time constraint; group/rank pruning and trajectory factorization mitigate this (Tu et al., 9 Jun 2025).
  • Residual temporal blur or artifacts in extreme motion regions or highly stochastic texture domains (Bae et al., 2024, Nam et al., 25 May 2025).

Future directions cited include:

  • End-to-end learning of Jacobian fields and deformation priors.
  • Interactive/VR deformation and editing interfaces.
  • Deformation-aware neural surface coupling and mutual supervision for guaranteed geometry correspondence (Li et al., 2024).
  • Spatio-temporal deformation and frequency modeling in domains such as thermal or multispectral data (Nam et al., 25 May 2025).
  • Extension to bounded-support kernels and new functional bases (e.g., deformable Beta splatting for adaptive kernel shape and improved densification) (Liu et al., 27 Jan 2025).

References:

Definition Search Book Streamline Icon: https://streamlinehq.com
References (19)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Deformable 3D Gaussian Splatting.