Papers
Topics
Authors
Recent
Search
2000 character limit reached

ToothForge: Spectral Dental Shape Synthesis

Updated 5 March 2026
  • ToothForge is a spectral generative modeling framework that synchronizes eigenspaces to enable accurate 3D dental shape synthesis.
  • It employs a β-VAE trained on synchronized spectral embeddings, reducing reconstruction error (e.g., MSE of 0.032 for molars) and noise in heterogeneous meshes.
  • The approach supports efficient dental crown interpolation, compression, and smooth latent-space blending without relying on shared mesh connectivity.

ToothForge is a spectral generative modeling framework for high-fidelity 3D dental shape synthesis based on synchronized spectral embeddings. Designed to address the paucity of dental shape datasets and heterogeneity of mesh connectivities in medical applications, ToothForge enables automatic generation, analysis, and interpolation of dental crowns using compact representations in the frequency domain. The method introduces spectral synchronization—a procedure that aligns the eigenspaces of Laplace–Beltrami operators across arbitrary triangle meshes—allowing unified machine learning workflows independent of mesh connectivity. ToothForge incorporates a β-VAE trained over the synchronized spectra, resulting in improved reconstruction, greater flexibility for mesh structures, and practical applications in medical shape domains (Kubík et al., 3 Jun 2025).

1. Spectral Embeddings of Triangular Meshes

The core of ToothForge’s approach relies on spectral analysis of a watertight triangular mesh M=(V,E,F)\mathcal{M} = (\mathcal{V}, \mathcal{E}, \mathcal{F}) with nn vertices v1,,vnR3v_1, \dots, v_n \in \mathbb{R}^3. Discretization of the Laplace–Beltrami operator proceeds via the cotangent-weight scheme, constructing a stiffness matrix WW with

Wij=(cotαij+cotβij)for ij,W_{ij} = -(\cot \alpha_{ij} + \cot \beta_{ij}) \quad \text{for}~ i \neq j,

where (i,j)(i,j) is an edge, and Wii=jiWijW_{ii} = -\sum_{j \ne i} W_{ij}. The lumped mass matrix AA assigns Aii=13(i,j,k)FArea(ijk)A_{ii} = \frac{1}{3} \sum_{(i,j,k) \in \mathcal{F}} \text{Area}(\triangle ijk). The discrete Laplacian is L=A1WL = A^{-1} W.

Eigenpairs (λi,ϕi)(\lambda_i, \phi_i) of Lϕi=λiϕiL \phi_i = \lambda_i \phi_i yield orthonormal eigenvectors (harmonics) and nonnegative eigenvalues (frequencies). Any per-vertex coordinate vector fRnf \in \mathbb{R}^n can be expanded as f=ΦΦff = \Phi\Phi^\top f with Φ\Phi the eigenvector matrix. Truncation to the first kk modes produces compact spectral embeddings, where the kk-banded modal coefficients

Ck=ΦkVRk×3C_k = \Phi_k^\top V \in \mathbb{R}^{k \times 3}

represent the mesh in the frequency domain. The mesh itself is recoverable as Vk=ΦkCkV^k = \Phi_k C_k.

2. Instability of Raw Spectra and Motivation for Synchronization

Spectral decomposition on discrete meshes introduces ambiguities: eigenvectors are determined only up to sign flips, and in the presence of clustered eigenvalues, modes may arbitrarily permute or mix. When comparing two meshes with differing connectivity, the bases Φk\Phi_k are, by construction, unrelated. Consequently, direct use of these raw spectral embeddings CkC_k as machine learning features leads to severe artifacts, including sign-flip noise and basis-mismatch biases, degrading both reconstruction and generative sample quality (Kubík et al., 3 Jun 2025). This fundamental incompatibility precludes effective learning on heterogeneous mesh datasets.

3. Spectral Synchronization via Reference Alignment

ToothForge resolves incompatibility via spectral synchronization. One selects a reference mesh Mref\mathcal{M}_{\text{ref}} and computes its kk-truncated basis Φkref\Phi_{k-\text{ref}}. For each training mesh Mi\mathcal{M}_i, an orthogonal alignment map RiRk×kR_i \in \mathbb{R}^{k \times k} is determined, along with a vertex correspondence cic_i (often via nearest neighbor or coarse registration), to register the eigenbasis Φki\Phi_{k-i} to Φkref\Phi_{k-\text{ref}}. Alignment proceeds by solving

Ri=argminRRk×kΦkrefΦkiRF2R_i = \underset{R \in \mathbb{R}^{k \times k}}{\arg\min} \|\Phi_{k-\text{ref}} - \Phi_{k-i} R\|_F^2

subject to RR=IR^\top R = I, which is the orthogonal Procrustes problem. Its solution is Ri=UVR_i = UV^\top, where UΣV=SVD(ΦkiΦkref)U\Sigma V^\top = \text{SVD}(\Phi_{k-i}^\top \Phi_{k-\text{ref}}). All spectral coefficients CkiC_{k-i} are then synchronized:

Ckisync=RiCkiC_{k-i}^{\text{sync}} = R_i C_{k-i}

so that all data share the basis Φkref\Phi_{k-\text{ref}} and are directly comparable in machine learning tasks.

4. Generative Modeling on Synchronized Spectra

The synchronized coefficients CksyncRk×3C^{\text{sync}}_k \in \mathbb{R}^{k \times 3} serve as the representation for generative modeling. ToothForge employs a β-VAE, where the encoder eθ:Rk×3(μ,Σ)Rd×Rd×de_\theta: \mathbb{R}^{k \times 3} \to (\mu, \Sigma) \in \mathbb{R}^d \times \mathbb{R}^{d \times d} maps coefficients to a Gaussian posterior zN(μ,Σ)z \sim \mathcal{N}(\mu, \Sigma), and the decoder dγ:RdRk×3d_\gamma: \mathbb{R}^d \to \mathbb{R}^{k \times 3} predicts reconstructed spectra C^k\hat{C}_k. The architecture uses k=256k=256, latent dimension d=16d=16, and a five-stage CNN backbone. The loss function is

L=CksyncC^kF2+β KL[N(μ,Σ)N(0,I)],\mathcal{L} = \|C^{\text{sync}}_k - \hat{C}_k\|_F^2 + \beta~\text{KL}[\mathcal{N}(\mu,\Sigma) \| \mathcal{N}(0,I)],

with β[0,0.05]\beta \in [0, 0.05] annealed cyclically. Training utilizes only the synchronized spectra for each mesh.

5. Pipeline Overview: Training and Inference

The modeling workflow includes:

  • Training:
    • Compute the reference basis Φkref\Phi_{k-\text{ref}}.
    • For each mesh Mi\mathcal{M}_i:
    • Construct Laplacian LiL_i and eigenbasis Φki\Phi_{k-i}.
    • Calculate raw spectral coefficients Ci=ΦkiViC_i = \Phi_{k-i}^\top V_i.
    • Align basis by solving Procrustes for RiR_i, yielding synchronized CisyncC_i^{\text{sync}}.
    • Train the β-VAE on {Cisync}\{C_i^{\text{sync}}\}.
  • Inference:
    • Sample latent zN(0,I)z \sim \mathcal{N}(0, I).
    • Decode to obtain C^\hat{C}.
    • Reconstruct vertices V^=ΦkrefC^\hat{V} = \Phi_{k-\text{ref}} \hat{C}; reuse faces and edges from reference mesh to build high-resolution output.

This pipeline operates without the necessity for shared mesh connectivity among samples, instead leveraging only spectral harmonization.

6. Quantitative Performance and Ablation

Evaluated on a private dataset of 430 dental crowns (incisors, premolars, molars), ToothForge demonstrates that training on synchronized spectra reduces spectral MSE to 0.032 (molars), an order of magnitude lower than models trained on unaligned data. Spatial Chamfer distances exhibit similar improvements. Ablation experiments show omission of synchronization introduces noise and artifacts in interpolation, while proper alignment yields smooth, anatomically consistent morphs. The combined effect of synchronization and β-regularization reduces minimum matching distance (MMD) below 0.0059 for all classes. Generation throughput is approximately 1 millisecond per 1,000 samples on a Tesla T4 (Kubík et al., 3 Jun 2025).

7. Additional Applications, Limitations, and Generalizability

ToothForge enables additional applications: (a) Efficient shape compression using moderate values of kk (128–256 modes) preserves fine morphological details such as cusps; (b) Latent-space interpolation in the regularized VAE latent space, thanks to shared spectral bases, produces smooth vertex-wise blends between shapes.

Limitations are present: the reliance on a single reference mesh and explicit vertex correspondences can be challenged by extreme deformations or topological differences. The alignment map is rigid; potential extensions include adopting learned functional maps or expanding the framework to anatomically diverse organs with heterogeneous mesh structure. The general principle of spectral synchronization—aligning mesh spectra prior to frequency-domain learning—is broadly transferable to medical shape modeling domains with inconsistent connectivity. Key benefits include amelioration of sign-flip and basis-mismatch noise, independence from connectivity constraints, and compact, high-fidelity generative modeling (Kubík et al., 3 Jun 2025).

Definition Search Book Streamline Icon: https://streamlinehq.com
References (1)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to ToothForge (Spectral Synchronization).