Locally Linear Latent Variable Models (LL-LVM)

Updated 2 August 2025
  • LL-LVM is a probabilistic framework that models high-dimensional data as arising from locally linear mappings in a lower-dimensional latent space, preserving local manifold geometry.
  • It employs a variational EM algorithm with Gaussian priors on latent variables and local maps for efficient, closed-form inference and uncertainty quantification.
  • The model integrates neighborhood graph structures using Laplacian priors to enforce local geometry preservation, enabling principled model selection and out-of-sample extensions.

Locally Linear Latent Variable Models (LL-LVMs) are a class of probabilistic models designed to learn the nonlinear manifold structure underlying high-dimensional observations by explicitly modeling the data as arising from locally linear mappings in a lower-dimensional latent space. They combine the local-linearity intuition of non-probabilistic manifold learning methods with the rigor and flexibility of probabilistic inference, enabling uncertainty quantification, principled model selection, and tractable integration with other probabilistic models.

1. Model Formulation and Probabilistic Structure

The canonical LL-LVM posits that each high-dimensional data point $y_i \in \mathbb{R}^{d_y}$ is generated from a low-dimensional latent coordinate $x_i \in \mathbb{R}^{d_x}$ (with $d_x \ll d_y$) through a mapping that is linear only within a local neighborhood. A neighborhood graph (with adjacency matrix $G$ and graph Laplacian $L$) defines the structure over which locality is imposed. The central generative assumption is

$$y_j - y_i \approx C_i (x_j - x_i)$$

for adjacent pairs $(y_i, y_j)$, where $C_i$ is a locally linear map (a $d_y \times d_x$ matrix) at $y_i$. The set of all local maps $\{C_1, \ldots, C_n\}$ (collectively $C$) encodes the local geometry across the data.

The joint probability model is

$$p(y, C, x \mid G) = p(y \mid C, x, G)\, p(C \mid G)\, p(x \mid G).$$

Gaussian priors are placed both on $x$ (with a precision structure involving the Laplacian $L$) and on $C$ (favoring smoothness by penalizing deviations between the local maps of neighbors). Specifically,

$$p(x \mid G, \alpha) = \mathcal{N}(0, \Pi) \quad \text{with} \quad \Pi^{-1} = \alpha I_{n d_x} + 2\, (L \otimes I_{d_x}).$$

A similar Gaussian "smoothness prior" is placed on $C$.

The likelihood $p(y \mid C, x, G)$ penalizes the discrepancy between the observed differences $y_j - y_i$ and the mapped latent differences $C_i (x_j - x_i)$, encoding the locally linear reconstruction error for neighbor pairs.
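
To make this generative structure concrete, the following minimal sketch builds a k-nearest-neighbour graph, forms the latent prior precision $\Pi^{-1} = \alpha I_{n d_x} + 2 (L \otimes I_{d_x})$, and evaluates the locally linear reconstruction error that the likelihood penalizes. The placeholder data, the choices of $k$ and $\alpha$, and the i.i.d. draw of the local maps are illustrative assumptions, not the paper's reference procedure.

```python
# Minimal sketch of the LL-LVM building blocks: a k-NN neighborhood graph,
# the latent prior precision Pi^{-1} = alpha*I + 2*(L kron I_dx), and the
# locally linear reconstruction error that the likelihood penalizes.
# Y, k, alpha, and the i.i.d. draw of the local maps are placeholders.
import numpy as np
from scipy.sparse.csgraph import laplacian
from sklearn.neighbors import kneighbors_graph

rng = np.random.default_rng(0)
n, d_y, d_x, k, alpha = 200, 10, 2, 8, 1.0
Y = rng.normal(size=(n, d_y))                       # placeholder observations

# Neighborhood graph: symmetric adjacency G and graph Laplacian L
G = kneighbors_graph(Y, k, mode="connectivity").toarray()
G = np.maximum(G, G.T)
L = laplacian(G)

# Latent prior p(x | G, alpha) = N(0, Pi), Pi^{-1} = alpha*I + 2*(L kron I_dx)
Pi_inv = alpha * np.eye(n * d_x) + 2.0 * np.kron(L, np.eye(d_x))
cov = np.linalg.inv(Pi_inv)
cov = 0.5 * (cov + cov.T)                           # symmetrize numerically
x = rng.multivariate_normal(np.zeros(n * d_x), cov)
X = x.reshape(n, d_x)

# Local maps C_i (d_y x d_x); drawn i.i.d. here instead of from the
# smoothness prior p(C | G), purely for brevity.
C = rng.normal(size=(n, d_y, d_x))

# Locally linear reconstruction error over graph edges:
#   sum_{(i,j) in E} || (y_j - y_i) - C_i (x_j - x_i) ||^2
err = sum(
    np.sum(((Y[j] - Y[i]) - C[i] @ (X[j] - X[i])) ** 2)
    for i in range(n)
    for j in np.flatnonzero(G[i])
)
print("locally linear reconstruction error:", err)
```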

2. Inference via Variational Optimization

Direct inference is intractable; LL-LVMs employ a variational EM algorithm that approximates the posterior $p(x, C \mid y, G)$ by a factorized form $q(x, C) = q(x)\, q(C)$. The evidence lower bound (ELBO) is

$$\log p(y \mid G) \geq \mathcal{L}[q(x, C), \theta] = \iint q(x)\, q(C) \log \frac{p(y, C, x \mid G, \theta)}{q(x)\, q(C)} \, dx \, dC.$$

E-step updates involve:

  • Updating $q(x)$ via

$$q(x) \propto \exp\, \mathbb{E}_{q(C)}\left[ \log p(y, C, x \mid G, \theta) \right],$$

resulting in $q(x) = \mathcal{N}(\mu_x, \Sigma_x)$, with closed-form updates for $\Sigma_x$ and $\mu_x$ derived from a quadratic expansion of the likelihood.

  • Updating $q(C)$ analogously, with $q(C) = \mathrm{MN}(C \mid M_C, I, \Sigma_C)$ (a matrix normal distribution), where $M_C$ and $\Sigma_C$ depend on sufficient statistics of $q(x)$.

The M-step updates the hyperparameters (e.g., noise precision $\gamma$, latent scale $\alpha$) by maximizing the lower bound, with closed-form or univariate optimization as appropriate.

These computations leverage the Gaussian structure for efficient, closed-form calculations of all variational parameters.
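
The pattern behind these closed-form updates is the standard mean-field identity for jointly Gaussian models: whenever the expected log-joint is quadratic in one block, the optimal factor for that block is Gaussian, with precision given by the quadratic term and mean given by the linear term. The sketch below illustrates this pattern; the matrices `A` and `b` are stand-ins, not the paper's exact expressions.

```python
# Sketch of the closed-form Gaussian update behind each E-step. When the
# expected log-joint is quadratic in one block, say
#     E_q(C)[log p(y, C, x | G, theta)] = -0.5 x^T A x + b^T x + const,
# the optimal mean-field factor is q(x) = N(mu, Sigma) with Sigma = A^{-1}
# and mu = Sigma b. A and b below are illustrative stand-ins.
import numpy as np

def gaussian_factor_update(A, b):
    """Optimal Gaussian factor for a quadratic expected log-joint."""
    Sigma = np.linalg.inv(A)
    mu = Sigma @ b
    return mu, Sigma

rng = np.random.default_rng(1)
m = 6                                   # dimension of the x block (n * d_x)
M = rng.normal(size=(m, m))
A = M @ M.T + m * np.eye(m)             # e.g. Pi^{-1} plus likelihood terms
b = rng.normal(size=m)                  # linear term from the expansion
mu_x, Sigma_x = gaussian_factor_update(A, b)
```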

3. Local Geometry Preservation and Comparison to Non-Probabilistic Manifold Learning

LL-LVMs encode local geometry preservation at the probabilistic level. The key modeling constraint, $y_j - y_i \approx C_i (x_j - x_i)$ for neighboring points, enforces the preservation of local tangent structure, mirroring the goal of non-probabilistic approaches such as Locally Linear Embedding (LLE). In LLE, local reconstruction weights are used to preserve manifold neighborhoods in the embedding. LL-LVMs generalize this intuition probabilistically, yielding not only point estimates but full posterior uncertainty over both the latent locations and the local linear maps.
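
For reference, deterministic LLE proceeds in two steps: it first fits reconstruction weights by minimizing

$$\sum_i \Big\| y_i - \sum_{j \in \mathcal{N}(i)} W_{ij}\, y_j \Big\|^2 \quad \text{subject to} \quad \sum_j W_{ij} = 1,$$

and then finds the embedding minimizing $\sum_i \| x_i - \sum_j W_{ij}\, x_j \|^2$ with the weights held fixed. LL-LVM replaces these two deterministic optimizations with a single likelihood over neighbor differences and posteriors over both the latent coordinates and the local maps.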

This probabilistic treatment enables:

  • Explicit evaluation and selection of neighborhood graphs via the variational evidence.
  • Robustness to "short-circuit" artifacts, since the model can identify misspecified neighborhoods by evaluating evidence lower bounds.
  • Out-of-sample extensions, as new data can be projected probabilistically via the inferred $q(x)$ and $q(C)$.

4. Quantification of Uncertainty and Model Selection

By specifying a full probabilistic joint model over $x$ and $C$, LL-LVMs afford rigorous uncertainty quantification:

  • Latent coordinates $x$ and local maps $C$ are assigned posteriors, with mean and covariance capturing the epistemic uncertainty in the manifold estimation.
  • Model quality can be directly compared across different graph constructions or intrinsic manifold dimensionality by evaluating the variational lower bound.
  • Selection of the intrinsic dimensionality is facilitated probabilistically, bypassing reliance on heuristic criteria (see the sketch below).
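
A minimal sketch of this selection procedure follows, assuming a hypothetical `fit_ll_lvm(Y, k, d_x)` that runs the variational EM of Section 2 and returns its converged lower bound; here it returns a dummy value so the loop runs end to end.

```python
# Sketch of ELBO-based model selection over the neighborhood size k and the
# latent dimensionality d_x. `fit_ll_lvm` is a hypothetical placeholder for
# the variational EM of Section 2.
import itertools
import numpy as np

rng = np.random.default_rng(0)
Y = rng.normal(size=(200, 10))          # placeholder observations

def fit_ll_lvm(Y, k, d_x):
    """Placeholder: run variational EM with a k-NN graph and latent
    dimension d_x, returning the converged ELBO."""
    return -rng.uniform(1e3, 2e3)       # dummy ELBO

candidates = list(itertools.product([5, 8, 12], [1, 2, 3]))   # (k, d_x) pairs
elbos = {(k, d_x): fit_ll_lvm(Y, k, d_x) for k, d_x in candidates}
best_k, best_dx = max(elbos, key=elbos.get)
print(f"selected k={best_k}, d_x={best_dx}, ELBO={elbos[(best_k, best_dx)]:.1f}")
```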

Out-of-sample extension follows a natural procedure: freeze the existing variational posteriors $q(x), q(C)$, and perform E-step updates for the test datum (given its neighborhood) to obtain a posterior distribution for its latent coordinate and local map.

5. Integration with Broader Probabilistic Frameworks

LL-LVMs, being fully probabilistic with all conditional distributions Gaussian, are modular and readily integrated as subcomponents within broader graphical models. This allows:

  • Imposing priors that enforce structure such as temporal dynamics or cluster structure in the latent space.
  • Hybridization with models capturing observed covariates or other modalities, leveraging conditional independence structure.
  • Use as a manifold prior in hierarchical Bayesian modeling.

Their extensibility arises directly from the Gaussian conditional structure, which ensures tractable, closed-form updates when composing the model with other probabilistic modules.
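
As one illustrative composition (an assumption for exposition, not a construction from the original paper), the static latent prior $p(x \mid G, \alpha)$ could be replaced by a linear-Gaussian dynamics prior over a latent trajectory,

$$p(x_{1:T}) = \mathcal{N}(x_1 \mid 0, Q_0) \prod_{t=2}^{T} \mathcal{N}(x_t \mid A x_{t-1}, Q),$$

which keeps every conditional Gaussian, so the closed-form variational updates of Section 2 carry over essentially unchanged to a time-evolving manifold model.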

6. Distinctive Capabilities Relative to Classical Manifold Learning

LL-LVMs are distinct from manifold learning techniques that do not provide uncertainty estimates, probabilistic evaluation of neighborhood quality, or mechanisms for selecting manifold dimensionality. While methods such as PCA, Isomap, or deterministic LLE provide low-dimensional embeddings, they lack probabilistic semantics, making model selection and integration with other models challenging.

The LL-LVM framework lays the foundation for a Bayesian approach to manifold learning, with capabilities not present in earlier methods:

  • Model selection via evidence maximization.
  • Quantitative comparison of hypotheses (e.g., alternate graphs).
  • Out-of-sample generalization with principled uncertainty.
  • Constructing composite models for multipurpose inference.

A plausible implication is that this probabilistic framework allows manifold learning to be embedded within larger systems for transfer learning, hierarchical modeling, or time-evolving manifold estimation, capabilities that remain out of reach for prior approaches based on deterministic embeddings alone.

7. Summary Table of LL-LVM Properties

| Property | LL-LVM | Traditional LLE |
| --- | --- | --- |
| Probabilistic formulation | Yes | No |
| Uncertainty quantification | Yes (Gaussian posteriors over $x$, $C$) | No |
| Neighborhood evaluation / model selection | Variational evidence lower bound | Not available |
| Out-of-sample extension | Closed-form variational E-step | Not addressed |
| Integration into larger models | Yes (Gaussian graphical model) | Difficult |

The LL-LVM offers a principled and extensible tool for probabilistic manifold learning, synthesizing local-geometry preservation with a full Bayesian treatment while enabling quantitative assessment, flexible extension, and rigorous treatment of uncertainty (Park et al., 2014).

References (1)

  • Park, M., Jitkrittum, W., Qamar, A., Szabó, Z., Buesing, L., & Sahani, M. (2014). Bayesian Manifold Learning: The Locally Linear Latent Variable Model (LL-LVM).