Papers
Topics
Authors
Recent
Search
2000 character limit reached

Stable Rank and Distance Preservation

Updated 10 January 2026
  • Stable Rank is a spectral invariant that quantifies the effective dimension or spread in data, ensuring the stability of topological features.
  • Distance preservation guarantees that metric and topological summaries remain stable under perturbations and embeddings, facilitating reliable dimension reduction.
  • Applications span persistent homology, JL embeddings, and graph clustering, supporting scalable, noise-resistant geometric and topological data analysis.

Stable rank is a categorical and spectral invariant underlying robust distance preservation in persistent homology, metric embeddings, and geometric data analysis. It measures the effective complexity (in terms of spread or dimension) of objects such as persistence modules or point clouds and provides a foundation for both the stability of topological invariants and dimension reduction schemes. Distance preservation addresses how metrics or topological summaries are stably retained under perturbations, embedding mappings, or categorical generalizations.

1. Stable Rank: Definitions and General Principles

Stable rank arises in distinct but related contexts: vector spaces, persistence modules, difference matrices, and regular categories.

  • In spectral embedding and metric geometry, the stable rank of a real matrix MM is defined as sr(M)=MF2/M22\mathrm{sr}(M) = \|M\|_F^2 / \|M\|_2^2—the ratio of Frobenius norm squared to operator norm squared—which quantifies the number of significant singular directions beyond simple rank (Deshpande et al., 2015, Casey, 2023).
  • In persistence, the stable rank generalizes the classical Betti number: for a diagram F:(R,)RF: (\mathbb{R}, \le) \to \mathbf{R}, pF(u,v)=r(Im[F(u)F(v)])p_F(u,v) = r(\mathrm{Im}[F(u) \to F(v)]) records the integer-valued invariant (e.g., dimension) induced by a rank function rr satisfying monotonicity and subadditivity on a regular category (R,r)(\mathbf{R}, r) (Bergomi et al., 2019).
  • In one-parameter persistence, given a pseudometric dCd_C between modules (from a contour CC), the stabilized rank rank^C(V)(t)\widehat{\mathrm{rank}}_C(V)(t) is the minimum rank among modules WW within dCd_C distance tt of VV (Chachólski et al., 2019, Agerberg et al., 2023).

2. Stable Rank and Distance Preservation in Metric Embeddings

Stable rank directly governs the quality of average distance preservation in spectral embeddings, particularly in 22\ell_2^2 to 1\ell_1 dimension reduction:

  • For nn points x1,...,xnRdx_1, ..., x_n \in \mathbb{R}^d satisfying 22\ell_2^2 triangle inequalities (i.e., squared distances forming a metric), the stable rank of the difference matrix MM controls the distortion of embeddings f:Rd1f:\mathbb{R}^d \to \ell_1. Specifically:

    • There exists an explicit linear embedding with contraction and average distortion O(sr(M))O(\mathrm{sr}(M)). That is, for all i,ji, j:

    f(xi)f(xj)2xixj22\|f(x_i) - f(x_j)\|_2 \leq \|x_i - x_j\|_2^2

    while

    i<jf(xi)f(xj)21sr(M)i<jxixj22\sum_{i<j} \|f(x_i) - f(x_j)\|_2 \geq \frac{1}{\mathrm{sr}(M)} \sum_{i<j} \|x_i - x_j\|_2^2

    (Deshpande et al., 2015). - High stable rank implies strong aggregate distance preservation for most pairs, and is essential for efficient approximation algorithms (e.g., for Sparsest Cut) (Deshpande et al., 2015).

  • Bulk Johnson–Lindenstrauss Lemmas: If one tolerates an η\eta fraction of distances being distorted, only the minimal stable rank RR of batches of difference vectors limits the necessary target dimension mm for random projections. The formula

m=O(ϵ2log(4e/η)logN/R)m = O(\epsilon^{-2} \log(4e/\eta) \log N / R)

quantifies how high stable rank enables drastic dimensionality reduction with controlled distance distortion in almost all pairs (Casey, 2023).

3. Rank-Based Persistence and Categorical Stability

The categorical axiomatization of stable rank enables broad generalization of persistence and its stability:

  • In a ranked category (R,r)(\mathbf{R}, r), the rank function is required to satisfy:

    1. Monotonicity under monomorphisms: AB    r(A)r(B)A \hookrightarrow B \implies r(A) \leq r(B),
    2. Monotonicity under regular epimorphisms: BD    r(B)r(D)B \twoheadrightarrow D \implies r(B) \geq r(D),
    3. Subadditivity in pullback squares: r(B)r(A)r(D)r(C)r(B) - r(A) \geq r(D) - r(C). (Bergomi et al., 2019).
  • Persistence functions pF(u,v)p_F(u,v) induced by such categories generalize Betti numbers and recover classical stable ranks in vector space settings. Moreover, these persistence invariants are stable under function perturbations:

fgdI(F,G)dB(DF,DG)\|f-g\|_{\infty} \geq d_I(F,G) \geq d_B(\mathcal{D}F, \mathcal{D}G)

where dId_I is interleaving distance, dBd_B is colored bottleneck distance, and DF\mathcal{D}F denotes the multicolored persistence diagram (Bergomi et al., 2019).

  • In semisimple Abelian categories, equality holds:

dI(F,G)=dB(DCF,DCG)d_I(F,G) = d_B(\mathcal{D}_\mathcal{C}F, \mathcal{D}_\mathcal{C}G)

confirming that stable rank—as encoded in colored diagrams—fully captures categorical distance preservation (Bergomi et al., 2019).

4. Stability of Rank Invariants in Persistent Homology

Rank invariants underpin the stability of multidimensional and one-parameter persistence modules:

  • For XX a triangulable space and f,g:XRnf, g: X \to \mathbb{R}^n continuous, the multidimensional rank invariant ρf,k(u,v)\rho_{f,k}(u,v) tracks the dimension of persistent homology classes across lower-level sets. The matching (bottleneck) distance Dmatch(ρf,ρg)D_\mathrm{match}(\rho_f, \rho_g) between their rank invariants is stably bounded:

Dmatch(ρf,k,ρg,k)fgD_\mathrm{match}(\rho_{f,k}, \rho_{g,k}) \leq \|f - g\|_\infty

(0908.0064, Frosini et al., 2010). This uniform bound generalizes the classical bottleneck stability.

  • For two tame persistence modules V,WV, W, the interleaving distance dId_I lower-bounds the uniform difference of rank invariants:

dI(V,W)supuvρV(u,v)ρW(u,v)d_I(V,W) \geq \sup_{u \preceq v} |\rho_V(u,v) - \rho_W(u,v)|

(Landi, 2014).

  • In domain perturbation, encoding sets via distance functions or densities (Hausdorff, symmetric difference, sup-norm) yields stability bounds for the matching distance between rank invariants in terms of the underlying set metric (Frosini et al., 2010).

5. Stable Rank Invariants and 1-Lipschitz Robustness

Hierarchical stabilization converts discrete rank invariants into robust, 1-Lipschitz stable rank functions:

  • For a chosen contour CC, the pseudometric dCd_C between persistence modules induces the stabilized rank

rank^C(V)(t)=min{rank(W)dC(V,W)t}\widehat{\mathrm{rank}}_C(V)(t) = \min\{\mathrm{rank}(W) \mid d_C(V, W) \leq t\}

which is nonincreasing, additive, and monotonic (Chachólski et al., 2019, Agerberg et al., 2023).

  • The interleaving distance between stable ranks is bounded by the pseudometric:

dC(V,W)d(rank^C(V),rank^C(W))d_C(V, W) \geq d_\bowtie(\widehat{\mathrm{rank}}_C(V), \widehat{\mathrm{rank}}_C(W))

and

rank^C(V)(t)rank^C(W)(t)dC(V,W)|\widehat{\mathrm{rank}}_C(V)(t) - \widehat{\mathrm{rank}}_C(W)(t)| \leq d_C(V, W)

(Chachólski et al., 2019, Agerberg et al., 2023).

  • In the context of algebraic Wasserstein distances, stable rank functions are efficiently computable and can be tuned via interpretable parameters (norm order pp, contour CC) to reflect task-specific, robust geometry. These invariants are 1-Lipschitz and support metric learning pipelines (Agerberg et al., 2023).

6. Applications and Algorithmic Implications

Stable rank and distance preservation are foundational in several algorithmic and geometric contexts:

  • Sparsest Cut and spectral clustering: Stable rank enables polynomial-time cut rounding in graphs with low threshold-rank SDP relaxations, with quality directly controlled by stable rank rather than ambient dimension (Deshpande et al., 2015).
  • Dimension reduction: Bulk JL embeddings and SR-based dimension formulas allow substantial reduction in target dimensions when intrinsic stable rank is high, crucial for scalable geometric data analysis and privacy-preserving random projections (Casey, 2023).
  • Persistent homology: Stable rank invariants facilitate reliable, noise-resistant classification (e.g., in point-processes, time-series, and artery tree data), with accuracy sensitively tunable via contour choices (Chachólski et al., 2019, Agerberg et al., 2023).
Context Stable Rank Role Distance Preservation Mechanism
Metric embeddings Limits average distortion Spectral embeddings: 221\ell_2^2 \to \ell_1
JL Lemmas Enables reduced target dimension Bulk distance preservation in random projections
Persistent homology Provides robust invariant feature map Bottleneck/interleaving stability (1-Lipschitz)
Wasserstein invariants Allows task-adaptive feature learning Parameterized pseudometrics, interpretable tuning

In summary, stable rank quantifies the effective dimension, spread, or complexity governing the stability of geometric and topological invariants under perturbation and embedding. Its presence ensures that metric distortion or topological loss is uniformly controlled in algorithms, categorical frameworks, and data analysis settings. The robust, 1-Lipschitz behavior of stabilized rank invariants underlies their suitability for practical, discriminative, and reliable applications across mathematics, computer science, and applied topology.

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Stable Rank and Distance Preservation.