Papers
Topics
Authors
Recent
2000 character limit reached

CoLaS: Copula-Seeded Local Latent-Space Graphs

Updated 25 December 2025
  • The paper introduces CoLaS, a modular latent-variable graph model that unifies degree heterogeneity, persistent clustering, and assortativity through a copula-based framework.
  • The model separates marginal specifications from dependence structure by coupling node popularity and latent spatial locations, enabling explicit parameter control.
  • The CoLaS-HT extension further refines the approach to produce power-law degree distributions while preserving network sparsity and locality.

CoLaS (Copula-Seeded Local Latent-Space Graphs) is a modular latent-variable random graph model that unifies degree heterogeneity, persistent clustering, and systematic degree mixing within sparse regimes. CoLaS separates marginal specifications from dependence structure by utilizing a copula to couple node "popularity" and latent geometric location, introducing explicit and interpretable parameter control for assortativity. The framework is supported by sparse-limit theory for degree distributions, transitivity, and assortativity, and includes a minimal extension—CoLaS-HT—that enables power-law degree tails while preserving sparsity and locality (Papamichalis et al., 23 Dec 2025).

1. Latent Variable Construction and Edge Formation

Each node ii is assigned latent marks:

  • Popularity WiFWW_i \sim F_W controlling degree heterogeneity.
  • Location XiUnif(Td)X_i \sim \mathrm{Unif}(\mathbb{T}^d), where Td\mathbb{T}^d is the dd-dimensional torus, specifying spatial locality.

The joint distribution of (Wi,Xi)(W_i, X_i) is constructed using Sklar's theorem with a copula CθC_\theta on [0,1]d+1[0,1]^{d+1}. Writing Ui=FW(Wi)U_i = F_W(W_i) and Vi=XiV_i = X_i, this yields (Ui,Vi)Cθ(U_i, V_i) \sim C_\theta, so marginals are preserved: FW,X(w,x)=Cθ(FW(w),FX(x))F_{W,X}(w,x) = C_\theta(F_W(w), F_X(x)), with FX(x)F_X(x) uniform.

Given the latent marks, edges are conditionally independent: for {Aij}i<j\{A_{ij}\}_{i<j}, the edge indicators are independent Bernoulli variables conditional on {(Wi,Xi)}i=1n\{(W_i, X_i)\}_{i=1}^n.

Edge probabilities under the fixed-range rule are:

pij(n)=1exp{λρnWiWjk(XiXjεn)}p^{(n)}_{ij} = 1 - \exp\left\{ -\frac{\lambda}{\rho_n} W_iW_j\, k\left(\frac{X_i - X_j}{\varepsilon_n}\right) \right\}

where λ>0\lambda > 0 is global intensity, k:Rd[0,)k: \mathbb{R}^d \to [0, \infty) a local kernel with compact support, εn0\varepsilon_n \downarrow 0 a shrinking-range sequence, and ρn=nεndρ(0,)\rho_n = n\varepsilon_n^d \to \rho \in (0, \infty).

2. Control of Degree Mixing and Separation of Marginals

Degree mixing (assortativity) is engineered through the copula parameter θ\theta, which modulates the degree to which higher popularity (WW) aligns with favorable spatial locations (XX), while keeping FWF_W and the law of XX fixed.

For conditional moments:

mp,θ(x)=E[WpX=x],p=1,2,3,m_{p,\theta}(x) = \mathbb{E}\left[ W^p \mid X = x \right], \quad p=1,2,3,\dots

which underlie large-nn limiting behavior.

Endpoint assortativity in the sparse regime converges to:

r(θ)=Covνθ(Λθ(Z),Λθ(Z))+Eνθ[Γθ(Z,Z)]Varνθ(Λθ(Z))+Eνθ[Λθ(Z)]r(\theta) = \frac{ \mathrm{Cov}_{\nu_\theta}(\Lambda_\theta(Z), \Lambda_\theta(Z')) + \mathbb{E}_{\nu_\theta}[\Gamma_\theta(Z, Z')] } { \mathrm{Var}_{\nu_\theta}(\Lambda_\theta(Z)) + \mathbb{E}_{\nu_\theta}[\Lambda_\theta(Z)] }

where νθ\nu_\theta is the limiting edge–Palm law, Λθ(z)\Lambda_\theta(z) is the limiting degree intensity, and Γθ(z,z)\Gamma_\theta(z, z') is the limiting common-neighbor intensity.

The copula construction ensures that all changes to mixing properties arise through its concordance, with marginals unchanged for all θ\theta.

3. Sparse-Limit Degree Distributions and Tail Dichotomy

In the fixed-range CoLaS regime with a compact kernel, degree distributions obey a mixed-Poisson limit:

DidPoisson(Λθ(Wi,Xi)),D_i \xrightarrow{d} \mathrm{Poisson}(\Lambda_\theta(W_i, X_i)),

with

Λθ(w,x)=ρRdE[1exp(λρwWk(u))X=x]du\Lambda_\theta(w, x) = \rho \int_{\mathbb{R}^d} \mathbb{E}\left[ 1 - \exp\left( -\frac{\lambda}{\rho} w W' k(u) \right) \bigg| X = x \right] du

This yields a universal "degree-tail dichotomy":

  • For bounded, fixed-range kernels, Λθ(W,X)M<\Lambda_\theta(W, X) \leq M < \infty a.s., and thus DD is stochastically dominated by Poisson(M)\mathrm{Poisson}(M), forcing degree distributions to have exponentially light tails, regardless of the popularity marginal.

The inability to produce power-law degrees in this sparse, fixed-range context motivates the extension described in the next section.

4. Persistent Clustering and Transitivity

Clustering, quantified by global transitivity, remains nonvanishing in the sparse local regime:

Cn=3Tni(Di2)PC(θ):=2E[τθ(W,X)]E[Λθ(W,X)2]C_n = \frac{3 T_n}{\sum_i \binom{D_i}{2}} \xrightarrow{\mathbb{P}} C(\theta) := \frac{2 \mathbb{E}\left[ \tau_\theta(W, X) \right]}{\mathbb{E}[\Lambda_\theta(W, X)^2]}

where triangle counts TnT_n and the function

τθ(w,x)=ρ22Rd×RdE[qw,x(u;W1)qw,x(v;W2)qW1,W2(uv)X=x]dudv\tau_\theta(w, x) = \frac{\rho^2}{2} \iint_{\mathbb{R}^d \times \mathbb{R}^d} \mathbb{E}\bigl[ q_{w, x}(u; W_1) q_{w, x}(v; W_2) q_{W_1, W_2}(u-v) \mid X = x \bigr] du \, dv

with qa,b(u)=1exp{λρabk(u)}q_{a, b}(u) = 1 - \exp\{ -\frac{\lambda}{\rho} ab k(u)\}.

The numerator counts rooted triangles, while the squared intensity in the denominator counts wedges; their ratio determines asymptotic clustering.

5. Tail Inheritance: The CoLaS-HT Extension

To achieve genuine power-law degree distributions while retaining locality and sparsity, CoLaS introduces a tail-inheriting extension ("CoLaS-HT"). The key modification is to replace the fixed interaction range with a weight-dependent range:

(Aij=1Wi,Wj,Xi,Xj)=1exp{λρnk(XiXjεn(WiWj)1/d)}\P(A_{ij}=1 \mid W_i, W_j, X_i, X_j) = 1 - \exp \left\{ -\frac{\lambda}{\rho_n} k\left( \frac{X_i - X_j}{\varepsilon_n (W_i W_j)^{1/d}} \right) \right\}

The limiting degree intensity is now

ΛθHT(w,x)=ρκ2(λ)wm1,θ(x)\Lambda_\theta^{\mathrm{HT}}(w, x) = \rho \kappa_2^{(\lambda)} w \, m_{1,\theta}(x)

with κ2(λ)=(1e(λ/ρ)k(u))du\kappa_2^{(\lambda)} = \int \left(1 - e^{-(\lambda/\rho)k(u)} \right) du. If WW is regularly varying of index α\alpha, the limiting intensity and thus the mixed-Poisson degree also inherit this tail—a property absent from the fixed-range regime.

6. Model Calibration via One-Graph Estimation

Identification of the copula parameter θ\theta and the density parameter λ\lambda is feasible from a single observed graph under injectivity of the map θ(C(θ),r(θ))\theta \mapsto (C(\theta), r(\theta)):

  • Fix FWF_W, spatial dimension dd, and kernel kk.
  • Estimate λ\lambda by matching the empirical mean degree dˉ\bar{d} with ρnEW[Λθ(W,X)]\rho_n \mathbb{E}_W[\Lambda_\theta(W, X)].
  • Compute empirical global transitivity C^n\widehat{C}_n and endpoint assortativity r^n\widehat{r}_n.
  • Obtain θ^n\widehat{\theta}_n via minimum-distance moment matching:

θ^nargminθΘ(r^n,C^n)(r(θ),C(θ))22.\widehat{\theta}_n \in \arg \min_{\theta \in \Theta} \left\| (\widehat{r}_n, \widehat{C}_n) - (r(\theta), C(\theta)) \right\|_2^2.

A joint n\sqrt{n}-CLT for (C^n,r^n)(\widehat{C}_n, \widehat{r}_n) ensures consistency and asymptotic normality:

n(θ^nθ0)N(0,(GG)1GΣG(GG)1)\sqrt{n} (\widehat{\theta}_n - \theta_0) \Rightarrow \mathcal{N}\left( 0,\, (G^\top G)^{-1} G^\top \Sigma G (G^\top G)^{-1} \right)

where G=Dθ(r,C)θ0G = D_\theta(r, C)|_{\theta_0} is the Jacobian of the moment map, and Σ\Sigma is the covariance of (r^n,C^n)(\widehat{r}_n, \widehat{C}_n).

7. Conceptual Synthesis and Model Characteristics

CoLaS achieves explicit, modular separation of mechanisms:

  • Marginal degree heterogeneity is governed by FWF_W.
  • Clustering is controlled by the geometric kernel kk under shrinking range, preventing vanishing transitivity.
  • Degree mixing (assortativity) is tuned solely by the copula parameter θ\theta, without secondary rewiring.

This structure enables explicit large-nn limit theorems for degree distribution (mixed-Poisson laws), consistent estimation via one-graph calibration, and, with the CoLaS-HT extension, sharply distinguishes between light-tailed and heavy-tailed degree behaviors contingent on kernel and node-weight interactions. This provides a unified analytic framework for sparse empirical networks exhibiting heterogeneity, clustering, and assortativity, with direct parameter-to-mechanism correspondence (Papamichalis et al., 23 Dec 2025).

Definition Search Book Streamline Icon: https://streamlinehq.com
References (1)

Whiteboard

Topic to Video (Beta)

Follow Topic

Get notified by email when new papers are published related to CoLaS (Copula-Seeded Local Latent-Space Graphs).