Papers
Topics
Authors
Recent
Search
2000 character limit reached

Learnable Laplacian Positional Encodings

Updated 24 June 2026
  • LLPE are learnable positional encoding techniques that utilize the graph Laplacian’s spectral properties to generate robust and expressive node embeddings.
  • They employ learnable spectral filters and message-passing GNNs to capture both global and local graph structures while avoiding the computational bottlenecks of traditional eigenvector methods.
  • These methods deliver state-of-the-art performance on various benchmarks, offering scalable, adaptive, and theoretically grounded alternatives for graph representation learning.

Learnable Laplacian Positional Encodings (LLPE) are a class of techniques that leverage the spectral properties of a graph’s Laplacian operator to construct node representations optimized for use in graph neural networks (GNNs) and related models. These methods address the core challenges of capturing global and local topological information, efficiently scaling to large graphs, and ensuring robustness under graph perturbations or task shifts.

LLPE integrates principles from spectral graph theory, deep learning, and positional encoding, permitting learnable, often end-to-end differentiable, mappings from graph structure to node embeddings. These encodings have demonstrated both theoretical and empirical advantages over fixed Laplacian eigenvector–based methods, especially regarding stability, expressivity, scalability, and adaptivity to homophilous and heterophilous regimes.

1. Spectral Foundations and Laplacian Eigenvector Encodings

Let G=(V,E)G = (V, E), with adjacency matrix ARn×nA \in \mathbb{R}^{n \times n} and degree matrix D=diag(d1,,dn)D = \mathrm{diag}(d_1,\ldots,d_n). The (combinatorial) Laplacian is L=DAL = D - A. Its eigendecomposition is L=UΛUL = U \Lambda U^\top with orthonormal UU and diagonal Λ=diag(λ1,,λn)\Lambda = \operatorname{diag}(\lambda_1,\ldots,\lambda_n), 0=λ1λ2λn0 = \lambda_1 \leq \lambda_2 \leq \cdots \leq \lambda_n.

Classical Laplacian eigenvector positional encodings (PEs) use either the full matrix UU or the truncated UkU_k (the first ARn×nA \in \mathbb{R}^{n \times n}0 columns) as node features. These spectral encodings provide permutation equivariance and, in principle, can distinguish non-isomorphic graphs (greater expressivity than 1-WL). However, they have several key limitations:

  • Computational cost: Explicit eigendecomposition is ARn×nA \in \mathbb{R}^{n \times n}1 in time, ARn×nA \in \mathbb{R}^{n \times n}2 in memory, and ARn×nA \in \mathbb{R}^{n \times n}3 for truncated ARn×nA \in \mathbb{R}^{n \times n}4-eigenvector use—a severe bottleneck for large graphs.
  • Instability: As established by Davis-Kahan and related analyses, eigenvectors of ARn×nA \in \mathbb{R}^{n \times n}5 are highly sensitive to small perturbations if the spectral gap ARn×nA \in \mathbb{R}^{n \times n}6 is small, leading to non-robustness under even minor noise or structure changes.
  • Non-uniqueness: Eigenvectors are defined up to sign and rotation within degenerate eigenspaces; care is required to maintain permutation equivariance and invariance to basis choices (Huang et al., 2023).

2. Core Formulations of Learnable Laplacian Positional Encodings

LLPE frameworks overcome the above limitations by replacing or augmenting explicit eigenvector use with learnable, permutation-equivariant mappings. The following paradigms now anchor the field:

2.1 Filtered Spectral LLPE

LLPE can be parameterized by learnable spectral filters ARn×nA \in \mathbb{R}^{n \times n}7, operating over the entire Laplacian spectrum. For each output dimension ARn×nA \in \mathbb{R}^{n \times n}8, a scalar filter ARn×nA \in \mathbb{R}^{n \times n}9 is constructed as a Chebyshev polynomial of order D=diag(d1,,dn)D = \mathrm{diag}(d_1,\ldots,d_n)0:

D=diag(d1,,dn)D = \mathrm{diag}(d_1,\ldots,d_n)1

where D=diag(d1,,dn)D = \mathrm{diag}(d_1,\ldots,d_n)2 is the normalized eigenvalue and D=diag(d1,,dn)D = \mathrm{diag}(d_1,\ldots,d_n)3 is the D=diag(d1,,dn)D = \mathrm{diag}(d_1,\ldots,d_n)4-th Chebyshev polynomial. Node encodings are then propagated as D=diag(d1,,dn)D = \mathrm{diag}(d_1,\ldots,d_n)5. This enables the model to control the frequency content explicitly—learning to emphasize low, mid, or high-frequency modes, crucial for capturing both homophilous and heterophilous graph structures (Ito et al., 29 Apr 2025).

2.2 GNN-based LLPE via Message Passing

An alternative, scalable approach instantiates LLPE via message-passing GNNs (e.g., GIN), viewing GNN layers as nonlinear, permutation-equivariant spectral filters. In PEARL (2502.01122), two cases are distinguished:

  • R-PEARL (Random Initialization): Each node is seeded with D=diag(d1,,dn)D = \mathrm{diag}(d_1,\ldots,d_n)6 i.i.d. random vectors. The GNN processes each sampled input, pooling the D=diag(d1,,dn)D = \mathrm{diag}(d_1,\ldots,d_n)7 outputs (typically via mean) to produce an equivariant, spectrally expressive node embedding. This approximates general basis-invariant mappings without explicit eigendecomposition and operates in near-linear time for large graphs.
  • B-PEARL (Basis Initialization): For small graphs, the GNN is initialized with standard basis vectors (one per node) and output features are pooled via summation. With polynomial GNN filters, this approach can recover expressive basis-invariant encodings, with cost D=diag(d1,,dn)D = \mathrm{diag}(d_1,\ldots,d_n)8.

Both methods enable LLPE to bypass the cubic complexity and stability pitfalls of direct spectral encodings, achieving high expressivity by leveraging the universality of message-passing networks.

2.3 Hybrid Approaches and Integration with Standard GNNs

Some architectures, such as LSPE (Dwivedi et al., 2021), maintain explicit positional streams initialized via a Laplacian embedding (e.g., D=diag(d1,,dn)D = \mathrm{diag}(d_1,\ldots,d_n)9 with learned L=DAL = D - A0), updating these streams alongside structural ones through GNN layers. This decoupling sharpens both local and global representational power and allows for interactively regularizing the positional component (e.g., via a Laplacian eigenvector loss).

Other works integrate a small learnable GNN or MLP to refine raw Laplacian eigenvector features (Wijesinghe et al., 22 Feb 2025), particularly in self-supervised contrastive learning, or combine them with higher-order, multi-hop, or walk-based features.

3. Theoretical Properties: Stability, Expressivity, and Universality

LLPE methods bring forth the following provable properties:

  • Stability: Learnable GNN LLPEs with Lipschitz-continuous nonlinearities and bounded filters inherit Lipschitz continuity with respect to graph perturbations. For R-/B-PEARL (as formalized in (2502.01122)), node encodings obey

L=DAL = D - A1

where L=DAL = D - A2 is the number of GNN layers and L=DAL = D - A3 is mild in L=DAL = D - A4—independent of spectral gap.

  • Expressivity: LLPEs using GNNs or Chebyshev spectral filtering are universal for continuous, basis-invariant functions over the Laplacian. This means that for any such L=DAL = D - A5, there exists a parameterization of LLPE that can approximate it to arbitrary precision (2502.01122, Huang et al., 2023). LLPEs can count cycles of length up to 7 (R-PEARL, via nonlinearities) and generalize the power of 1-WL and higher.
  • Adaptivity to Homophily/Heterophily: Filter-based LLPE methods adaptively learn which parts of the spectrum—corresponding to different structural scales—are selected for the task. On stochastic block models (SBMs), LLPEs can exactly recover communities regardless of whether structure lies in the low- or high-frequency spectrum (Ito et al., 29 Apr 2025).
  • Generalization: Rademacher complexity bounds for Chebyshev LLPE show generalization error does not grow with spectrum truncation order, contrasting with MLP-based approaches that scale poorly with L=DAL = D - A6 (Ito et al., 29 Apr 2025).

4. Empirical Performance and Benchmarking

Multiple LLPE instantiations have demonstrated state-of-the-art results and superior scaling across synthetic, molecular, and large-scale real-world graphs:

Method Complexity Expressivity Stability Example Benchmarks
R-PEARL (2502.01122) L=DAL = D - A7 L=DAL = D - A81-WL Yes REDDIT-B, REDDIT-M, OGB, DrugOOD
B-PEARL (2502.01122) L=DAL = D - A9 Full spectral Yes ZINC (logP), DrugOOD, RelBench
Chebyshev LLPE (Ito et al., 29 Apr 2025) L=UΛUL = U \Lambda U^\top0 (L=UΛUL = U \Lambda U^\top1 approx.) Task-adaptive Yes SBMs, Power-law, TEXAS, Penn94
SPE (Huang et al., 2023) L=UΛUL = U \Lambda U^\top2 Universal Provable ZINC, Alchemy, DrugOOD, Cycle counting
LSPE (Dwivedi et al., 2021) Linear (sparse GNNs) Task-adaptive Optional† ZINC, OGBG-MOLTOX21, OGBG-MOLPCBA

†With Laplacian eigenvector regularization.

Key results:

  • R-PEARL achieves comparable or superior performance to SignNet and other previous eigenvector-based methods at far lower computational cost; mean accuracy 94.5% on REDDIT-B with L=UΛUL = U \Lambda U^\top3 complexity (2502.01122).
  • Chebyshev LLPE (with order L=UΛUL = U \Lambda U^\top4) yields up to 35% relative accuracy gain on heterophilous SBMs, and up to 14% absolute gain on real heterophilous benchmarks (e.g., Penn94) (Ito et al., 29 Apr 2025).
  • SPE provides provable robustness to OOD shifts with optimal performance in stability–expressivity tradeoff regimes, outperforming earlier basis-invariant networks (Huang et al., 2023).
  • LSPE and message-passing LLPEs close the gap between fixed PE (often overfitting on small data) and flexible but data-driven representation, achieving up to 64.1% error reduction on ZINC (Dwivedi et al., 2021).

5. Implementation, Complexity, and Practical Guidance

Implementation proceeds via:

  1. Spectral Filtering Approaches:
    • Compute the Laplacian eigendecomposition (L=UΛUL = U \Lambda U^\top5), or—on large graphs—the first and last L=UΛUL = U \Lambda U^\top6 eigenpairs using Arnoldi-type sparse solvers (L=UΛUL = U \Lambda U^\top7).
    • Normalize eigenvalues to L=UΛUL = U \Lambda U^\top8.
    • Choose Chebyshev order L=UΛUL = U \Lambda U^\top9, PE dimension UU0, and initialize UU1 coefficients.
    • Synthesize UU2, multiply with eigenvectors to form node PEs, and concatenate with node features.
    • Optimize end-to-end with the graph model using cross-entropy or task-specific losses, along with regularization (e.g., UU3, UU4 on UU5).
  2. GNN-based LLPE:
    • Seed each node with UU6 random scalars (R-PEARL) or standard basis vectors (B-PEARL) as initial features.
    • Run a shared UU7-layer GNN for each sample/input, apply statistical pooling (mean or sum) to enforce permutation equivariance.
    • For large graphs, select UU8 and GNN filter order UU9 according to the desired spectral complexity.
    • Normalize learned embeddings for tighter control of stability constants.
    • Feed output positional features to downstream GNNs or graph transformers for training.

Chebyshev LLPE methods scale efficiently up to Λ=diag(λ1,,λn)\Lambda = \operatorname{diag}(\lambda_1,\ldots,\lambda_n)0, with sparse methods enabling partial spectrum approximation for Λ=diag(λ1,,λn)\Lambda = \operatorname{diag}(\lambda_1,\ldots,\lambda_n)1. GNN-based LLPE attains near-linear scaling for sparse graphs and can be deployed under strict resource constraints.

6. Extensions and Open Directions

Significant extensions of LLPE frameworks include:

  • Learning with generalized graph shift operators—normalized Laplacian, random-walk matrices, or edge-weighted variants—in the same spectral or message-passing framework (2502.01122).
  • Combining absolute positional encodings (LLPE) with relative or pairwise-structural embeddings, enabling richer graph representations.
  • Task-dependent initializations or anchor-based distributions for random or basis GNN initialization, focusing on motifs or local substructures.
  • Development of scalable, incremental, or streaming variations for dynamic graphs.
  • Extending LLPE to directed, heterogeneous, or hypergraph settings where traditional Laplacian theory does not readily apply (Huang et al., 2023).
  • Investigation of optimal spectral truncation (choice of Λ=diag(λ1,,λn)\Lambda = \operatorname{diag}(\lambda_1,\ldots,\lambda_n)2) and filter order Λ=diag(λ1,,λn)\Lambda = \operatorname{diag}(\lambda_1,\ldots,\lambda_n)3 to balance computational tractability with information richness.
  • Theoretical characterization of LLPE under non-uniform graph size distributions and in the presence of high-multiplicity eigenvalues.

A plausible implication is that further integration of spectral and structure-aware encoding, coupled with adaptivity to both global and local graph variation, will underpin advances in generalizable, scalable, and robust graph learning.

7. LLPE in Coordinate-MLPs and Continuous Domains

Adaptations of LLPE beyond standard graphs have been proposed, most notably for coordinate-MLPs. Here, LLPE regularizes instance-specific embedding parameters (e.g., RBF widths Λ=diag(λ1,,λn)\Lambda = \operatorname{diag}(\lambda_1,\ldots,\lambda_n)4), via graph-Laplacian penalties on a similarity graph over embedded coordinates (Ramasinghe et al., 2021). This approach:

  • Uses a super-Gaussian radial basis embedding with coordinate-dependent Λ=diag(λ1,,λn)\Lambda = \operatorname{diag}(\lambda_1,\ldots,\lambda_n)5 fit via a low-degree polynomial in the local Jacobian norm of the target function.
  • Incorporates a Laplacian penalty to enforce embedding smoothness adaptively, balancing memorization and generalization.
  • Results in more stable, generalizable, and plug-and-play intermediate positional embeddings for deep coordinate-based models, outperforming both hand-tuned and end-to-end randomly initialized Fourier features.

References

LLPE synthesizes advances from spectral graph theory, nonlinear permutation-equivariant learning, and positional embedding design, yielding state-of-the-art, theoretically grounded, and scalable encodings for a broad set of modern graph learning problems.

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Learnable Laplacian Positional Encodings (LLPE).