Homological Neural Networks
- Homological Neural Networks (HNNs) are neural architectures that leverage simplicial complexes and algebraic topology to construct sparse, interpretable models for complex, high-dimensional data.
- They extract topological features through data-driven dependency estimation and clique complex construction, enforcing fixed wiring and high sparsity in neural connections.
- Empirical results show that HNNs achieve competitive performance with state-of-the-art methods while ensuring model stability, interpretability, and lower computational cost.
Homological Neural Networks (HNNs) are a class of neural architectures that leverage combinatorial topology—particularly the structure of simplicial complexes and their associated algebraic invariants—to encode sparse, interpretable, and compositionally structured models for high-dimensional data. Rooted in the intersection of topological data analysis and neural computation, HNNs systematically incorporate higher-order interactions via topological filtrations, incidence structures, and explicit homological constructions. Their design is motivated by the need to reconcile data complexity and sample-efficiency with the tractability, interpretability, and stability of learned models, especially in the regime of multivariate tabular data or compositional generative tasks.
1. Formal Foundations: Simplicial Complexes and Homology
The mathematical core of HNNs is the simplicial complex. For a set of variables , a simplicial complex is a set of subsets (“simplices”) of , closed under subset inclusion: if , then any also lies in (Wang et al., 2023, Lin et al., 14 May 2026). A -simplex is defined as a set of vertices, and the entire hierarchy spans from vertices (0-simplices), edges (1-simplices), faces (2-simplices), and so on.
Given a simplicial complex , one assigns to each dimension 0 a vector space of 1-chains 2. The boundary operator 3 is linearly defined on basis elements via
4
where the hat denotes omission of the 5-th vertex. The 6-th homology group 7 encodes 8-dimensional topological holes, with dimension 9 (Betti number) quantifying their count.
HNN architectures are built upon these algebraic and combinatorial structures: neurons and connections correspond—respectively—to simplices and their face/containment relations, directly mirroring the topology of the data relationships (Wang et al., 2023, Lin et al., 14 May 2026).
2. Architectural Construction: From Data to Homological Wiring
The HNN pipeline incorporates data-driven topology extraction and strict wiring constraints (Wang et al., 2023, Lin et al., 14 May 2026):
- Dependency Estimation: Pairwise (or higher-order) dependencies (e.g., Pearson correlations, mutual information) are computed on the variables, forming a weighted dependency matrix 0.
- Sparse Graph Construction: Algorithms such as the Triangulated Maximally Filtered Graph (TMFG) or Maximally Filtered Clique Forest (MFCF) produce sparse, chordal graphs consistent with data dependencies. TMFG, for instance, iteratively leverages four-vertex cliques (tetrahedra) to maximize locality of association.
- Clique Complex Extraction: Maximal cliques in the sparse graphical model are identified and closed under inclusion to obtain the clique complex, which defines a simplicial complex structure over the variable set (Lin et al., 14 May 2026).
- Layered Homological Representation: Neurons are instantiated for each simplex in each dimension. Connections only exist between each 1-simplex and its 2-dimensional faces; the corresponding weight matrices 3 are strictly masked by unsigned incidence matrices 4, enforcing that
5
This results in a fixed-wiring, highly sparse DNN, where typical sparsity exceeds 90% (Lin et al., 14 May 2026, Wang et al., 2023).
- Forward Propagation: Activations propagate hierarchically, with each unit aggregating (weighted) signals only from its combinatorial faces. Nonlinearities (ReLU, LeakyReLU) follow each aggregation. All-layer readouts enable explicit decomposition by interaction order.
3. Theoretical Motivation: Compositional Sparsity and Inductive Biases
HNNs are motivated by the principle of compositional sparsity: high-dimensional functions of the form
6
with each 7 of bounded size, admit sample-efficient learning if architecture and parameterization exploit this compositional structure (Lin et al., 14 May 2026). By mapping maximal cliques to simplices and restricting neuron aggregation to faces, HNNs instantiate an inductive bias mirroring the hierarchical composition of generative functions.
The architecture is completely determined from the data, and structural inference (variable dependency, clique extraction) is entirely decoupled from parameter learning (“fixed wiring”) (Lin et al., 14 May 2026). This results in orders-of-magnitude reductions in trainable parameters relative to comparably expressive dense MLPs. Empirical results demonstrate stability as model dimension grows and significantly lower sensitivity to hyperparameter choices.
4. Relation to Neural Representations and Persistent Homology
Beyond architectural design, homological methods have been employed to analyze and interpret neural representations. ReLU networks induce a partition of input space into convex polyhedral regions 8, within which the map is affine (Beshkov, 3 Feb 2025). For a dataset manifold 9, the neural representation 0 has homology groups isomorphic to the relative homology 1, where the “overlap decomposition” 2 detects input equivalence across glued polyhedra (Beshkov, 3 Feb 2025). This opens the door to purely topological (rather than geometric) feature analysis and clarifies the effect of neural composition on topological invariants.
Persistent homology features—such as barcodes and persistence diagrams—have also been used as supervision targets or as neural inputs/outputs. For instance, neural networks (CNNs on images, GNNs on complexes) can be trained to predict or approximate persistence-based features, including stable tropical coordinates and filtered persistence images, directly from data (Montúfar et al., 2020).
5. Algorithmic Recipes and Practical Implementation
HNN Construction and Training Pipeline
A typical end-to-end HNN training workflow includes:
- Compute variable dependencies (correlation or information-based) on training data.
- Construct the sparse chordal graph (TMFG or MFCF).
- Extract the clique/simplicial complex.
- Generate incidence matrices 3 for all 4.
- Instantiate and initialize masked weight matrices 5 and readout layers.
- Train weights using Adam or similar optimizer, with early stopping and layer normalization (Wang et al., 2023, Lin et al., 14 May 2026).
Sparsity is enforced structurally—no pruning or 6 regularization is required. Layer readouts preserve interpretability by decomposition order.
Model Configurations
| Model Type | Typical Parameter Count | Sparsity |
|---|---|---|
| Tabular HNN | 7M (vs. 8M dense) | 9 |
| LSTM-HNN (solar) | 0M (vs. 1M) | 2 |
| HNN (OpenML-CTR23) | 3k–4k (MLP: 17–115k) | 90–99% |
6. Empirical Results, Interpretability, and Applications
Benchmarking and Sample Complexity
HNNs have been empirically evaluated on synthetic tasks (Erdős–Rényi precision matrices, nonlinear clique sum generative models) and real-world tabular and time-series datasets (Wang et al., 2023, Lin et al., 14 May 2026). Key findings include:
- HNNs sustain high 5 in low-sample or high-dimensional regimes where dense MLPs collapse.
- On real datasets, HNNs are consistently competitive with state-of-the-art tree ensembles (e.g., XGBoost, LightGBM) while requiring far fewer parameters.
- In time-series (solar-energy, exchange rate) forecasting, LSTM-HNN hybrids attain RSE and correlation on par with the best spatio-temporal models.
Interpretability
Due to the one-to-one mapping between neurons, simplices, and variables, HNNs provide explicit structural interpretability. Betti-number statistics of activations can be tracked to detect topological features and to understand the emergence of global “holes” (cycles, voids) in the feature composition (Wang et al., 2023, Lin et al., 14 May 2026). Learned weights can be visualized on TMFG or clique-forest structures to highlight critical dependencies and higher-order interactions.
Efficiency
HNNs achieve 6 per-sample computational complexity for sparse architectures (7=number of variables), contrasting with 8 in fully connected nets. Measurements indicate %%%%4950%%%% speedup and 135% lower energy consumption per epoch compared to dense MLP baselines (Wang et al., 2023).
7. Extensions, Limitations, and Open Problems
Extensions include the learning of graph topology via differentiable optimization, integrating GNN-style message passing into the homological backbone, and leveraging higher-dimensional simplices for domains naturally supporting larger variable interactions (Wang et al., 2023). Tracking the evolution of the structural topology in streaming or time-varying data is another active direction.
A limitation is that HNNs, in their standard form, do not explicitly compute homology groups or Betti numbers during inference; instead, they rely on the combinatorial and hierarchical inductive structure of the simplicial complex. A plausible implication is that future architectures may combine explicit relative homology computation (e.g., as in (Beshkov, 3 Feb 2025)) with neural learning objectives, enabling models with “features as homology classes” rather than as static summary statistics or diagram-based features.
Homological Neural Networks represent a principled synthesis of algebraic topology and deep learning, instantiating compositional inductive biases via the language of simplices, faces, and incidence, and enabling interpretable, efficient, and scalable models for high-dimensional and structured data (Wang et al., 2023, Lin et al., 14 May 2026, Montúfar et al., 2020, Beshkov, 3 Feb 2025).