Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
112 tokens/sec
GPT-4o
8 tokens/sec
Gemini 2.5 Pro Pro
47 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

TopoX: A Suite of Python Packages for Machine Learning on Topological Domains (2402.02441v5)

Published 4 Feb 2024 in cs.LG, cs.AI, stat.CO, and cs.MS

Abstract: We introduce TopoX, a Python software suite that provides reliable and user-friendly building blocks for computing and machine learning on topological domains that extend graphs: hypergraphs, simplicial, cellular, path and combinatorial complexes. TopoX consists of three packages: TopoNetX facilitates constructing and computing on these domains, including working with nodes, edges and higher-order cells; TopoEmbedX provides methods to embed topological domains into vector spaces, akin to popular graph-based embedding algorithms such as node2vec; TopoModelX is built on top of PyTorch and offers a comprehensive toolbox of higher-order message passing functions for neural networks on topological domains. The extensively documented and unit-tested source code of TopoX is available under MIT license at https://pyt-team.github.io/}{https://pyt-team.github.io/.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (25)
  1. Neural machine translation by jointly learning to align and translate. arXiv preprint arXiv:1409.0473, 2014.
  2. Topological signal processing over simplicial complexes. IEEE Transactions on Signal Processing, 68:2992–3007, 2020.
  3. Geometric deep learning: grids, groups, graphs, geodesics, and gauges. arXiv preprint:2104.13478, 2021.
  4. A comprehensive survey on geometric deep learning. IEEE Access, 8:35929–35949, 2020.
  5. Mathilde Papillon et al. Icml 2023 topological deep learning challenge: Design and results. In Proceedings of 2nd Annual Workshop on Topology, Algebra, and Geometry in Machine Learning (TAG-ML), October 2023.
  6. Hypergraph neural networks. Proceedings of the AAAI Conference on Artificial Intelligence, 33(01), 2019.
  7. Fast graph representation learning with PyTorch Geometric. In ICLR Workshop on Representation Learning on Graphs and Manifolds, 2019.
  8. Cell attention networks. arXiv preprint arXiv:2209.08179, 2022.
  9. Exploring network structure, dynamics, and function using networkx. Technical report, Los Alamos National Lab.(LANL), Los Alamos, NM (United States), 2008.
  10. Topological deep learning: Going beyond graph data.
  11. Cell complex neural networks. NeurIPS 2020 Workshop TDA and Beyond, 2020.
  12. ImageNet classification with deep convolutional neural networks. In Advances in Neural Information Processing Systems, pages 1097–1105, 2012.
  13. XGI: A Python package for higher-order interaction networks. Journal of Open Source Software, 8(85):5162, May 2023. doi: 10.21105/joss.05162. URL https://joss.theoj.org/papers/10.21105/joss.05162.
  14. Parallel algorithms for efficient computation of high-order line graphs of hypergraphs. In 2021 IEEE 28th International Conference on High Performance Computing, Data, and Analytics (HiPC), pages 312–321. IEEE, 2021.
  15. Architectures of topological deep learning: a survey on topological neural networks. arXiv preprint arXiv:2304.10031, 2023.
  16. PyTorch: an imperative style, high-performance deep learning library. In Advances in Neural Information Processing Systems 32, pages 8024–8035. Curran Associates, Inc., 2019.
  17. Scikit-learn: Machine learning in python. the Journal of machine Learning research, 12:2825–2830, 2011.
  18. Principled simplicial neural networks for trajectory prediction. In Int. Conf. Mach. Learn. PMLR, 2021.
  19. Karate Club: An API Oriented Open-source Python Framework for Unsupervised Learning on Graphs. In Proceedings of the 29th ACM International Conference on Information and Knowledge Management (CIKM ’20), page 3125–3132. ACM, 2020.
  20. Random walks on simplicial complexes and the normalized Hodge 1-Laplacian. SIAM Review, 62(2):353–391, 2020.
  21. Signal processing on higher-order networks: livin)’on the edge… and beyond. Signal Processing, 187:108149, 2021.
  22. Sequence to sequence learning with neural networks. In Advances in neural information processing systems, pages 3104–3112, 2014.
  23. Deep Graph Library: a graph-centric, highly-performant package for graph neural networks. arXiv preprint arXiv:1909.01315, 2019.
  24. Convolutional learning on simplicial complexes. arXiv preprint arXiv:2301.11163, 2023.
  25. Graph neural networks: a review of methods and applications. AI Open, 1:57–81, 2020.
Citations (8)

Summary

  • The paper introduces TopoX, a unified suite of Python packages that advances Topological Deep Learning through integrated domain construction, embedding, and neural modeling.
  • It details how TopoNetX manipulates higher-order topological cells and how TopoEmbedX extends node2vec embeddings to complex domains with over 95% code coverage.
  • The implementation of TopoModelX in PyTorch enables efficient higher-order message passing, positioning TopoX as a valuable tool for both research and education.

Overview of TopoX Software Suite

In the field of machine learning, the burgeoning field of Topological Deep Learning (TDL) has expanded the boundaries beyond traditional Euclidean spaces, encompassing topological domains such as simplicial and cellular complexes. This paper introduces TopoX, a suite of Python packages, which stands as a significant stride in TDL, offering an all-encompassing toolkit for machine learning applications within these complex domains. The suite consists of TopoNetX for domain construction and computation, TopoEmbedX for embedding such domains into vector spaces, and TopoModelX, a PyTorch-based framework for implementing neural networks in these domains.

TopoX Functionalities and Design

The design of TopoX is noted for its adaptation and abstraction from current graph-based tools. TopoNetX is instrumental in the manipulation of higher-order cells and complex structures, whereas TopoEmbedX extrapolates node2vec-like embeddings to topological spaces, serving to advance representation learning. TopoModelX is particularly innovative, introducing TDL models that leverage higher-order message passing functions. This comprehensive suite not only bridges a gap in practical implementation of TDL theories but also boasts robust documentation, exhaustive unit testing, and an open-source MIT license repository.

Comparative Advantages

TopoX is meticulously compared with other prominent Python libraries, marking its superiority in terms of functionalities and the range of topological domains catered. Boasting a code coverage of >=95% across its packages, TopoX demonstrates reliability and robustness parallel to, or exceeding, renowned libraries like PyG, DGL, and NetworkX. Its unified API design simplifies the transition across various topological structures and enhances collaborative and parallel computing efforts within the scientific community.

Practical Applicability and Educational Value

The educational potential of TopoX is showcased through example snippets demonstrating its intuitive interface for constructing complexes and applying TDL models. These rich examples, along with bundled Jupyter notebook tutorials, make it an invaluable learning tool for users new to the world of TDL. Whether for embedding using the Stanford bunny dataset or forwarding passes with simplicial neural networks, TopoX simplifies the intricate dance of topology and deep learning into user-friendly, actionable code blocks, anchoring its place as a bridge between theory and hands-on application.

In conclusion, TopoX equips researchers and enthusiasts with powerful tools to exploit the versatile landscape of topological domains, promising a leap forward for the interdisciplinary area where machine learning meets complex topology.

Github Logo Streamline Icon: https://streamlinehq.com