Topological Hierarchical Decompositions (2312.10239v1)
Abstract: Topological data analysis is an emerging field that applies the study of topological invariants to data. Perhaps the simplest of these invariants is the number of connected components or clusters. In this work, we explore a topological framework for cluster analysis and show how it can be used as a basis for explainability in unsupervised data analysis. Our main object of study will be hierarchical data structures referred to as Topological Hierarchical Decompositions (THDs). We give a number of examples of how traditional clustering algorithms can be topologized, and provide preliminary results on the THDs associated with Reeb graphs and the mapper algorithm. In particular, we give a generalized construction of the mapper functor as a pixelization of a cosheaf in order to generalize multiscale mapper.