$G$-Mapper: Learning a Cover in the Mapper Construction (2309.06634v3)

Published 12 Sep 2023 in cs.LG, math.AT, and stat.ML

Abstract: The Mapper algorithm is a visualization technique in topological data analysis (TDA) that outputs a graph reflecting the structure of a given dataset. However, the Mapper algorithm requires tuning several parameters in order to generate a ``nice" Mapper graph. This paper focuses on selecting the cover parameter. We present an algorithm that optimizes the cover of a Mapper graph by splitting a cover repeatedly according to a statistical test for normality. Our algorithm is based on $G$-means clustering which searches for the optimal number of clusters in $k$-means by iteratively applying the Anderson-Darling test. Our splitting procedure employs a Gaussian mixture model to carefully choose the cover according to the distribution of the given data. Experiments for synthetic and real-world datasets demonstrate that our algorithm generates covers so that the Mapper graphs retain the essence of the datasets, while also running significantly fast.

References (46)

Authors (7)

Enrique Alvarado (7 papers)
Robin Belton (6 papers)
Emily Fischer (2 papers)
Kang-Ju Lee (5 papers)
Sourabh Palande (5 papers)
Sarah Percival (8 papers)
Emilie Purvine (28 papers)

Citations (2)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

$G$-Mapper: Learning a Cover in the Mapper Construction (2309.06634v3)

Summary

Related Papers