Homology-Preserving Multi-Scale Graph Skeletonization Using Mapper on Graphs (1804.11242v5)
Abstract: Node-link diagrams are a popular method for representing graphs that capture relationships between individuals, businesses, proteins, and telecommunication endpoints. However, node-link diagrams may fail to convey insights regarding graph structures, even for moderately sized data of a few hundred nodes, due to visual clutter. We propose to apply the mapper construction -- a popular tool in topological data analysis -- to graph visualization, which provides a strong theoretical basis for summarizing the data while preserving their core structures. We develop a variation of the mapper construction targeting weighted, undirected graphs, called {\mog}, which generates homology-preserving skeletons of graphs. We further show how the adjustment of a single parameter enables multi-scale skeletonization of the input graph. We provide a software tool that enables interactive explorations of such skeletons and demonstrate the effectiveness of our method for synthetic and real-world data.
- ASK-GraphView: A large scale graph visualization system. IEEE Trans. on Visual. and Comp. Graph., 12(5), 2006.
- Topolayout: Multilevel graph layout by topological features. IEEE Trans. on Visual. and Comp. Graph., 13(2), 2007.
- GrouseFlocks: Steerable exploration of graph hierarchy space. IEEE Trans. on Visual. and Comp. Graph., 14(4), 2008.
- Tugging graphs faster: Efficiently modifying path-preserving hierarchies for browsing paths. IEEE Trans. on Visual. and Comp. Graph., 17(3), 2010.
- The readability of path-preserving clusterings of graphs. Comp. Graph. Forum, 29(3), 2010.
- Towards unambiguous edge bundling: Investigating confluent drawings for network visualization. IEEE Trans. on Visual. and Comp. Graph., 23(1), 2017.
- M. Bampasidou and T. Gentimis. Modeling collaborations with persistent homology. arXiv preprint, 2014.
- Gephi: an open source software for exploring and manipulating networks. In AAAI Web and Social Media, 2009.
- Visual analysis of large graphs using (x, y)-clustering and hybrid visualizations. IEEE Trans. on Visual. and Comp. Graph., 17(11), 2010.
- Reeb graphs for shape analysis and applications. Theoretical Comp. Sci., 392, 2008.
- An overview on properties and efficacy of topological skeletons in shape modelling. Shape Modeling International, 2003.
- Fast unfolding of communities in large networks. J. of Statistical Mechanics, 2008.
- Deep graph mapper: Seeing graphs through the neural lens. Frontiers in Big Data, 4, 2021.
- Design and update of a classification system: The ucsd map of science. PloS One, 7(7), 2012.
- On modularity clustering. IEEE Trans. on Knowledge and Data Engineering, 20(2), 2007.
- Experiments on graph clustering algorithms. In European Symp. on Algorithms, 2003.
- U. Brandes and C. Pich. Eigensolver methods for progressive multidimensional scaling of large data. In Graph Drawing, 2007.
- S. Brin and L. Page. The anatomy of a large-scale hypertextual web search engine. Computer Networks and ISDN Systems, 30(1-7), 1998.
- Probabilistic convergence and stability of random mapper graphs. Journal of Applied and Computational Topology (APCT), 5:99–140, 2021.
- Graph bisection algorithms with good average case behavior. Combinatorica, 7(2), 1987.
- G. Carlsson. Topological pattern recognition for point cloud data. Acta Numerica, 23, 2014.
- Statistical analysis and parameter selection for mapper. J. of Machine Learning Research, 19(12), 2018.
- M. Carriére and S. Oudot. Structure and stability of the one-dimensional mapper. Foundations of Computational Mathematics, 18(6), 2018.
- C. Carstens and K. Horadam. Persistent homology of collaboration networks. Mathematical Problems in Engineering, 2013.
- Brain activity: Conditional dissimilarity and persistent homology. IEEE Symp. on Biomedical Imaging, 2015.
- Adaptive covers for mapper graphs using information criteria. IEEE Big Data, 2021.
- Hot spots conjecture and its application to modeling tubular structures. In Workshop on Machine Learning in Medical Imaging, 2011.
- G. Cordasco and L. Gargano. Community detection via semi-synchronous label propagation algorithms. In IEEE Workshop on Business Applications of Social Network Analysis, 2010.
- Geometry-based edge clustering for graph visualization. IEEE Trans. on Visual. and Comp. Graph., 14(6), 2008.
- A topological paradigm for hippocampal spatial map formation using persistent homology. PLoS Computational Biology, 8(8), 2012.
- T. Davis and Y. Hu. The University of Florida sparse matrix collection. ACM Trans. on Mathematical Software, 38(1), 2011.
- Exploratory social network analysis with Pajek: Revised and expanded edition for updated software, vol. 46. Cambridge University Press, 2018.
- Topological analysis of nerves, reeb spaces, mappers, and multiscale mappers. arXiv preprint, 2017.
- I. Dhillon. Co-clustering documents and words using bipartite spectral graph partitioning. In ACM SIGKDD Knowledge Discovery and Data Mining, 2001.
- A min-max cut algorithm for graph partitioning and data clustering. IEEE Data Mining, 2001.
- Compressed adjacency matrices: untangling gene regulatory networks. IEEE Trans. on Visual. and Comp. Graph., 18(12), 2012.
- Decimation of fast states and weak nodes: topological variation via persistent homology. European Conf. on Complex Systems, 2012.
- Untangling force-directed layouts using persistent homology. In Topological Data Analysis and Visualization (TopoInVis), 2022.
- C. Dunne and B. Shneiderman. Motif simplification. ACM SIGCHI Human Factors in Computing Systems, 2013.
- Edge compression techniques for visualization of dense directed graphs. IEEE Trans. on Visual. and Comp. Graph., 19(12), 2013.
- G. Ellis and A. Dix. A taxonomy of clutter reduction for information visualisation. IEEE Trans. on Visual. and Comp. Graph., 13(6), 2007.
- Graphviz— open source graph drawing tools. Graph Drawing, 2002.
- Skeleton-based edge bundling for graph visualization. IEEE Trans. on Visual. and Comp. Graph., 17(12), 2011.
- A density-based algorithm for discovering clusters in large spatial databases with noise. In ACM SIGKDD Knowledge Discovery and Data Mining, 1996.
- M. Fiedler. Algebraic connectivity of graphs. Czechoslovak Mathematical Lournal, 23(2), 1973.
- T. Fruchterman and E. Reingold. Graph drawing by force-directed placement. Software: Practice and Exp., 21(11), 1991.
- Multilevel agglomerative edge bundling for visualizing large graphs. In IEEE PacificVis, 2011.
- Graph drawing by stress majorization. In Graph Drawing, 2005.
- Topological fisheye views for visualizing large graphs. IEEE Trans. on Visual. and Comp. Graph., 11(4), 2005.
- A technique for drawing directed graphs. IEEE Trans. on Software Eng., 19(3), 1993.
- Simplifying tuggraph using zipping algorithms. Pattern Recognition, 103, 2020.
- V. Grolmusz. A note on the pagerank of undirected graphs. arXiv preprint, 2012.
- Exploring network structure, dynamics, and function using networkx. Technical report, Los Alamos National Lab.(LANL), Los Alamos, NM (United States), 2008.
- MOG: Mapper on graphs for relationship preserving clustering. arXiv preprint, 2018.
- Visual detection of structural changes in time-varying graphs using persistent homology. IEEE PacificVis, 2018.
- Analyzing social media networks with NodeXL: Insights from a connected world. Morgan Kaufmann, 2010.
- D. Holten and J. van Wijk. Force-directed edge bundling for graph visualization. Comp. Graph. Forum, 28(3), 2009.
- Persistent homology of complex networks. J. of Statistical Mechanics: Theory and Experiment, 2009.
- Y. Hu. Efficient, high-quality force-directed graph drawing. Mathematica Journal, 10(1), 2005.
- G. Jeh and J. Widom. SimRank: a measure of structural-context similarity. In ACM SIGKDD Knowledge Discovery and Data Mining, 2002.
- Graphprism: Compact visualization of network structure. In Adv. Visual Interfaces, 2012.
- Detecting divergent subpopulations in phenomics data using interesting flares. In ACM Bioinfo., Computational Biology, and Health Informatics, 2018.
- Drawing large graphs by low-rank stress majorization. Comp. Graph. Forum, 31, 2012.
- Möbius transformations for global intrinsic symmetry analysis. Comp. Graph. Forum, 29(5), 2010.
- Y. Koren. On spectral graph drawing. In Computing and Combinatorics. Springer, 2003.
- Ace: A fast multiscale eigenvectors computation for drawing huge graphs. IEEE Symp. on Information Visualization, 2002.
- Semi-supervised graph clustering: a kernel approach. Machine Learning, 74(1), 2009.
- S. Lafon and A. B. Lee. Diffusion maps and coarse-graining: A unified framework for dimensionality reduction, graph partitioning, and data set parameterization. IEEE Trans. on Pattern Analysis and Machine Intelligence, 28(9), 2006.
- Computing the shape of brain networks using graph filtration and gromov-hausdorff metric. Conference on Medical Image Computing and Computer Assisted Intervention, 2011.
- Discriminative persistent homology of brain networks. IEEE Symp. on Biomedical Imaging, 2011.
- Persistent brain network homology from the perspective of dendrogram. IEEE Trans. on Medical Imaging, 31(12), 2012.
- Weighted functional brain network modeling via network filtration. NIPS Workshop on Algebraic Topology and Machine Learning, 2012.
- J. Leskovec and A. Krevl. SNAP Datasets: Stanford large network dataset collection, 2014.
- B. Lévy. Laplace-Beltrami eigenfunctions towards an algorithm that understands geometry. In IEEE Shape Modeling and App., 2006.
- Visualizing high-dimensional data: Advances in the past decade. IEEE Trans. on Visual. and Comp. Graph., 23(3), 2017.
- Extracting insights from the shape of complex data using topology. Scientific Reports, 3, 2013.
- U. V. Luxburg. A tutorial on spectral clustering. Statistics and Computing, 17(4), 2007.
- F. McGee and J. Dingliana. An empirical study on the impact of edge bundling on user comprehension of graphs. In Advanced Visual Interfaces, 2012.
- E. Munch and B. Wang. Convergence between categorical representations of Reeb space and mapper. Symp. on Comp. Geometry, 51, 2016.
- M. Newman. The structure of scientific collaboration networks. Nat. Academy of Sci., 98(2), 2001.
- M. Newman. Properties of highly clustered networks. Physical Review E, 68(2), 2003.
- M. Newman. Fast algorithm for detecting community structure in networks. Physical Review E, 69(6), 2004.
- On spectral clustering: Analysis and an algorithm. In Advances in Neural Info. Processing Systems, 2002.
- Topology based data analysis identifies a subgroup of breast cancers with a unique mutational profile and excellent survival. Nat. Academy of Sci., 108(17), 2011.
- Pagerank citation ranking: Bringing order to the web. Technical report, Stanford, 1999.
- Networks and cycles: A persistent homology approach to complex networks. European Conf. on Complex Systems, 2013.
- Topological strata of weighted complex networks. PloS One, 8(6), 2013.
- A topological study of repetitive co-activation networks in in vitro cortical assemblies. Physical Biology, 12(1), 2015.
- P. Pons and M. Latapy. Computing communities in large networks using random walks. In Symp. on Computer and Info. Sciences, 2005.
- L. Pretto. Analysis of web link analysis algorithms: The mathematics of ranking. In Information Access through Search Engines and Digital Libraries. Springer, 2008.
- Near linear time algorithm to detect community structures in large-scale networks. Physical Review E, 76(3), 2007.
- Discrete laplace–beltrami operators for shape analysis and segmentation. Computers & Graphics, 33(3), 2009.
- R. Rossi and N. Ahmed. The network data repository with interactive graph analytics and visualization. In AAAI Artificial Intelligence, vol. 29, 2015.
- S. Schaeffer. Graph clustering. Computer Sci. Review, 1(1), 2007.
- Divided edge bundling for directional network data. IEEE Trans. on Visual. and Comp. Graph., 17(12), 2011.
- The emerging field of signal processing on graphs: Extending high-dimensional data analysis to networks and other irregular domains. IEEE Signal Processing Magazine, 30(3), 2013.
- B. Silverman. Density estimation for statistics and data analysis, vol. 26. CRC press, 1986.
- Topological methods for the analysis of high dimensional data sets and 3D object recognition. Eurographics Symp. on Point-Based Graphics, 22, 2007.
- Persistent homology guided force-directed graph layouts. IEEE Trans. on Visual. and Comp. Graph., 26(1), 2019.
- Efficient aggregation for graph summarization. In ACM SIGKDD Knowledge Discovery and Data Mining, 2008.
- Tracking resilience to infections by mapping disease space. PLoS Biology, 14(4), 2016.
- W. Tutte. How to draw a graph. London Mathematical Society, s3-13(1), 1963.
- Interactive visualization of small world graphs. In IEEE Symp. on Information Visualization, 2004.
- Visualizing group structures in graphs: A survey. Comp. Graph. Forum, 36(6), 2017.
- Visual analysis of large graphs: State-of-the-art and future research challenges. Comp. Graph. Forum, 30(6), 2011.
- S. White and P. Smyth. A spectral clustering approach to finding communities in graphs. In SIAM Data Mining, 2005.
- P. Wills and F. Meyer. Metrics for graph comparison: a practitioner’s guide. PloS One, 15(2), 2020.
- Pheno-Mapper: an interactive toolbox for the visual exploration of phenomics data. ACM Bioinformatics, Computational Biology, and Health Informatics, 2021.
- Topological simplifications of hypergraphs. IEEE Trans. on Visual. and Comp. Graph., 2022.
- BioSNAP Datasets: Stanford biomedical network dataset collection, 2018.
- Paul Rosen (41 papers)
- Mustafa Hajij (51 papers)
- Bei Wang (102 papers)