Sparsification of Phylogenetic Covariance Matrices of $k$-Regular Trees (2405.17847v1)
Abstract: Consider a tree $T=(V,E)$ with root $\circ$ and edge length function $\ell:E\to\mathbb{R}_+$. The phylogenetic covariance matrix of $T$ is the matrix $C$ with rows and columns indexed by $L$, the leaf set of $T$, with entries $C(i,j):=\sum_{e\in[i\wedge j,\circ]}\ell(e)$, for each $i,j\in L$. Recent work [15] has shown that the phylogenetic covariance matrix of a large, random binary tree $T$ is significantly sparsified with overwhelmingly high probability under a change of basis with respect to the so-called Haar-like wavelets of $T$. Notably, this finding makes it possible to manipulate the spectrum of covariance matrices of large binary trees without storing them in computer memory, by instead performing two post-order traversals of the tree. Building on the methods of [15], this manuscript extends their sparsification result to the broader class of $k$-regular trees, for any given $k\ge2$. The extension is achieved by refining existing asymptotic formulas for the mean and variance of the internal path length of random $k$-regular trees, using properties and identities of hypergeometric functions.
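To make the definition of $C$ concrete, here is a minimal sketch that builds the matrix directly for a small tree. It assumes the tree is given as a parent map in which each edge is identified by its child endpoint. The function name `phylo_covariance` and the toy tree are hypothetical and do not come from the manuscript. The sketch uses the fact that the edges on $[i\wedge j,\circ]$ are exactly the edges shared by the leaf-to-root paths of $i$ and $j$. It stores the dense matrix, so it illustrates the definition only, not the traversal-based method described in the abstract.

```python
def phylo_covariance(parent, length, leaves):
    """Dense phylogenetic covariance matrix of a rooted tree.

    parent[v] is the parent of node v (the root maps to None).
    length[v] is the length of the edge joining v to parent[v].
    Both conventions are assumptions made for this sketch.
    """
    def edges_to_root(v):
        # Collect the edges (named by child endpoint) from v up to the root.
        edges = set()
        while parent[v] is not None:
            edges.add(v)
            v = parent[v]
        return edges

    paths = {leaf: edges_to_root(leaf) for leaf in leaves}
    C = [[0.0] * len(leaves) for _ in leaves]
    for a, i in enumerate(leaves):
        for b, j in enumerate(leaves):
            # Shared edges are exactly those between i ^ j and the root.
            shared = paths[i] & paths[j]
            C[a][b] = sum((length[e] for e in shared), 0.0)
    return C


# Hypothetical toy tree: root r has children a and z; a has leaves x and y.
parent = {"r": None, "a": "r", "x": "a", "y": "a", "z": "r"}
length = {"a": 1.0, "x": 2.0, "y": 0.5, "z": 3.0}
leaves = ["x", "y", "z"]

C = phylo_covariance(parent, length, leaves)
for row in C:
    print(row)
```

In this toy tree the diagonal entry $C(i,i)$ is the distance from leaf $i$ to the root, and $C(x,z)=0$ because $x\wedge z$ is the root itself.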
- D. Aldous and B. Pittel. The critical beta-splitting random tree: Heights and related results, 2023.
- D. J. Aldous. The critical beta-splitting random tree II: Overview and open problems, 2023.
- M. Drmota. Random Trees: An Interplay between Combinatorics and Probability. Springer-Verlag/Wien, 2009.
- R. J. Evans and D. Stanton. Asymptotic formulas for zero-balanced hypergeometric series. SIAM J. Math. Anal., 1984.
- P. Flajolet and R. Sedgewick. Analytic Combinatorics. Cambridge University Press, 2009.
- E. Gorman and M. E. Lladser. Interpretable metric learning in comparative metagenomics: The adaptive Haar-like distance. PLoS Comput Biol 20(5): e1011543, 2024.
- L. J. Harmon. Phylogenetic Comparative Methods. CreateSpace Independent Publishing Platform, 2019.
- E. Hille. Analytic function theory. Vol. 1. Introduction to Higher Mathematics. Ginn and Company, 1959.
- S. Svihla and M. E. Lladser. Sparsification of phylogenetic covariance matrices of $k$-ary trees. In preparation.
- E. W. Weisstein. Hypergeometric function. https://mathworld.wolfram.com/HypergeometricFunction.html. Accessed: September 2023.