Papers
Topics
Authors
Recent
2000 character limit reached

Posterior bounds on divergence time of two sequences under dependent-site evolutionary models (2507.19659v1)

Published 25 Jul 2025 in q-bio.PE and math.PR

Abstract: Let x and y be two length n DNA sequences, and suppose we would like to estimate the divergence time T. A well known simple but crude estimate of T is p := d(x,y)/n, the fraction of mutated sites (the p-distance). We establish a posterior concentration bound on T, showing that the posterior distribution of T concentrates within a logarithmic factor of p when d(x,y)log(n)/n = o(1). Our bounds hold under a large class of evolutionary models, including many standard models that incorporate site dependence. As a special case, we show that T exceeds p with vanishingly small posterior probability as n increases under models with constant mutation rates, complementing the result of Mihaescu and Steel (Appl Math Lett 23(9):975--979, 2010). Our approach is based on bounding sequence transition probabilities in various convergence regimes of the underlying evolutionary process. Our result may be useful for improving the efficiency of iterative optimization and sampling schemes for estimating divergence times in phylogenetic inference.

Summary

We haven't generated a summary for this paper yet.

Whiteboard

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Continue Learning

We haven't generated follow-up questions for this paper yet.

Collections

Sign up for free to add this paper to one or more collections.