Papers
Topics
Authors
Recent
Search
2000 character limit reached

Beta & Poisson-Dirichlet Coalescents

Updated 22 January 2026
  • Beta and Poisson-Dirichlet Coalescents are stochastic models capturing genealogies with multiple simultaneous mergers under skewed offspring distributions.
  • They employ Beta and Poisson-Dirichlet laws to quantify external branch lengths, collision events, and extreme block sizes, crucial for non-neutral evolution studies.
  • Rigorous analytical methods, such as moment recursion and renewal approximations, underpin precise scaling limits and applications in evolutionary genetics.

Beta and Poisson-Dirichlet Coalescents are fundamental objects in the theory of exchangeable random partitions and stochastic processes describing genealogies in populations with highly skewed offspring distributions. These coalescent processes generalize the classic Kingman coalescent by allowing for multiple mergers (“simultaneous collisions”) of ancestral lineages, with particular focus on the Beta and Poisson-Dirichlet (PD) distributional classes which characterize different regimes of coalescent behavior. Their study underpins a rigorous framework for modeling genealogies under selection, large offspring variance, and various non-neutral evolutionary mechanisms.

1. Definitions and Structural Properties

A Λ\Lambda-coalescent is an exchangeable Markov process on the partitions of a finite set (e.g., {1,,n}\{1,\dots,n\}), whose transitions allow kk blocks to merge at once, with 2kb2\leq k\leq b when there are bb blocks present. The rate at which a given kk-tuple merges is

λb,k=01xk2(1x)bkΛ(dx).\lambda_{b,k} = \int_0^1 x^{k-2}(1-x)^{b-k} \Lambda(dx).

Here, Λ\Lambda is a finite measure on [0,1][0,1]. For the Beta(a,b)(a,b)-coalescent, Λ(dx)=1B(a,b)xa1(1x)b1dx\Lambda(dx) = \frac{1}{B(a,b)} x^{a-1}(1-x)^{b-1} dx.

Kingman's coalescent corresponds to Λ=δ0\Lambda=\delta_0. The Bolthausen–Sznitman coalescent uses Λ(dx)=dx\Lambda(dx)=dx; the Beta(1,b)(1,b)-coalescent generalizes this, yielding a spectrum of behaviors depending on parameters a,ba,b.

The Poisson–Dirichlet (PD(α,θ)(\alpha,\theta)) coalescents correspond to a class of Ξ\Xi-coalescents (further generalizations of Λ\Lambda-coalescents), with (α,θ)(\alpha,\theta) parameterizing random frequencies of blocks via stick-breaking or Poisson–Kingman constructions. In certain sampling limits, the finite-dimensional distributions of block sizes converge to a PD(α,θ)(\alpha,\theta) law (Siri-Jégousse et al., 2013).

2. Asymptotic Behavior and Scaling Limits

Under Beta(2α,α)(2-\alpha,\alpha)-coalescents for 1<α<21<\alpha<2, critical asymptotic results concern the external branch length T(n)T^{(n)} (the time a singleton persists before coalescence), the total external tree length Lext(n)L_{\rm ext}^{(n)}, and the total number of collisions.

External branch length: As nn\to\infty,

nα1T(n)dT,n^{\alpha-1} T^{(n)} \xrightarrow{d} T,

where TT is a random variable with explicit density and tail behavior,

P(T>t)=(1+tαΓ(α))α/(α1),P(T>t) = \left(1 + \frac{t}{\alpha \Gamma(\alpha)}\right)^{-\alpha/(\alpha-1)},

fT(t)=1(α1)Γ(α)(1+tαΓ(α))(α/(α1))1.f_T(t) = \frac{1}{(\alpha-1)\Gamma(\alpha)} \left(1 + \frac{t}{\alpha\Gamma(\alpha)}\right)^{-(\alpha/(\alpha-1))-1}.

The number σ(n)\sigma^{(n)} of collisions until absorption of a leaf, properly rescaled, converges in law to Beta(1,α)(1,\alpha) (Dhersin et al., 2012).

Total external length: For 1<α<21<\alpha<2,

nα2Lext(n)L2α(α1)Γ(α),n^{\alpha-2} L_{\rm ext}^{(n)} \xrightarrow{L^2} \alpha(\alpha-1)\Gamma(\alpha),

with variance and covariance structure detailed via explicit asymptotic expansions. The ratio to total tree length converges in probability to 2α2-\alpha, sharply contrasting the Kingman regime (Dhersin et al., 2012).

Largest block and minimal clade: The largest block at deterministic time scale or at T(n)T^{(n)} exhibits extremal Gumbel-type limit theorems. The minimal clade size (block containing a fixed element at its coalescence time) has a heavy-tail, decaying as k(α1)2k^{-(\alpha-1)^2} (Siri-Jégousse et al., 2013).

Total number of collisions: For Beta(1,b)(1,b)-coalescents, (Xndn)/cndS1(X_n - d_n)/c_n \xrightarrow{d} S_1, where dn=n/logn+nloglogn/(logn)2d_n = n/\log n + n\log\log n/(\log n)^2, cn=n/(logn)2c_n = n/(\log n)^2, and S1S_1 is the spectrally negative $1$-stable law; in the general Beta(a,b)(a,b) coalescent with $0(2a)(2-a)-stable law (Gnedin et al., 2012).

3. Poisson-Dirichlet Coalescents and Model Construction

Coalescent processes associated with Poisson–Dirichlet laws arise naturally in two distinct ways:

  1. As limit partitions at fixed time: For the Beta(2α,α)(2-\alpha,\alpha)-coalescent with 1<α<21<\alpha<2, the ranked block frequencies at fixed t>0t>0 converge, as nn\to\infty, to a PD(2α,α)(2-\alpha,\alpha) partition (Siri-Jégousse et al., 2013).
  2. From size-biased stick-breaking sampling: The two-parameter PD(α,β)(\alpha,-\beta) distributions arise as the limit collision measure in discrete-time Ξ\Xi-coalescents constructed via sampling NN points from a normalized Pareto(α)(\alpha) random partition, with possible β\beta size-biasing. The precise limiting regime depends on α\alpha and β\beta (Huillet, 2013), as summarized below:
α\alpha Range Limiting Process Time Scaling
0α<10\le\alpha<1 Discrete-time PD(α,β)(\alpha, -\beta) Ξ\Xi-coalescent None
1α<21\le\alpha<2 Beta(2α,αβ)(2-\alpha, \alpha-\beta) Λ\Lambda-coalescent Speed up by Nα1N^{\alpha-1}
α2\alpha\ge2 Kingman coalescent Speed up by NN or N/logNN/\log N

In forward-time Poisson branching-selection models, the genealogical tree is equivalent to these coalescent structures, providing a direct evolutionary mechanism for the emergence of such partitions (Huillet, 2013).

4. Genealogical and Population Genetics Interpretation

In constant-population models where offspring distribution is heavy-tailed (e.g., marine species, viral populations), Beta-coalescents and Poisson–Dirichlet coalescents more accurately capture the probability of large family sizes and multiple mergers. The genealogical interpretation, supported by explicit Markov chain construction and limit theorems, connects these coalescents to evolving branching populations with selection mechanisms parameterized by offspring fitness and size-biased sampling (Huillet, 2013).

Implications include:

  • Neutrality tests (e.g., Fu–Li’s FF^*): Statistics like external/total length ratio are shifted under Beta-coalescent genealogies; the presence of many singletons may indicate skewed offspring variance rather than demographic events (Dhersin et al., 2012).
  • Correlation structure: For 1<α<21<\alpha<2, positive correlation between external branch lengths increases variance in mutation counts, affecting inference procedures.
  • Limiting distributions: Power-law tails and Gumbel-type extremal behaviors dominate functionals such as minimal clade size and largest block at small times (Siri-Jégousse et al., 2013).

5. Methodologies and Limit Theorems

Analytical results for Beta and PD-coalescents rest on several advanced methodologies:

  • Moment recursion: Precise asymptotics for mean and variance of external branch lengths and total lengths via recursive equations and linear expansion methods (Dhersin et al., 2012, Dhersin et al., 2012).
  • Renewal approximation and Wasserstein control: For total collisions and tree-length, renewal-type arguments with quantification via Wasserstein distances facilitate transfer of stable limit theorems from approximating random walks to actual coalescence statistics (Gnedin et al., 2012).
  • Paint-box and exchangeability: The characterization of block frequencies as draws from PD distributions relies on exchangeability and Kingman’s paint-box construction (Siri-Jégousse et al., 2013).
  • Tauberian and coupling techniques: Laplace transform and Tauberian methods are critical for deriving tail behaviors, especially for minimal clade statistics (Siri-Jégousse et al., 2013).
  • Forward-in-time population models: Simulation of fitness-dependent Poisson point processes with truncating selection directly yields genealogies corresponding to limiting Beta or PD-coalescents (Huillet, 2013).

6. Connections to Other Classes and Regimes

Beta and PD-coalescents provide a spectrum of models, interpolating between Kingman’s binary-only regime (α2\alpha\to2), Bolthausen–Sznitman’s multiple-merge regime (α=1\alpha=1), and intermediate multiple-merger regimes (1<α<21<\alpha<2). The Poisson–Dirichlet framework includes the two-parameter family PD(α,θ)\mathrm{PD}(\alpha,\theta), with additional parameter θ\theta introducing further flexibility. As α1\alpha\downarrow1, tail heuristics and empirical statistics transition sharply, with many functionals exhibiting phase changes in their scaling exponents and correlation structure (Dhersin et al., 2012, Dhersin et al., 2012, Siri-Jégousse et al., 2013).

A plausible implication is that models in the Beta and PD class capture the “universality” class of coalescents arising in a wide range of natural and artificial selection regimes, particularly where reproduction is highly uneven.

7. Summary Table: Key Scaling and Laws

Functional Beta(2α,α)(2-\alpha,\alpha), 1<α<21<\alpha<2 Law / Limit
nα1T(n)n^{\alpha-1} T^{(n)} Converges in law Power-law tail (tα\sim t^{-\alpha}) (Dhersin et al., 2012, Siri-Jégousse et al., 2013)
nα2Lext(n)n^{\alpha-2} L_{\rm ext}^{(n)} L2L^2 converges α(α1)Γ(α)\alpha(\alpha-1)\Gamma(\alpha) (Dhersin et al., 2012)
Lext(n)/L(n)L_{\rm ext}^{(n)}/L^{(n)} In probability 2α2-\alpha (Dhersin et al., 2012)
Number of collisions Rescaled, converges Beta or stable law (Dhersin et al., 2012, Gnedin et al., 2012)
Largest block Extreme value limit Gumbel-type (Siri-Jégousse et al., 2013)

These results form the mathematical foundation for comprehensive modeling of genetic genealogies under heavy-tailed offspring distributions, large population models, and non-classical evolutionary dynamics.

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Beta and Poisson-Dirichlet Coalescents.