Beta & Poisson-Dirichlet Coalescents

Updated 22 January 2026

Beta and Poisson-Dirichlet Coalescents are stochastic models capturing genealogies with multiple simultaneous mergers under skewed offspring distributions.
Limit theorems for these processes describe external branch lengths, collision counts, and extreme block sizes, quantities that matter in studies of non-neutral evolution.
Rigorous analytical methods, such as moment recursion and renewal approximations, underpin precise scaling limits and applications in evolutionary genetics.

Beta and Poisson-Dirichlet coalescents are fundamental objects in the theory of exchangeable random partitions and of stochastic processes describing genealogies in populations with highly skewed offspring distributions. These coalescent processes generalize the classic Kingman coalescent by allowing multiple mergers, in which more than two ancestral lineages collide at once, with particular focus on the Beta and Poisson-Dirichlet (PD) distributional classes, which characterize different regimes of coalescent behavior. Their study provides a rigorous framework for modeling genealogies under selection, large offspring variance, and various non-neutral evolutionary mechanisms.

1. Definitions and Structural Properties

A $\Lambda$ -coalescent is an exchangeable Markov process on the partitions of a finite set (e.g., $\{1,\dots,n\}$ ), whose transitions allow $k$ blocks to merge at once, with $2\leq k\leq b$ when there are $b$ blocks present. The rate at which a given $k$ -tuple merges is

$\lambda_{b,k} = \int_0^1 x^{k-2}(1-x)^{b-k} \Lambda(dx).$

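Here, $\Lambda$ is a finite measure on $[0,1]$ . For the Beta $(a,b)$ -coalescent, $\Lambda(dx) = \frac{1}{B(a,b)} x^{a-1}(1-x)^{b-1} dx$ with $a,b>0$ , where $B$ is the Beta function. In this notation $b$ is the second shape parameter of the Beta law, not the number of blocks appearing in $\lambda_{b,k}$ .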
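For a Beta measure the rate integral has a closed form as a ratio of Beta functions, $\lambda_{m,k} = B(k-2+a,\, m-k+\beta)/B(a,\beta)$ , writing $m$ for the number of blocks and $\beta$ for the second shape parameter to avoid the clash of symbols. The following is a minimal sketch of that evaluation; the function names (`log_beta`, `beta_merger_rate`, `total_merger_rate`) are illustrative and do not come from any particular library.

```python
import math


def log_beta(x: float, y: float) -> float:
    """Logarithm of the Beta function B(x, y), computed via log-gamma."""
    return math.lgamma(x) + math.lgamma(y) - math.lgamma(x + y)


def beta_merger_rate(n_blocks: int, k: int, a: float, beta_b: float) -> float:
    """Rate at which one given k-tuple merges when n_blocks blocks are present.

    Closed form of the rate integral for Lambda = Beta(a, beta_b):
        B(k - 2 + a, n_blocks - k + beta_b) / B(a, beta_b).
    """
    if not 2 <= k <= n_blocks:
        raise ValueError("need 2 <= k <= n_blocks")
    if a <= 0 or beta_b <= 0:
        raise ValueError("Beta shape parameters must be positive")
    return math.exp(
        log_beta(k - 2 + a, n_blocks - k + beta_b) - log_beta(a, beta_b)
    )


def total_merger_rate(n_blocks: int, a: float, beta_b: float) -> float:
    """Total rate of leaving a state with n_blocks blocks (sum over all k-tuples)."""
    return sum(
        math.comb(n_blocks, k) * beta_merger_rate(n_blocks, k, a, beta_b)
        for k in range(2, n_blocks + 1)
    )
```

A useful sanity check is the consistency relation $\lambda_{m,k} = \lambda_{m+1,k} + \lambda_{m+1,k+1}$ , which any $\Lambda$ -coalescent rate family satisfies and which these closed-form rates reproduce.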

Kingman's coalescent corresponds to $\Lambda=\delta_0$ . The Bolthausen–Sznitman coalescent uses $\Lambda(dx)=dx$ , which is the Beta $(1,1)$ case; the Beta $(1,b)$ -coalescents generalize it, and the full Beta $(a,b)$ family yields a spectrum of behaviors depending on the parameters $a,b$ .
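As a quick worked check, the rate formula can be evaluated directly in these two special cases (reading $0^0$ as $1$ in the Kingman integrand):

$\Lambda=\delta_0:\quad \lambda_{b,k} = 0^{k-2}\cdot 1^{b-k} = \mathbf{1}_{\{k=2\}},$

$\Lambda(dx)=dx:\quad \lambda_{b,k} = \int_0^1 x^{k-2}(1-x)^{b-k}\,dx = B(k-1,\,b-k+1) = \frac{(k-2)!\,(b-k)!}{(b-1)!}.$

So under Kingman's coalescent only pairs merge, each at rate $1$ , while under Bolthausen–Sznitman every merger size $2\le k\le b$ has positive rate.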

The Poisson–Dirichlet (PD $(\alpha,\theta)$ ) coalescents correspond to a class of $\Xi$ -coalescents (further generalizations of $\Lambda$ -coalescents), with $(\alpha,\theta)$ parameterizing random frequencies of blocks via stick-breaking or Poisson–Kingman constructions. In certain sampling limits, the finite-dimensional distributions of block sizes converge to a PD $(\alpha,\theta)$ law (Siri-Jégousse et al., 2013).
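The stick-breaking construction mentioned above can be sketched as follows. This is an illustrative, truncated sampler based on the standard GEM representation, in which the $i$ -th stick fraction is drawn from Beta $(1-\alpha,\theta+i\alpha)$ and the resulting weights are ranked; it assumes $0\le\alpha<1$ and $\theta>-\alpha$ . The function name and the truncation level `n_sticks` are assumptions made for the example, and the truncation means the output only approximates a PD $(\alpha,\theta)$ sample.

```python
import random
from typing import List, Optional


def pd_stick_breaking(
    alpha: float,
    theta: float,
    n_sticks: int = 1000,
    rng: Optional[random.Random] = None,
) -> List[float]:
    """Approximate PD(alpha, theta) sample: ranked weights of a truncated GEM sequence."""
    if not (0.0 <= alpha < 1.0 and theta > -alpha):
        raise ValueError("need 0 <= alpha < 1 and theta > -alpha")
    rng = rng or random.Random()
    remaining = 1.0  # length of the stick not yet broken off
    weights = []
    for i in range(1, n_sticks + 1):
        # Fraction of the remaining stick taken by the i-th piece.
        v = rng.betavariate(1.0 - alpha, theta + i * alpha)
        weights.append(remaining * v)
        remaining *= 1.0 - v
    # Ranking the size-biased weights gives the Poisson-Dirichlet ordering.
    weights.sort(reverse=True)
    return weights
```

For example, `pd_stick_breaking(0.5, 0.0, n_sticks=200, rng=random.Random(1))` returns 200 decreasing weights whose sum is slightly below 1; the missing mass is the part of the stick left unbroken by the truncation.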

2. Asymptotic Behavior and Scaling Limits

For Beta $(2-\alpha,\alpha)$ -coalescents with $1<\alpha<2$ , the key asymptotic results concern the external branch length $T^{(n)}$ (the time a singleton persists before its first coalescence), the total external tree length $L_{\rm ext}^{(n)}$ , and the total number of collisions.

External branch length: As $n\to\infty$ ,

$n^{\alpha-1} T^{(n)} \xrightarrow{d} T,$

where $T$ is a random variable with explicit density and tail behavior,

$P(T>t) = \left(1 + \frac{t}{\alpha \Gamma(\alpha)}\right)^{-\alpha/(\alpha-1)},$

$f_T(t) = \frac{1}{(\alpha-1)\Gamma(\alpha)} \left(1 + \frac{t}{\alpha\Gamma(\alpha)}\right)^{-(\alpha/(\alpha-1))-1}.$
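The limit law of $T$ is explicit enough to evaluate and sample directly. The sketch below codes the survival function and density exactly as written above and adds an inverse-CDF sampler obtained by solving $P(T>t)=1-u$ for $t$ ; the function names are illustrative.

```python
import math
import random


def _scale(alpha: float) -> float:
    """The constant alpha * Gamma(alpha) appearing inside the bracket."""
    return alpha * math.gamma(alpha)


def limit_survival(t: float, alpha: float) -> float:
    """P(T > t) for the limit of n^(alpha-1) T^(n), with 1 < alpha < 2."""
    return (1.0 + t / _scale(alpha)) ** (-alpha / (alpha - 1.0))


def limit_density(t: float, alpha: float) -> float:
    """Density f_T(t), i.e. minus the derivative of the survival function."""
    exponent = -alpha / (alpha - 1.0) - 1.0
    return (1.0 + t / _scale(alpha)) ** exponent / ((alpha - 1.0) * math.gamma(alpha))


def limit_quantile(u: float, alpha: float) -> float:
    """Inverse CDF: the t with P(T <= t) = u, for 0 <= u < 1."""
    return _scale(alpha) * ((1.0 - u) ** (-(alpha - 1.0) / alpha) - 1.0)


def sample_limit(alpha: float, rng: random.Random) -> float:
    """Draw one value of T by inverse-transform sampling."""
    return limit_quantile(rng.random(), alpha)
```

Since the survival function decays like $t^{-\alpha/(\alpha-1)}$ and $\alpha/(\alpha-1)>2$ on $1<\alpha<2$ , samples drawn this way have finite mean and variance, with the tail becoming heavier as $\alpha$ increases toward $2$ .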

The number $\sigma^{(n)}$ of collisions until absorption of a leaf, properly rescaled, converges in law to Beta $(1,\alpha)$ (Dhersin et al., 2012).

Total external length: For $1<\alpha<2$ ,

$n^{\alpha-2} L_{\rm ext}^{(n)} \xrightarrow{L^2} \alpha(\alpha-1)\Gamma(\alpha),$

with variance and covariance structure detailed via explicit asymptotic expansions. The ratio to total tree length converges in probability to $2-\alpha$ , sharply contrasting the Kingman regime (Dhersin et al., 2012).
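A heuristic consistency check links this limit to the law of $T$ given earlier (it is not a proof, since convergence in law does not by itself give convergence of means). By exchangeability, $E[L_{\rm ext}^{(n)}] = n\,E[T^{(n)}]$ , and the limit law of $T$ has mean

$E[T] = \int_0^\infty \left(1+\frac{t}{\alpha\Gamma(\alpha)}\right)^{-\alpha/(\alpha-1)} dt = \frac{\alpha\Gamma(\alpha)}{\frac{\alpha}{\alpha-1}-1} = \alpha(\alpha-1)\Gamma(\alpha),$

which agrees with the limit of $n^{\alpha-2}L_{\rm ext}^{(n)} = n^{\alpha-1}\cdot L_{\rm ext}^{(n)}/n$ . Combining that limit with the ratio $2-\alpha$ also gives

$n^{\alpha-2} L^{(n)} \to \frac{\alpha(\alpha-1)\Gamma(\alpha)}{2-\alpha}$

in probability for the total tree length $L^{(n)}$ . For instance, at $\alpha=3/2$ one has $\Gamma(3/2)=\sqrt{\pi}/2\approx 0.8862$ , so the external-length constant is $0.75\times 0.8862\approx 0.6647$ , the external fraction is $0.5$ , and the total-length constant is about $1.3293$ .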

Largest block and minimal clade: The largest block at a deterministic time scale or at $T^{(n)}$ exhibits extremal Gumbel-type limit theorems. The minimal clade size (the size of the block containing a fixed element at its coalescence time) has a heavy tail, decaying as $k^{-(\alpha-1)^2}$ (Siri-Jégousse et al., 2013).

Total number of collisions: For Beta $(1,b)$ -coalescents, the total number of collisions $X_n$ satisfies $(X_n - d_n)/c_n \xrightarrow{d} S_1$ , where $d_n = n/\log n + n\log\log n/(\log n)^2$ , $c_n = n/(\log n)^2$ , and $S_1$ is the spectrally negative $1$ -stable law; in the general Beta $(a,b)$ -coalescent with $0<a<1$ , the limit is instead a $(2-a)$ -stable law (Gnedin et al., 2012).
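To get a feel for the total number of collisions at finite $n$ , one can simulate the jump chain of the block-counting process: from $m$ blocks, a merger of size $k$ is chosen with probability proportional to $\binom{m}{k}\lambda_{m,k}$ , leaving $m-k+1$ blocks. The sketch below does this for a Beta $(a,\beta)$ measure using the closed-form Beta-function rates; the function names are illustrative, and the quadratic cost in $n$ makes it suitable only for moderate sample sizes.

```python
import math
import random
from typing import List


def merger_size_weights(n_blocks: int, a: float, beta_b: float) -> List[float]:
    """Weights C(n_blocks, k) * lambda_{n_blocks,k} for k = 2..n_blocks.

    Uses lambda_{m,k} = B(k-2+a, m-k+beta_b) / B(a, beta_b), evaluated in
    log space to avoid overflow in the binomial coefficients.
    """
    log_norm = math.lgamma(a) + math.lgamma(beta_b) - math.lgamma(a + beta_b)
    weights = []
    for k in range(2, n_blocks + 1):
        log_comb = (
            math.lgamma(n_blocks + 1)
            - math.lgamma(k + 1)
            - math.lgamma(n_blocks - k + 1)
        )
        log_rate = (
            math.lgamma(k - 2 + a)
            + math.lgamma(n_blocks - k + beta_b)
            - math.lgamma(n_blocks - 2 + a + beta_b)
            - log_norm
        )
        weights.append(math.exp(log_comb + log_rate))
    return weights


def count_collisions(n: int, a: float, beta_b: float, rng: random.Random) -> int:
    """Number of merger events until a sample of n lineages reaches one block."""
    blocks = n
    collisions = 0
    while blocks > 1:
        sizes = range(2, blocks + 1)
        k = rng.choices(sizes, weights=merger_size_weights(blocks, a, beta_b))[0]
        blocks -= k - 1  # k blocks are replaced by a single merged block
        collisions += 1
    return collisions
```

Averaging `count_collisions(n, 1.0, b, rng)` over many runs and comparing with $d_n$ gives a rough empirical look at the centering, bearing in mind that logarithmic corrections make convergence slow.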

3. Poisson-Dirichlet Coalescents and Model Construction

Coalescent processes associated with Poisson–Dirichlet laws arise naturally in two distinct ways:

As limit partitions at fixed time: For the Beta $(2-\alpha,\alpha)$ -coalescent with $1<\alpha<2$ , the ranked block frequencies at fixed $t>0$ converge, as $n\to\infty$ , to a PD $(2-\alpha,\alpha)$ partition (Siri-Jégousse et al., 2013).
From size-biased stick-breaking sampling: The two-parameter PD $(\alpha,-\beta)$ distributions arise as the limit collision measure in discrete-time $\Xi$ -coalescents constructed via sampling $N$ points from a normalized Pareto $(\alpha)$ random partition, with possible $\beta$ size-biasing. The precise limiting regime depends on $\alpha$ and $\beta$ (Huillet, 2013), as summarized below:

$\alpha$ Range	Limiting Process	Time Scaling
$0\le\alpha<1$	Discrete-time PD $(\alpha, -\beta)$ $\Xi$ -coalescent	None
$1\le\alpha<2$	Beta $(2-\alpha, \alpha-\beta)$ $\Lambda$ -coalescent	Speed up by $N^{\alpha-1}$
$\alpha\ge2$	Kingman coalescent	Speed up by $N$ or $N/\log N$

In forward-time Poisson branching-selection models, the genealogical tree is equivalent to these coalescent structures, providing a direct evolutionary mechanism for the emergence of such partitions (Huillet, 2013).

4. Genealogical and Population Genetics Interpretation

In constant-population models where offspring distribution is heavy-tailed (e.g., marine species, viral populations), Beta-coalescents and Poisson–Dirichlet coalescents more accurately capture the probability of large family sizes and multiple mergers. The genealogical interpretation, supported by explicit Markov chain construction and limit theorems, connects these coalescents to evolving branching populations with selection mechanisms parameterized by offspring fitness and size-biased sampling (Huillet, 2013).

Implications include:

Neutrality tests (e.g., Fu–Li’s $F^*$ ): Statistics like external/total length ratio are shifted under Beta-coalescent genealogies; the presence of many singletons may indicate skewed offspring variance rather than demographic events (Dhersin et al., 2012).
Correlation structure: For $1<\alpha<2$ , positive correlation between external branch lengths increases variance in mutation counts, affecting inference procedures.
Limiting distributions: Power-law tails and Gumbel-type extremal behaviors dominate functionals such as minimal clade size and largest block at small times (Siri-Jégousse et al., 2013).

5. Methodologies and Limit Theorems

Analytical results for Beta and PD-coalescents rest on several advanced methodologies:

Moment recursion: Precise asymptotics for mean and variance of external branch lengths and total lengths via recursive equations and linear expansion methods (Dhersin et al., 2012, Dhersin et al., 2012).
Renewal approximation and Wasserstein control: For total collisions and tree-length, renewal-type arguments with quantification via Wasserstein distances facilitate transfer of stable limit theorems from approximating random walks to actual coalescence statistics (Gnedin et al., 2012).
Paint-box and exchangeability: The characterization of block frequencies as draws from PD distributions relies on exchangeability and Kingman’s paint-box construction (Siri-Jégousse et al., 2013).
Tauberian and coupling techniques: Laplace transform and Tauberian methods are critical for deriving tail behaviors, especially for minimal clade statistics (Siri-Jégousse et al., 2013).
Forward-in-time population models: Simulation of fitness-dependent Poisson point processes with truncating selection directly yields genealogies corresponding to limiting Beta or PD-coalescents (Huillet, 2013).
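As an illustration of the moment-recursion idea, the expected external branch length satisfies a first-step recursion. With $m$ blocks and total rate $\lambda_m=\sum_k \binom{m}{k}\lambda_{m,k}$ , the chain waits a mean time $1/\lambda_m$ , then a $k$ -merger occurs with probability $\binom{m}{k}\lambda_{m,k}/\lambda_m$ and misses the tagged leaf with probability $(m-k)/m$ , giving $E[T^{(m)}] = 1/\lambda_m + \sum_k p_{m,k}\,\frac{m-k}{m}\,E[T^{(m-k+1)}]$ . The sketch below is a simple instance of this recursion written for the example, not the cited authors' derivation; the helper names are hypothetical.

```python
import math
from typing import Callable

RateFn = Callable[[int, int], float]  # (number of blocks, merger size) -> rate


def kingman_rate(n_blocks: int, k: int) -> float:
    """Kingman's coalescent: only pairs merge, each at rate 1."""
    return 1.0 if k == 2 else 0.0


def make_beta_rate(a: float, beta_b: float) -> RateFn:
    """Rates lambda_{m,k} = B(k-2+a, m-k+beta_b) / B(a, beta_b) for a Beta measure."""
    log_norm = math.lgamma(a) + math.lgamma(beta_b) - math.lgamma(a + beta_b)

    def rate(n_blocks: int, k: int) -> float:
        log_num = (
            math.lgamma(k - 2 + a)
            + math.lgamma(n_blocks - k + beta_b)
            - math.lgamma(n_blocks - 2 + a + beta_b)
        )
        return math.exp(log_num - log_norm)

    return rate


def expected_external_length(n: int, rate: RateFn) -> float:
    """E[T^(n)] by first-step analysis (intended for moderate n)."""
    expected = [0.0] * (n + 1)  # index 1 is never weighted by a nonzero factor
    for m in range(2, n + 1):
        jump = [math.comb(m, k) * rate(m, k) for k in range(2, m + 1)]
        total = sum(jump)
        value = 1.0 / total  # mean waiting time until the next merger
        for k, w in zip(range(2, m + 1), jump):
            # The tagged leaf stays external with probability (m - k) / m.
            value += (w / total) * ((m - k) / m) * expected[m - k + 1]
        expected[m] = value
    return expected[n]
```

For Kingman's coalescent the recursion reproduces the classical value $E[T^{(n)}]=2/n$ . With `make_beta_rate(2 - alpha, alpha)` one can compare $n^{\alpha-1}E[T^{(n)}]$ against $\alpha(\alpha-1)\Gamma(\alpha)$ numerically for growing $n$ .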

6. Connections to Other Classes and Regimes

Beta and PD-coalescents provide a spectrum of models, interpolating between Kingman’s binary-only regime ( $\alpha\to2$ ), Bolthausen–Sznitman’s multiple-merge regime ( $\alpha=1$ ), and intermediate multiple-merger regimes ( $1<\alpha<2$ ). The Poisson–Dirichlet framework includes the two-parameter family $\mathrm{PD}(\alpha,\theta)$ , with additional parameter $\theta$ introducing further flexibility. As $\alpha\downarrow1$ , tail heuristics and empirical statistics transition sharply, with many functionals exhibiting phase changes in their scaling exponents and correlation structure (Dhersin et al., 2012, Dhersin et al., 2012, Siri-Jégousse et al., 2013).

A plausible implication is that models in the Beta and PD class capture the “universality” class of coalescents arising in a wide range of natural and artificial selection regimes, particularly where reproduction is highly uneven.

7. Summary Table: Key Scaling and Laws

Functional	Mode of convergence (Beta $(2-\alpha,\alpha)$ , $1<\alpha<2$ )	Law / Limit
$n^{\alpha-1} T^{(n)}$	Converges in law	Power-law tail ( $\sim t^{-\alpha/(\alpha-1)}$ ) (Dhersin et al., 2012, Siri-Jégousse et al., 2013)
$n^{\alpha-2} L_{\rm ext}^{(n)}$	Converges in $L^2$	$\alpha(\alpha-1)\Gamma(\alpha)$ (Dhersin et al., 2012)
$L_{\rm ext}^{(n)}/L^{(n)}$	In probability	$2-\alpha$ (Dhersin et al., 2012)
Number of collisions	Rescaled, converges	Beta or stable law (Dhersin et al., 2012, Gnedin et al., 2012)
Largest block	Extreme value limit	Gumbel-type (Siri-Jégousse et al., 2013)

These results form the mathematical foundation for comprehensive modeling of genetic genealogies under heavy-tailed offspring distributions, large population models, and non-classical evolutionary dynamics.

References (5)

Asymptotics of the minimal clade size and related functionals of certain beta-coalescents (2013)

On the length of an external branch in the Beta-coalescent (2012)

Asympotic behavior of the total length of external branches for Beta-coalescents (2012)

On asymptotics of the beta-coalescents (2012)

Pareto genealogies arising from a Poisson branching evolution model with selection (2013)
