Flexible Dirichlet Models

Updated 11 March 2026

Flexible Dirichlet (FD) is a class of multivariate distributions that generalizes the classical Dirichlet by integrating custom baseline distributions and generator functions.
It supports tailor-made marginal behaviors and enables both positive and negative correlations, overcoming limits of standard Dirichlet models.
FD models are applied in compositional data analysis and mixture modeling, offering improved fit and interpretability in real-world datasets.

The Flexible Dirichlet (FD) family refers to a class of multivariate probability distributions that generalize the classical Dirichlet, introducing greater flexibility in capturing marginal behaviors, dependence structure, and support. This is achieved through two principal frameworks: (1) Dirichlet-generated models—embedding baseline distributions in the Dirichlet generator; and (2) stochastic diffusion processes whose stationary laws are generalized Dirichlet distributions. Both frameworks expand modeling capacity for compositional data, densities on simplices, and multivariate phenomena constrained by a conservation principle (Bakosi et al., 2013, Arashi et al., 2019).

1. Formal Constructions and Definitions

Two distinct but closely related constructions underlie FD families.

Dirichlet-generated (Beta-generated-multivariate) construction:

Let $p \geq 2$. For each $i=1,\ldots,p$, let $G_i(x_i)$ be the CDF of a baseline distribution (commonly Gamma($\beta_i$, $\theta_i$)), and let $F(u_1, \ldots, u_p)$ be the joint CDF of the first $p$ coordinates of a $(p+1)$-dimensional Dirichlet($\alpha_1, \ldots, \alpha_{p+1}$), supported on $\{(y_1,\ldots,y_p):\, y_i > 0,\, \sum_{i=1}^p y_i < 1\}$. The FD cumulative distribution function is

$H(x_1,\dots,x_p) = F\left(G_1(x_1), G_2(x_2), ..., G_p(x_p)\right)$

The joint PDF is

$f(x) = \frac{1}{B(\boldsymbol\alpha)}\left(1 - \sum_{i=1}^p G_i(x_i)\right)^{\alpha_{p+1}-1} \prod_{i=1}^p g_i(x_i) G_i(x_i)^{\alpha_i-1}$

where $g_i$ is the PDF of the baseline and $B(\boldsymbol\alpha)$ the multivariate Beta function. The support is $\{x_i \geq 0 : \sum_i G_i(x_i) < 1\}$ (Arashi et al., 2019).
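The joint density above can be evaluated directly once the baseline CDFs and PDFs are available as functions. The following is a minimal sketch, not the authors' implementation: the function names (`fd_logpdf`, `exp_cdf`, `exp_pdf`, `unif_cdf`, `unif_pdf`) are hypothetical, and the exponential baseline is used only as the Gamma special case $\beta_i = 1$ because its CDF has a closed form.

```python
import math

def fd_logpdf(x, alpha, cdfs, pdfs):
    """Log-density of the Dirichlet-generated model at the point x.

    x     : sequence of p coordinates
    alpha : sequence of p+1 Dirichlet shape parameters
    cdfs  : list of p baseline CDFs G_i (callables)
    pdfs  : list of p baseline PDFs g_i (callables)
    """
    p = len(x)
    assert len(alpha) == p + 1 and len(cdfs) == p and len(pdfs) == p
    G = [cdfs[i](x[i]) for i in range(p)]
    g = [pdfs[i](x[i]) for i in range(p)]
    rest = 1.0 - sum(G)
    # Outside the support {sum_i G_i(x_i) < 1} the density is zero.
    if rest <= 0.0 or min(G) <= 0.0 or min(g) <= 0.0:
        return -math.inf
    # log B(alpha) = sum_i lgamma(alpha_i) - lgamma(sum_i alpha_i)
    log_beta = sum(math.lgamma(a) for a in alpha) - math.lgamma(sum(alpha))
    out = -log_beta + (alpha[p] - 1.0) * math.log(rest)
    for i in range(p):
        out += math.log(g[i]) + (alpha[i] - 1.0) * math.log(G[i])
    return out

# Hypothetical baseline helpers (assumptions for illustration only).
def exp_cdf(theta):
    return lambda x: 1.0 - math.exp(-x / theta) if x > 0 else 0.0

def exp_pdf(theta):
    return lambda x: math.exp(-x / theta) / theta if x > 0 else 0.0

def unif_cdf(x):
    return min(max(x, 0.0), 1.0)

def unif_pdf(x):
    return 1.0 if 0.0 < x < 1.0 else 0.0

# Uniform baselines: the density reduces to an ordinary Dirichlet density.
val = fd_logpdf((0.2, 0.3), (2.0, 3.0, 1.0), [unif_cdf] * 2, [unif_pdf] * 2)

# Exponential baselines: a point with G_1 + G_2 >= 1 lies outside the support.
outside = fd_logpdf((5.0, 5.0), (2.0, 3.0, 1.0),
                    [exp_cdf(1.0)] * 2, [exp_pdf(1.0)] * 2)
```

With uniform baselines and $\boldsymbol\alpha = (2, 3, 1)$, the density at $(0.2, 0.3)$ is $60 \cdot 0.2 \cdot 0.3^2 = 1.08$, which `math.exp(val)` reproduces. Passing the baselines as callables keeps the sketch baseline-agnostic.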

Stochastic diffusion construction (Generalized Dirichlet diffusion):

Let $(X_1, \ldots, X_N)$ be a vector-valued Itô diffusion confined to the simplex $\sum_{i=1}^N X_i = 1$, $X_i \geq 0$. The Fokker–Planck operator is constructed so that its unique stationary distribution is Lochner’s generalized Dirichlet law,

$f(x_1,...,x_N) = \prod_{i=1}^{N-1}\frac{\Gamma(\alpha_i+\beta_i)}{\Gamma(\alpha_i)\Gamma(\beta_i)} x_i^{\alpha_i-1} (1-\sum_{k=1}^i x_k)^{\gamma_i} \times x_N^{\alpha_N-1}$

where the exponents $\gamma_i$ are functions of $\alpha$ and $\beta$; the exact parameter coupling is given in Bakosi et al. (2013).

2. Distributional Properties and Parameterizations

Both frameworks provide substantial modeling flexibility by decoupling marginal shape and dependence constraints present in classical Dirichlet models.

Parameters:
- Dirichlet-generated model: $p$ pairs of baseline distribution parameters ($\beta_i, \theta_i$ per component) and $p+1$ Dirichlet shape parameters $(\alpha_1,\ldots,\alpha_{p+1})$.
- Generalized Dirichlet: $2N$ parameters $(\alpha_i, \beta_i)$, with $2(N-1)$ degrees of freedom due to simplex constraint.

Marginals:

Each $X_i$ marginal is a univariate beta-generated (e.g., beta-Gamma) distribution. Writing $\alpha_+ = \sum_{j=1}^{p+1}\alpha_j$ for the total of the Dirichlet shape parameters,

$h_i(x_i) = \frac{1}{B(\alpha_i,\alpha_+-\alpha_i)} g_i(x_i) G_i(x_i)^{\alpha_i-1}(1-G_i(x_i))^{\alpha_+-\alpha_i-1}$

This enables control over skewness, tails, and modality not available in the fixed Beta( $\cdot$ ) marginals of the standard Dirichlet.
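Because the FD CDF is a Dirichlet CDF evaluated at $G_i(x_i)$, one way to simulate from the Dirichlet-generated model is to draw a Dirichlet vector $Y$ and set $X_i = G_i^{-1}(Y_i)$. The sketch below illustrates this under stated assumptions: exponential baselines (the Gamma case $\beta_i = 1$, chosen because the quantile function is explicit), illustrative parameter values, and hypothetical helper names. It then checks that $G_1(X_1)$ behaves like a Beta($\alpha_1$, $\alpha_+ - \alpha_1$) variable by comparing means.

```python
import math
import random

def sample_dirichlet(alpha, rng):
    """Draw one Dirichlet(alpha) vector by normalizing independent Gamma draws."""
    g = [rng.gammavariate(a, 1.0) for a in alpha]
    s = sum(g)
    return [v / s for v in g]

def sample_fd_exponential(alpha, theta, rng):
    """Draw X with X_i = G_i^{-1}(Y_i) for exponential baselines of scale theta_i."""
    y = sample_dirichlet(alpha, rng)[:-1]  # keep the first p coordinates
    # Quantile function of the exponential distribution: -theta * log(1 - u)
    return [-t * math.log(1.0 - u) for u, t in zip(y, theta)]

rng = random.Random(0)          # fixed seed so the run is repeatable
alpha = (2.0, 3.0, 4.0)         # illustrative values, p = 2
theta = (1.0, 2.0)
draws = [sample_fd_exponential(alpha, theta, rng) for _ in range(20000)]

# Map X_1 back through G_1 and compare with the Beta mean alpha_1 / alpha_+.
u1 = [1.0 - math.exp(-x[0] / theta[0]) for x in draws]
mean_u1 = sum(u1) / len(u1)
beta_mean = alpha[0] / sum(alpha)
```

The shape of each marginal on the original scale is then controlled by the baseline, while the Dirichlet parameters govern how mass is distributed on the probability scale.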

Dependence:

Both positive and negative covariances are achievable. The sign and strength of $\operatorname{Cov}(X_i,X_j)$ depend on $\alpha_{p+1}$ (or $\beta_j$ ), whereas all classical Dirichlet off-diagonal covariances are strictly negative. In the generalized Dirichlet, sequential partitions admit a lower-triangular dependence structure, supporting arbitrary sign (Bakosi et al., 2013, Arashi et al., 2019).

3. Moment Structure and Correlations

Closed forms are available for expectations, variances, and covariances in both constructions.

First moments (Dirichlet-generated):

$E[X_i] = \frac{\theta_i\,\Gamma(1+\beta_i)}{\Gamma(\beta_i)}\, \frac{B(\alpha_i+1,\alpha_+-\alpha_i)}{B(\alpha_i,\alpha_+-\alpha_i)}\, \frac{I(...) }{B(\boldsymbol\alpha)}$

where $I$ is an auxiliary multiple Beta-integral (Arashi et al., 2019).

Second moments and covariances:

$\operatorname{Var}[X_i]=E[X_i^2] - (E[X_i])^2$

$\operatorname{Cov}[X_i,X_j] = E[X_i X_j] - E[X_i]E[X_j]$

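Generalized Dirichlet explicit formulæ: For $i>1$,

$\operatorname{Var}[X_i] = \mu_i\left( \frac{\alpha_i+1}{\alpha_i+\beta_i+1} M_{i-1} - \mu_i\right)$

$\operatorname{Cov}[X_i,X_j] = \mu_j\left( \frac{\alpha_i}{\alpha_i+\beta_i+1} M_{i-1} - \mu_i\right) \quad (i < j)$

where $\mu_i = E[X_i] = \frac{\alpha_i}{\alpha_i+\beta_i}\prod_{k=1}^{i-1}\frac{\beta_k}{\alpha_k+\beta_k}$ and $M_{i-1} = \prod_{k=1}^{i-1}\frac{\beta_k+1}{\alpha_k+\beta_k+1}$ (Bakosi et al., 2013). The same expressions hold for $i=1$ with the empty product $M_0 = 1$.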

Possible correlation patterns:

$X_1$ is always negatively correlated with every other component; for $j>1$, $\operatorname{Cov}(X_j, X_m)$ can be made positive or negative through the choice of $(\alpha, \beta)$.
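The generalized Dirichlet moment formulas can be checked numerically. The sketch below is illustrative only: it assumes the standard stick-breaking representation of the generalized Dirichlet ($X_i = Z_i \prod_{k<i}(1-Z_k)$ with independent $Z_k \sim$ Beta($\alpha_k$, $\beta_k$)), which is not spelled out in this article, and takes $\mu_i$ and $M_{i-1}$ to be the products implied by that representation. Function names and parameter values are hypothetical, and indices in the code are 0-based.

```python
import random

def gd_mean(i, alpha, beta):
    """E[X_i] under the stick-breaking representation (0-based i)."""
    m = alpha[i] / (alpha[i] + beta[i])
    for k in range(i):
        m *= beta[k] / (alpha[k] + beta[k])
    return m

def gd_M_prev(i, alpha, beta):
    """Product over k < i of (beta_k + 1) / (alpha_k + beta_k + 1); 1 if i == 0."""
    m = 1.0
    for k in range(i):
        m *= (beta[k] + 1.0) / (alpha[k] + beta[k] + 1.0)
    return m

def gd_cov(i, j, alpha, beta):
    """Cov[X_i, X_j] for i <= j; i == j returns the variance."""
    assert i <= j
    a, b = alpha[i], beta[i]
    lead = (a + 1.0 if i == j else a) / (a + b + 1.0)
    return gd_mean(j, alpha, beta) * (
        lead * gd_M_prev(i, alpha, beta) - gd_mean(i, alpha, beta))

def sample_gd(alpha, beta, rng):
    """One stick-breaking draw of (X_1, ..., X_K); the remainder is 1 - sum."""
    x, remaining = [], 1.0
    for a, b in zip(alpha, beta):
        z = rng.betavariate(a, b)
        x.append(z * remaining)
        remaining *= 1.0 - z
    return x

def sample_cov(draws, i, j):
    """Plain Monte Carlo estimate of Cov[X_i, X_j]."""
    n = len(draws)
    mi = sum(d[i] for d in draws) / n
    mj = sum(d[j] for d in draws) / n
    return sum((d[i] - mi) * (d[j] - mj) for d in draws) / n

alpha = (2.0, 5.0, 1.0)   # illustrative values
beta = (1.0, 5.0, 1.0)
rng = random.Random(1)
draws = [sample_gd(alpha, beta, rng) for _ in range(50000)]

cov_12 = gd_cov(0, 1, alpha, beta)   # involves X_1: negative
cov_23 = gd_cov(1, 2, alpha, beta)   # positive for these parameters
```

For these parameters the closed forms give $\operatorname{Cov}[X_1, X_2] = -1/36$ and $\operatorname{Cov}[X_2, X_3] = 1/198$, so a positive covariance between later components coexists with the negative covariance involving $X_1$; `sample_cov(draws, 0, 1)` and `sample_cov(draws, 1, 2)` give simulation estimates to compare against.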

4. Parameter Estimation and Model Fitting

Maximum likelihood estimation is the standard approach for both FD constructions.

Log-likelihood:

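$\ell(\boldsymbol\psi) = -n\log B(\boldsymbol\alpha) + \sum_{k=1}^{n}\sum_{i=1}^{p} (\alpha_i-1)\log G_i(x_{k,i}) + (\alpha_{p+1}-1)\sum_{k=1}^{n} \log\left(1-\sum_{i=1}^{p} G_i(x_{k,i})\right) + \sum_{k=1}^{n}\sum_{i=1}^{p}\log g_i(x_{k,i})$

where $x_{k,i}$ is the $i$-th coordinate of the $k$-th of $n$ independent observations and $\boldsymbol\psi$ collects the Dirichlet and baseline parameters.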

Optimization:

Score equations involve digamma functions and derivatives of the baseline CDF/PDF; no closed-form solutions are available, but standard optimization routines (Newton–Raphson, quasi-Newton, optim() in R) yield reliable convergence for moderate $p$ and sample size (Arashi et al., 2019).
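As a rough illustration of the objective such routines would maximize, the sketch below sums the log-density over $n$ observations. It is an assumption-laden example rather than the authors' code: it uses exponential baselines (Gamma with $\beta_i = 1$) so that $G_i$ is available in closed form from the standard library, and the function name `fd_loglik_exponential` is hypothetical.

```python
import math

def fd_loglik_exponential(data, alpha, theta):
    """Log-likelihood of the Dirichlet-generated model with exponential baselines.

    data  : list of n observations, each a sequence of p positive numbers
    alpha : p+1 Dirichlet shape parameters
    theta : p exponential scale parameters
    """
    n, p = len(data), len(theta)
    assert len(alpha) == p + 1
    log_beta = sum(math.lgamma(a) for a in alpha) - math.lgamma(sum(alpha))
    total = -n * log_beta
    for x in data:
        G = [1.0 - math.exp(-x[i] / theta[i]) for i in range(p)]
        rest = 1.0 - sum(G)
        # Any observation outside the support makes the likelihood zero.
        if rest <= 0.0 or min(G) <= 0.0:
            return -math.inf
        total += (alpha[p] - 1.0) * math.log(rest)
        for i in range(p):
            log_g = -x[i] / theta[i] - math.log(theta[i])  # log exponential PDF
            total += (alpha[i] - 1.0) * math.log(G[i]) + log_g
    return total

# Single observation with p = 1 and all shape parameters equal to 1:
# only the baseline log-density -x/theta - log(theta) remains.
check = fd_loglik_exponential([(0.5,)], (1.0, 1.0), (1.0,))
```

Returning `-math.inf` outside the support matters in practice: the support depends on the baseline parameters, so an optimizer can step to values of $\theta$ for which some observations violate $\sum_i G_i(x_i) < 1$. The negated function can be handed to any general-purpose minimizer.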

Identifiability:

The SDE-to-parameter mapping in the diffusion construction is many-to-one: different SDE coefficients may yield the same $(\alpha, \beta)$, but each SDE determines a unique stationary law.

5. Flexibility, Support, and Special Cases

The FD family allows for custom support, tail behavior, and degenerate cases.

Support:
- Classical Dirichlet: Simplex $\{y_i>0, \sum y_i=1\}$.
- FD: $\{x_i\ge0 : \sum G_i(x_i) < 1\}$, a wedge in $\mathbb{R}^p$. Choice of $G_i$ allows custom marginal supports.

Tail behavior:
- With a Gamma baseline, each $X_i$ exhibits an exponential tail modulated by a polynomial factor $x_i^{\beta_i-1}$.
- Using Pareto, Weibull, Fréchet, etc., as baseline in $G_i$ allows modeling of heavy tails.

Special cases:
- Uniform baselines and $\alpha_{p+1}=1$ recover classical Dirichlet.
- Decoupling: $\alpha_i=1$ and $\alpha_{p+1}=1$ yields independent marginals.

Relations to other families:
- Unifies classical Dirichlet, Liouville, beta-generated univariate families, and the generalized Dirichlet as special or limiting cases.

6. Applications and Empirical Performance

FD models are well suited to compositional data analysis and mixture modeling where flexible marginal and joint behavior is required.

Real data examples:
- Pekin duck serum proteins (p=3): FD model captures extreme outliers, with lower AIC/BIC and KS distance than the Dirichlet (Arashi et al., 2019).
- White-cell counts (p=3): With negative correlations, FD model provides superior fit (assessed by QQ-plots, contours, and model selection metrics).

Model testing:

Goodness of fit can be assessed with new Kolmogorov–Smirnov (KS) test techniques based on the empirical CDF (Arashi et al., 2019).

Interpretation:

FD models handle both negative and positive dependencies as they arise in real-world compositional, phase-fraction, and biological data—features unattainable by standard Dirichlet models or Dirichlet diffusion processes (Bakosi et al., 2013, Arashi et al., 2019).

7. Conditional Distributions and Extensions

Marginals:

Each $X_i$ is a beta-generated baseline variable (Arashi et al., 2019).

Conditionals:

Conditionals are again beta-generated, with updated Dirichlet parameters depending on the observed values.

Generalization:

The constructions are framework-agnostic: any continuous baseline with tractable $G_i$ can be inserted, and the generator need not be limited to Dirichlet forms if more complex correlation structures are required.

Stochastic modeling:

In the diffusion framework, the FD law arises as the unique long-time law for a class of multidimensional Itô processes, providing an avenue for physical or biological systems modeling with conservation constraints and custom dependence patterns (Bakosi et al., 2013).

In summary, the Flexible Dirichlet family—encompassing Dirichlet-generated and generalized Dirichlet diffusion models—extends classical compositional modeling to a general class parameterized by both baseline distribution and generator characteristics. This enables closed-form marginals and higher moments, support for both signs of correlation, user-controlled tail behavior, and tractable parameter estimation for applications in compositional data, mixture models, and systems governed by conservation principles (Bakosi et al., 2013, Arashi et al., 2019).


References

Bakosi et al. (2013). A stochastic diffusion process for Lochner's generalized Dirichlet distribution.

Arashi et al. (2019). Developing multivariate distributions using Dirichlet generator.
