Papers
Topics
Authors
Recent
Search
2000 character limit reached

Flexible Dirichlet Models

Updated 11 March 2026
  • Flexible Dirichlet (FD) is a class of multivariate distributions that generalizes the classical Dirichlet by integrating custom baseline distributions and generator functions.
  • It supports tailor-made marginal behaviors and enables both positive and negative correlations, overcoming limits of standard Dirichlet models.
  • FD models are applied in compositional data analysis and mixture modeling, offering improved fit and interpretability in real-world datasets.

The Flexible Dirichlet (FD) family refers to a class of multivariate probability distributions that generalize the classical Dirichlet, introducing greater flexibility in capturing marginal behaviors, dependence structure, and support. This is achieved through two principal frameworks: (1) Dirichlet-generated models—embedding baseline distributions in the Dirichlet generator; and (2) stochastic diffusion processes whose stationary laws are generalized Dirichlet distributions. Both frameworks expand modeling capacity for compositional data, densities on simplices, and multivariate phenomena constrained by a conservation principle (Bakosi et al., 2013, Arashi et al., 2019).

1. Formal Constructions and Definitions

Two distinct but closely related constructions underlie FD families.

Dirichlet-generated (Beta-generated-multivariate) construction:

Let p2p \geq 2. For each i=1,,pi=1,\ldots,p, let Gi(xi)G_i(x_i) be the CDF of a baseline distribution (commonly Gamma(βi\beta_i, θi\theta_i)), and F(u1,...,up)F(u_1, ..., u_p) the CDF of a (p+1)(p+1)-dimensional Dirichlet(α1,...,αp+1\alpha_1, ..., \alpha_{p+1}) on the simplex {(y1,...,yp):yi>0,i=1pyi<1}\{(y_1,...,y_p):\, y_i > 0,\, \sum_{i=1}^p y_i < 1\}. The FD cumulative distribution function is

H(x1,,xp)=F(G1(x1),G2(x2),...,Gp(xp))H(x_1,\dots,x_p) = F\left(G_1(x_1), G_2(x_2), ..., G_p(x_p)\right)

The joint PDF is

f(x)=1B(α)(1i=1pGi(xi))αp+11i=1pgi(xi)Gi(xi)αi1f(x) = \frac{1}{B(\boldsymbol\alpha)}\left(1 - \sum_{i=1}^p G_i(x_i)\right)^{\alpha_{p+1}-1} \prod_{i=1}^p g_i(x_i) G_i(x_i)^{\alpha_i-1}

where gig_i is the PDF of the baseline and B(α)B(\boldsymbol\alpha) the multivariate Beta function. The support is {xi0:iGi(xi)<1}\{x_i \geq 0 : \sum_i G_i(x_i) < 1\} (Arashi et al., 2019).

Stochastic diffusion construction (Generalized Dirichlet diffusion):

Let (X1,...,XN)(X_1, ..., X_N) be a vector-valued Itô diffusion always on the simplex Xi=1\sum X_i = 1, Xi0X_i \geq 0. The Fokker–Planck operator is constructed so that its unique stationary distribution is Lochner’s generalized Dirichlet law,

f(x1,...,xN)=i=1N1Γ(αi+βi)Γ(αi)Γ(βi)xiαi1(1k=1ixk)γi×xNαN1f(x_1,...,x_N) = \prod_{i=1}^{N-1}\frac{\Gamma(\alpha_i+\beta_i)}{\Gamma(\alpha_i)\Gamma(\beta_i)} x_i^{\alpha_i-1} (1-\sum_{k=1}^i x_k)^{\gamma_i} \times x_N^{\alpha_N-1}

with parameter coupling as in the text, and γi\gamma_i functions of α\alpha and β\beta (Bakosi et al., 2013).

2. Distributional Properties and Parameterizations

Both frameworks provide substantial modeling flexibility by decoupling marginal shape and dependence constraints present in classical Dirichlet models.

  • Parameters:
    • Dirichlet-generated model: pp baseline distribution parameters (βi,θi\beta_i, \theta_i per component), p+1p+1 Dirichlet shape parameters (α1,...,αp+1)(\alpha_1,...,\alpha_{p+1}).
    • Generalized Dirichlet: $2N$ parameters (αi,βi)(\alpha_i, \beta_i), with $2(N-1)$ degrees of freedom due to simplex constraint.
  • Marginals:

Each XiX_i marginal is a univariate beta-generated (e.g., beta-Gamma) distribution:

hi(xi)=1B(αi,α+αi)gi(xi)Gi(xi)αi1(1Gi(xi))α+αi1h_i(x_i) = \frac{1}{B(\alpha_i,\alpha_+-\alpha_i)} g_i(x_i) G_i(x_i)^{\alpha_i-1}(1-G_i(x_i))^{\alpha_+-\alpha_i-1}

This enables control over skewness, tails, and modality not available in the fixed Beta(\cdot) marginals of the standard Dirichlet.

  • Dependence:

Both positive and negative covariances are achievable. The sign and strength of Cov(Xi,Xj)\operatorname{Cov}(X_i,X_j) depend on αp+1\alpha_{p+1} (or βj\beta_j), whereas all classical Dirichlet off-diagonal covariances are strictly negative. In the generalized Dirichlet, sequential partitions admit a lower-triangular dependence structure, supporting arbitrary sign (Bakosi et al., 2013, Arashi et al., 2019).

3. Moment Structure and Correlations

Closed forms are available for expectations, variances, and covariances in both constructions.

  • First moments (Dirichlet-generated):

E[Xi]=θiΓ(1+βi)Γ(βi)B(αi+1,α+αi)B(αi,α+αi)I(...)B(α)E[X_i] = \frac{\theta_i\,\Gamma(1+\beta_i)}{\Gamma(\beta_i)}\, \frac{B(\alpha_i+1,\alpha_+-\alpha_i)}{B(\alpha_i,\alpha_+-\alpha_i)}\, \frac{I(...) }{B(\boldsymbol\alpha)}

where II is an auxiliary multiple Beta-integral (Arashi et al., 2019).

  • Second moments and covariances:

Var[Xi]=E[Xi2](E[Xi])2\operatorname{Var}[X_i]=E[X_i^2] - (E[X_i])^2

Cov[Xi,Xj]=E[XiXj]E[Xi]E[Xj]\operatorname{Cov}[X_i,X_j] = E[X_i X_j] - E[X_i]E[X_j]

  • Generalized Dirichlet explicit formulæ: For i>1i>1,

Var[Xi]=μi(αi+1αi+βi+1Mi1μi)\operatorname{Var}[X_i] = \mu_i\left( \frac{\alpha_i+1}{\alpha_i+\beta_i+1} M_{i-1} - \mu_i\right)

Cov[Xi,Xj]=μj(αiαi+βi+1Mi1μi)(ij)\operatorname{Cov}[X_i,X_j] = \mu_j\left( \frac{\alpha_i}{\alpha_i+\beta_i+1} M_{i-1} - \mu_i\right) \quad (i \neq j)

where μi\mu_i and Mi1M_{i-1} as defined above (Bakosi et al., 2013).

  • Possible correlation patterns:

X1X_1 is always negatively correlated with the rest; for j>1j>1, Cov(Xj,Xm)\operatorname{Cov}(X_j, X_m) is sign-switchable via hyperparameter selection.

4. Parameter Estimation and Model Fitting

Maximum likelihood estimation is the standard approach for both FD constructions.

  • Log-likelihood:

(ψ)=logB(α)+k,i(αi1)logGi(xk,i)+(αp+11)klog(1iGi(xk,i))+k,iloggi(xk,i)\ell(\boldsymbol\psi) = -\log B(\boldsymbol\alpha) + \sum_{k,i} (\alpha_i-1)\log G_i(x_{k,i}) + (\alpha_{p+1}-1)\sum_k \log\left(1-\sum_i G_i(x_{k,i})\right) + \sum_{k,i}\log g_i(x_{k,i})

  • Optimization:

Score equations involve digamma functions and derivatives of the baseline CDF/PDF; no closed-form solutions are available, but standard optimization routines (Newton–Raphson, quasi-Newton, optim() in R) yield reliable convergence for moderate pp and sample size (Arashi et al., 2019).

  • Identifiability:

The SDE-to-parameter mapping in the diffusion construction is many-to-one: different SDE coefficients may yield the same (α,β)(\alpha, \beta), but each SDE determines a unique stationary law.

5. Flexibility, Support, and Special Cases

The FD family allows for custom support, tail behavior, and degenerate cases.

  • Support:
    • Classical Dirichlet: Simplex {yi>0,yi=1}\{y_i>0, \sum y_i=1\}.
    • FD: {xi0:Gi(xi)<1}\{x_i\ge0 : \sum G_i(x_i) < 1\}, a wedge in Rp\mathbb{R}^p. Choice of GiG_i allows custom marginal supports.
  • Tail behavior:
    • With a Gamma baseline, each XiX_i exhibits exponential tail modulated by a polynomial xiβi1x_i^{\beta_i-1}.
    • Using Pareto, Weibull, Fréchet, etc., as baseline in GiG_i allows modeling of heavy tails.
  • Special cases:
    • Uniform baselines and αp+1=1\alpha_{p+1}=1 recover classical Dirichlet.
    • Decoupling: αi=1\alpha_i=1 and αp+1=1\alpha_{p+1}=1 yields independent marginals.
  • Relations to other families:
    • Unifies classical Dirichlet, Liouville, beta-generated univariate families, and the generalized Dirichlet as special or limiting cases.

6. Applications and Empirical Performance

FD models are notably suitable for compositional data analysis and mixture modeling where flexible marginal and joint behavior is required.

  • Real data examples:
    • Pekin duck serum proteins (p=3): FD model captures extreme outliers, with lower AIC/BIC and KS distance than the Dirichlet (Arashi et al., 2019).
    • White-cell counts (p=3): With negative correlations, FD model provides superior fit (assessed by QQ-plots, contours, and model selection metrics).
  • Model testing:

New empirical-cdf KS test techniques are available for goodness-of-fit assessment (Arashi et al., 2019).

  • Interpretation:

FD models handle both negative and positive dependencies as they arise in real-world compositional, phase-fraction, and biological data—features unattainable by standard Dirichlet models or Dirichlet diffusion processes (Bakosi et al., 2013, Arashi et al., 2019).

7. Conditional Distributions and Extensions

  • Marginals:

Each XiX_i is a beta-generated baseline variable (Arashi et al., 2019).

  • Conditionals:

Conditionals are again beta-generated, with updated Dirichlet parameters depending on the observed values.

  • Generalization:

The constructions are framework-agnostic: any continuous baseline with tractable GiG_i can be inserted, and the generator need not be limited to Dirichlet forms if more complex correlation structures are required.

  • Stochastic modeling:

In the diffusion framework, the FD law arises as the unique long-time law for a class of multidimensional Itô processes, providing an avenue for physical or biological systems modeling with conservation constraints and custom dependence patterns (Bakosi et al., 2013).


In summary, the Flexible Dirichlet family—encompassing Dirichlet-generated and generalized Dirichlet diffusion models—extends classical compositional modeling to a general class parameterized by both baseline distribution and generator characteristics. This enables closed-form marginals and higher moments, support for both signs of correlation, user-controlled tail behavior, and tractable parameter estimation for applications in compositional data, mixture models, and systems governed by conservation principles (Bakosi et al., 2013, Arashi et al., 2019).

Definition Search Book Streamline Icon: https://streamlinehq.com
References (2)

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Flexible Dirichlet (FD).