Beta-Binomial Distribution
- Beta-binomial distribution is a compound model combining binomial outcomes with beta-distributed probabilities to naturally account for overdispersion.
- It underpins Bayesian predictive inference by updating beta priors with observed binomial data, yielding closed-form posterior and predictive densities.
- Extensions like the tilted beta-binomial and generalized score distribution enhance modeling flexibility for varied dispersion regimes and high-dimensional data.
The beta-binomial distribution is a compound discrete distribution arising when the probability parameter of a binomial distribution is treated as a random variable following a beta distribution. Its flexibility and analytic tractability make it a foundational tool for Bayesian inference, overdispersion modeling, credible region construction, and regression in applied statistics and machine learning.
1. Definition and Formal Properties
Given and , the marginal distribution of is the beta-binomial: where denotes the beta function (Cifuentes-Amado et al., 2019, Sharony, 2017).
The parameters and serve as shape parameters for the underlying beta prior. The mean and variance of are: The variance exceeds that of the binomial distribution with the same mean unless , capturing overdispersion. Higher factorial moments take the form: where and denotes the Pochhammer symbol (Cifuentes-Amado et al., 2019).
2. Bayesian Predictive Inference and Posterior Updating
The beta-binomial arises naturally as the posterior predictive for a future binomial count when Bayesian inference is performed with a beta prior. Given observed data from trials and parameters , the posterior on is . The predictive density for a new sample of trials is: (Hamura, 2021, Westphal, 2019).
The Bayesian estimator for , under entropy loss, is the posterior mean:
Truncated beta priors (with either upper or two-sided support restrictions) yield analogous expressions involving incomplete beta functions and ratios of incomplete integrals for posterior and predictive calculations (Hamura, 2021).
3. Overdispersion, Extensions, and Regression Models
The standard beta-binomial models overdispersion through the stochasticity of . Alternative formulations extend this property:
- Tilted beta-binomial: Employs a convex mixture of a mean-tilted polynomial density and a beta law for . The marginal PMF is:
where is the standard beta-binomial term, and derives from a polynomial-tilted law (Cifuentes-Amado et al., 2019). This model can be embedded in GLM-style regression with links on , , and , and estimated via MCMC methods.
- Generalised Score Distribution (GSD): Provides an underdispersed continuation of the beta-binomial by parameterizing the mean and variance to fully cover the feasible mean-variance space on finite discrete supports. The regime recovers a reparameterized beta-binomial; for , the GSD uses a mixture of binomial and deterministic components, enabling modeling of both over- and under-dispersion beyond the beta-binomial's variance floor (Ćmiel et al., 2022).
| Model | Dispersion Regime | Extra Parameters |
|---|---|---|
| Beta-binomial | Overdispersion | |
| Tilted beta-binomial | Enhanced overdisp. | |
| GSD | Under/Overdisp. |
4. Multivariate Beta-Binomial and High-Dimensional Inference
The multivariate beta-binomial (Westphal, 2019) models correlated binomial marginals by inducing an -dimensional beta distribution as the push-forward of a -dimensional Dirichlet. The marginal distributions are beta-binomial, but with controlled correlations: Posterior updates and credible region construction utilize either the full Dirichlet structure or a reduced parameterization with first and second moments. For , computational tractability favors the latter. Gaussian copulas are used to approximate marginals and joint distribution, outperforming normal approximations for small samples or many dimensions and ensuring credible regions remain within the lawful parameter space.
5. Risk, Dominance, and Predictive Performance
Under Kullback–Leibler divergence loss, the total risk for the beta-binomial Bayes predictive decomposes into a sum of one-step-ahead sub-risks (Hamura, 2021): For restricted parameter regions, truncated beta priors can yield predictive estimators with uniformly lower risk than the unrestricted beta prior, subject to conditions on the truncation region (e.g., for when ). This dominance is evidenced both theoretically and numerically, with similar conditions for two-sided truncation (Hamura, 2021).
6. Algorithmic, Computational, and Application Contexts
In sequential decision-making, the beta-binomial is central for online optimization scenarios such as multi-armed bandit problems, providing analytic expressions (in terms of hypergeometric functions) for comparing payout rates between stochastic processes (Sharony, 2017). Compared to brute force Monte Carlo, such analytic reductions offer orders-of-magnitude speedups at the same numerical precision.
Estimation for beta-binomial and GSD models can be handled by the method of moments, maximum likelihood, or Bayesian approaches. For GSD, explicit formulas for MLE and moment equations enable efficient parameter recovery; for regression and hierarchical modeling, standard MCMC packages such as OpenBUGS are used, leveraging closed-form PMFs (Cifuentes-Amado et al., 2019, Ćmiel et al., 2022).
The beta-binomial forms a fundamental predictive tool in Bayesian statistics, supports robust inference under uncertainty and overdispersion, and, through its extensions, provides comprehensive modeling of discrete count data in both univariate and multivariate high-dimensional settings.
Sponsored by Paperpile, the PDF & BibTeX manager trusted by top AI labs.
Get 30 days free