Multivariate Bernoulli Distribution
- The multivariate Bernoulli distribution is a probability model for binary vectors defined with fixed margins and potential dependence structures via pairwise correlations.
- Its joint density can be represented as a convex combination of extremal ray densities, forming a convex polytope subject to margin constraints.
- Algorithmic construction through linear programming ensures feasible correlation assignments, making it essential for simulation in risk aggregation and discrete multivariate modeling.
A multivariate Bernoulli distribution is a probability law for a vector of binary random variables, , defined on the discrete hypercube , with prescribed marginal distributions (each ) and, possibly, specified dependence structure such as pairwise correlation matrix . The Fréchet class of such distributions comprises all joint laws with fixed marginals . Characterizing and constructing multivariate Bernoulli laws with given margins and correlations is central to discrete multivariate modeling, combinatorics, and dependent risk simulation (Fontana et al., 2017).
1. Fréchet Class: Definition and Structure
For and margin vector with , the Fréchet class is
where denotes all distribution functions on . This class is equivalently described in terms of joint densities with the constraints: Thus, is a convex polytope cut out by the margin constraints and the simplex condition. This setup encompasses all feasible dependence structures (including both extreme positive/negative association and independence) compatible with the specified margins (Fontana et al., 2017).
2. Convex Geometry and Extremal Representation
Theorem 3.2 in (Fontana et al., 2017) formally establishes that the set of all densities with margins is a convex polytope in , whose extreme points (vertices) can be determined algebraically. Explicitly, every is a convex combination
where are the ray densities (vertices or extremal distributions) of the polytope, , and . These ray densities are computed as nonnegative solutions to the homogeneous margin-constraint system (where is the margin-constraint matrix), and generally possess support of minimal size dictated by the polytope's combinatorics (Fontana et al., 2017).
3. Polynomial and Copula-Type Expansions
Every admits a unique polynomial (Farlie-Gumbel-Morgenstern-type) expansion indexed by subsets : where , is the finite difference operator, and the parameter vector encodes dependence at all interaction orders up to (Fontana et al., 2017). The cumulative function is expressed as a multilinear polynomial in , generalizing copula representations in the discrete setting: This structure mirrors the FGM copula, but, in finite support and with matching margins, the parameters must satisfy linear constraints induced by the margin equations.
4. Compatibility and Realizability of Correlation Matrices
For a given pairwise correlation matrix , the compatibility problem is: does there exist with pairwise second moments matching
for all ? Proposition 3.1 in (Fontana et al., 2017) shows that this is the case if and only if the target vector of second moments lies in the convex hull of the second-moment columns of the ray matrix . Explicitly, the feasibility system is
where collects the pairwise moments of the rays. If a nonnegative exists, the desired law can be constructed as . If not, is infeasible, and one may project onto the feasible region to find the nearest compatible correlation structure (Fontana et al., 2017).
5. Bounding Achievable Correlations
Proposition 3.2 gives explicit bounds for each pairwise correlation: where and are the minimal and maximal pairwise moments among all rays. In terms of correlations,
The bivariate case reduces to the classical Fréchet–Hoeffding bounds: This establishes the sharp compatibility region for each pair of margins and attainable correlation (Fontana et al., 2017).
6. Algorithmic Construction and Numerical Illustration
The procedure for constructing a multivariate Bernoulli law with prescribed margins and correlations involves:
- Building the constraint matrix for the margin system ;
- Computing the extremal rays as generators of the kernel cone (e.g., via 4ti2);
- Assembling the ray matrix and the second-moment matrix ;
- Solving the linear program for feasibility;
- Forming the joint probability vector if a solution exists.
In the case , for , and desired , the algorithm yields the explicit weight vector and the corresponding joint pmf on (Fontana et al., 2017).
7. Theoretical and Practical Implications
This convex-geometric characterization unifies the study of discrete dependence, simulation, and parameter compatibility in multivariate Bernoulli laws. Through explicit polytope structure, any marginal/correlation assignment can be checked for feasibility; if feasible, extremal and mixed laws can be synthesized efficiently. The theoretical framework extends naturally to related combinatorial problems, correlation polytopes, and computational methods for high-dimensional discrete data (Fontana et al., 2017). The approach is foundational for simulation in risk aggregation, network modeling, and dependence-extremal analysis in statistics and applied probability.