Conjugate GLMMs (CGLMMs)
- Conjugate GLMMs are a class of mixed models that enable closed-form marginal likelihood computation through conjugacy between exponential family likelihoods and random effects.
- They integrate conjugate random effects to capture overdispersion and Gaussian random effects to model intra-cluster correlations, ensuring efficient and exact inference.
- CGLMMs unify common models like Gaussian, Poisson, beta-binomial, and gamma, reducing computational burden while accurately addressing hierarchical data structures.
A conjugate generalized linear mixed model (CGLMM) is a class of mixed models for clustered and longitudinal data, distinguished by the feature that, for particular choices of random effects and link functions, the marginal likelihood can be computed in closed form without recourse to numerical integration. CGLMMs unify several well-known models—including Gaussian, Poisson, negative-binomial, beta-binomial, and gamma models—under a single analytic framework. Central to the construction is the conjugacy between the exponential family conditional likelihood and the distribution of (latent) random effects, facilitating analytic marginalization. This enables efficient and exact maximum-likelihood inference in settings that exhibit both overdispersion and hierarchical multilevel structure (Molenberghs et al., 2011, Lee et al., 2017).
1. Model Formulation: Hierarchical and Conjugate Structure
In the general CGLMM framework, the observed outcome for subject (or cluster) and measurement is modeled conditionally on two types of latent random effects:
- Conjugate random effects capture overdispersion—extra variability not explained by the canonical exponential-family model. The form of , the distribution of , is chosen to be conjugate to the conditional likelihood (e.g., Gamma for Poisson, Beta for Bernoulli) (Molenberghs et al., 2011).
- Normal (Gaussian) random effects of dimension induce correlation within clusters or repeated measures, capturing subject-level heterogeneity.
The CGLMM is typically specified as follows:
where is the canonical parameter, and the conditional mean satisfies: 0 with link function 1, fixed effects 2, and design matrices 3.
The conjugate random effect modifies the mean at the observation level: 4 where 5 is a function of the fixed and normal random effects.
2. Exponential Family Structure and Conjugacy Conditions
Marginalization of the random effects relies on expressing both the outcome distribution and the random-effects prior in compatible (conjugate) exponential-family form (Lee et al., 2017). For a one-parameter exponential family: 6 with canonical parameter 7 and cumulant 8, conjugate priors take: 9
Closed-form marginalization is possible if the mapping 0 and 1 induced by covariates is affine in 2 and 3: 4 Those conditions are satisfied for Gaussian, Poisson, and gamma models with nontrivial choices of unit- and group-level covariates, and for the binomial under group-level predictors alone.
3. Marginal Likelihood and Analytic Marginalization
The likelihood contribution for subject 5 is expressed as: 6 Analytic marginalization is performed over the conjugate random effect 7, leveraging conjugacy to obtain, e.g., negative-binomial or beta-binomial-type densities. The remaining integral over the normal random effects 8 is handled via classical approaches such as Gauss–Hermite quadrature or Laplace approximation (Molenberghs et al., 2011).
A key property of CGLMMs is that for a wide range of settings—including those with both group- and unit-level covariates in Gaussian, Poisson, and gamma families—the entire marginal likelihood is available in closed form, yielding substantial computational advantages (Lee et al., 2017).
4. Key Special Cases and Model Families
Table 1 summarizes the principal CGLMM special cases for clustered data (Lee et al., 2017):
| Family | Conditional Model | Conjugate Prior | Marginal |
|---|---|---|---|
| Gaussian | 9 | 0 | Gaussian-integrated |
| Poisson | 1 | 2 | Negative-binomial |
| Binomial | 3 | 4 | Beta-binomial |
| Gamma | 5 | 6 | Inverse-gamma |
Context and significance: For Poisson outcomes, the CGLMM recovers the negative-binomial–normal model. For binary outcomes, it yields the beta–binomial–normal model, and for continuous measures, the hierarchical normal model. In each case, overdispersion and cluster-level correlation are jointly modeled.
5. Marginal Covariance, Correlation, and Overdispersion
CGLMMs allow explicit derivation of the marginal covariance and correlation structure for repeated measures or clustered observations. In the Poisson–Gamma–Normal case, for independent 7,
8
Marginal variances thus combine overdispersion (additional variation from conjugate effects) and correlation from normal random effects (Molenberghs et al., 2011).
An immediate implication is that the closed-form correlation between observations within a cluster can be obtained analytically, facilitating inference on dependence structures and the impact of modeling choices on variance decomposition.
6. Inference, Computation, and Practical Implementation
With explicit marginal likelihoods, parameter estimation proceeds via direct maximum likelihood. After analytic marginalization over conjugate random effects, gradient and Hessian computations are facilitated, supporting classical Wald and likelihood-ratio test theory. Remaining numerical integration (over 9) is typically one-dimensional or low-dimensional, depending on the correlation structure (Molenberghs et al., 2011). In big-data and federated learning environments, groupwise sufficient statistics can be calculated and communicated, simplifying computation and supporting privacy (Lee et al., 2017).
Standard estimation techniques include adaptive quadrature, Laplace approximation, or expectation-maximization (when treating random effects as missing data). In applied practice, platforms such as SAS NLMIXED can accommodate CGLMMs by analytic specification of the closest closed-form (Molenberghs et al., 2011).
7. Illustrative Data Applications and Model Diagnostics
CGLMMs have been applied to epileptic seizure counts, clinical trials on onychomycosis, and childhood asthma survival. For the Poisson–Gamma–Normal CGLMM fit to epileptic seizure data with 45 placebo and 44 active subjects and weekly repeated counts, maximum likelihood estimates yield both overdispersion and random effect variance: 0 (s.e. 0.2113) and 1 (s.e. 0.1850). The overdispersion index, 2, quantifies the departure from Poisson variance (Molenberghs et al., 2011).
In treatment comparison, inference may differ from simpler mixed models; in this example, slope differences found to be significant under the Poisson–Normal disappear under the fuller CGLMM that accommodates overdispersion.
Closed-form marginal correlations allow model comparison and diagnostic assessment, with combined model correlations systematically below those from purely normal-effects models, reflecting the greater absorption of variance by the independent conjugate effects.
Primary references: (Molenberghs et al., 2011, Lee et al., 2017).