Bivariate Pseudo-Poisson Model
- The topic is defined as a joint distribution where one variable is exactly Poisson and the second is modeled by a Poisson regression, enabling both equi-dispersion and over-dispersion.
- It employs linear and nonlinear conditional rate models with closed-form moments and maximum likelihood estimation to capture dependency in count data.
- Extensions include symmetric Poisson conditional models and robust goodness-of-fit tests, with applications in epidemiology, traffic safety, and health statistics.
A bivariate pseudo-Poisson distribution is a class of joint distributions for two non-negative integer-valued random variables in which one marginal distribution is exactly Poisson, and the conditional distribution of the second variable given the first is a Poisson regression (with rates that may be affine or nonlinear in the first variable). This construction enables the modeling of bivariate count data with the property that one margin is equi-dispersed while the other is over-dispersed, and it establishes an explicit, tractable dependence structure driven by the conditional mean specification (Arnold et al., 2020, Lakhani, 18 Nov 2025, Veeranna et al., 2023).
1. Model Specification and Joint Distributions
Linear Pseudo-Poisson Model
Let be non-negative integer-valued random variables. The canonical three-parameter linear pseudo-Poisson model is defined by:
- ,
- , ,
The joint probability mass function (pmf) is
The natural parameter space is . Independence is recovered when (Arnold et al., 2020, Veeranna et al., 2023).
Nonlinear Conditional Rate Extensions
The model can be extended by specifying the Poisson conditional rate via a nonlinear, bounded regression function , yielding (Lakhani, 18 Nov 2025): where typically is non-decreasing, with and . Examples include the exponential kernel and the Lomax kernel .
2. Marginal, Conditional, and Generating Functions
- Marginal of : By construction, (Arnold et al., 2020, Lakhani, 18 Nov 2025).
- Conditional of : (linear), or more generally (nonlinear) (Arnold et al., 2020, Lakhani, 18 Nov 2025).
- Marginal of : is a Poisson mixture, yielding a Neyman Type A marginal with probability generating function (pgf)
for the linear model. Hence when (Arnold et al., 2020).
- Joint pgf:
Mixed moments can be obtained via differentiation (Arnold et al., 2020, Veeranna et al., 2023).
3. Moments, Dispersion, and Dependence
For the linear model, the first and second moments and covariance are available in closed form:
Thus, is equi-dispersed for and over-dispersed otherwise.
For nonlinear conditional rate models, the sign of the correlation is determined by the sign of . The exponential kernel, for instance, yields
with , (Lakhani, 18 Nov 2025).
The generalized dispersion index (Kokonendji–Puig) for can be written
with indicating overdispersion (Veeranna et al., 2023).
4. Parameter Estimation and Inference
Parameter estimation procedures for the pseudo-Poisson model are direct:
- For the linear form, the log-likelihood for i.i.d. observations is
The estimator for is explicit, , while and are obtained by solving the score equations numerically (e.g., by Newton–Raphson) (Arnold et al., 2020, Veeranna et al., 2023).
- The Fisher information matrix is available in closed form as a function of the parameters and moments of .
- Maximum likelihood estimators (MLEs) are consistent and asymptotically normal. Likelihood-ratio tests can be formed for nested submodels (e.g., for independence) (Arnold et al., 2020).
For nonlinear pseudo-Poisson models, estimation is also via MLE, with parameter-specific score equations. The estimator for remains (Lakhani, 18 Nov 2025).
5. Goodness-of-Fit Testing
A range of goodness-of-fit (GoF) tests are available for the pseudo-Poisson model (Veeranna et al., 2023):
| Test Class | Principle | Comments |
|---|---|---|
| Supremum (pgf-based) | Maximum standardized absolute deviation of empirical vs theoretical pgf over grid | Bootstrap for critical values |
| Fisher-index–based | Difference between empirical and model-based generalized dispersion index (GDI) | Not consistent vs all alternatives, simple |
| Muñoz–Gamero quadratic (MG) | Integrated squared deviation of empirical minus model pgf against weight | Bootstrap for critical values |
| Pointwise KK | Standardized difference at fixed | Highly sensitive to location |
| Classical | Cell-based comparison of observed and expected counts | Simple, least powerful |
Supremum and MG tests are consistent against broad alternatives and show good power properties given moderate sample size (). Fisher-index–based tests target overdispersion specifically.
The R package PseudoPoissonGoF (under development) automates fitting and all proposed GoF procedures (Veeranna et al., 2023).
6. Extensions and Generalizations
Symmetric Poisson Conditional Models
Allowing both conditionals to be Poisson (so neither marginal is Poisson except in independence) leads to the symmetric or "bivariate Poisson-conditional" model: with . The corresponding joint pmf is
where is a normalizing constant requiring summation to infinite limits. Correlation is negative for , and zero under independence () (Arnold et al., 2023).
Nonlinear Curvature Models
Generalizing the conditional mean to bounded, nonlinear kernels (e.g., exponential, Lomax) enables modeling negative as well as positive correlation, and accommodates complex boundary behaviors as at . Akaike Information Criterion (AIC) comparisons demonstrate that such models can substantially outperform linear models, especially in cases where negative correlation is present (Lakhani, 18 Nov 2025). For example, setting in the exponential kernel induces strictly decreasing conditional means and negative dependence—a feature unavailable in linear forms.
7. Applications and Empirical Results
Pseudo-Poisson models are particularly advantageous where one variable is equi-dispersed (truly Poisson) and the other is over-dispersed. Empirical case studies include:
- Health and Retirement Study: number of chronic conditions, health-care utilizations; is nearly equi-dispersed, slightly over-dispersed, estimated parameters , , (Arnold et al., 2020).
- Interstate-95 Traffic Accidents: fatalities, number of injury accidents; both margins over-dispersed, and AIC comparison favors the model that treats as the marginal and as Poisson (Arnold et al., 2020).
- Negative Correlation: Simulation studies with exponential-curvature conditional mean (e.g., ) fit negative correlations () that linear pseudo-Poisson models cannot (Lakhani, 18 Nov 2025).
Pseudo-Poisson models are widely used in epidemiology, traffic safety, and health statistics due to analytic tractability, flexibility in dependence structure, and direct parameter interpretation.
References:
- (Arnold et al., 2020) Statistical Inference for distributions with one Poisson conditional
- (Lakhani, 18 Nov 2025) Pseudo-Poisson Distributions with Nonlinear Conditional Rates
- (Arnold et al., 2023) On classical and Bayesian inference for bivariate Poisson conditionals distributions
- (Veeranna et al., 2023) Goodness of fit tests for the pseudo-Poisson distribution
Sponsored by Paperpile, the PDF & BibTeX manager trusted by top AI labs.
Get 30 days free