LogitMap Pairs in Categorical Analysis
- LogitMap Pairs are explicit correspondences between parameterizations in log-linear and logistic models, enabling systematic transfer of estimates and inferential properties.
- They employ algebraic mappings, such as incidence matrices with corner-point constraints, to relate log-linear parameters with logistic coefficients and ensure equivalent MLEs and deviance measures.
- Extended applications include latent-feature models for dyadic prediction and RC association models, offering robust frameworks for model selection and hypothesis testing in categorical data analysis.
LogitMap Pairs refer to mathematical correspondences between parameterizations and inferential structures in different classes of log-linear and logit (logistic) models, as well as the explicit mappings in more recent latent-feature log-linear approaches and generalized row–column (RC) association models for categorical data. These pairings originate from foundational results in contingency table analysis and have been systematically formalized to enable the transfer of model parameters, interval estimates, deviances, and inferences between distinct but structurally related models for categorical and dyadic data.
1. Classical LogitMap Pairs: Log-linear and Logistic Regression Correspondence
The archetypal LogitMap Pair arises from the correspondence between Poisson log-linear models for multidimensional contingency tables containing at least one binary factor , and conditional binary logistic regression models where is the binary response. If denotes the set of factors, the binary response, and , counts are arranged as , indexing over all factor combinations.
Fitting a saturated Poisson GLM with main effects and all interactions among and all such interactions with produces parameter estimates (log-linear parameters) fulfilling
with the requirement that the model incorporates all joint structure among and their interactions with . The implied conditional odds for then possess the exact structure of a logistic regression:
$\logit\ \Pr(Y=1|X=j_2\dots j_p) = X_{\text{logit}} \beta$
where is a vector of logistic regression parameters. No reduced log-linear model yields that identical logistic form unless all interactions are included (Jing et al., 2017).
2. Formal Parameter Mapping and Structural Incidence Relations
Critical to the LogitMap Pair is the algebraic mapping from the log-linear parameters to the logistic regression coefficients . With corner-point constraints and suitable indexing:
- represents the overall intercept,
- indexes the effect for marginal subset , including main and interaction effects.
The mapping is realized as:
where is an incidence matrix effecting the difference between log-linear parameters that involve and those that do not:
For a concrete ,
$\logit\ \Pr(Y=1|X=x) = \sum_{u \subseteq X} [\lambda_{Y \cup u} - \lambda_u] = \sum_{u \subseteq X} \beta_u 1_{(x\, \text{has}\, u)}$
This algebraic structure ensures that effect estimates for in the logistic layer are linear combinations of the relevant log-linear effects, enabling exact parameter-pair correspondences (Jing et al., 2017).
3. MLE, Inferential Equivalence, and Deviance Correspondence
The maximum likelihood estimates (MLEs) of and under the conditions described above coincide for the mapped components: . This extends to standard errors, as the relevant Fisher information block in the log-linear model exactly matches the information in the logistic model for those effects involving . Thus, asymptotically, the Wald confidence intervals for coincide with those for the corresponding coefficients.
Deviance equivalence between the log-linear (Poisson) and logistic (product-binomial) models holds precisely when no cell merging occurs in the logistic data: . Each log-likelihood contribution matches term by term if the logistic regression is carried out without collapsing cells in the contingency structure (Jing et al., 2017).
4. LogitMap in Dyadic Prediction: Latent-Feature Log-linear Models
Expanding the LogitMap concept, latent-feature log-linear (LFL) models for dyadic prediction define a LogitMap from a dyad , possibly endowed with side-information , to the space of conditional label probabilities. For dyads indexed by with label (of cardinality ), the LFL approach assigns each label a row-latent matrix , column-latent matrix , per-label bias , and side-weight . The natural parameter is
with multinomial logit probabilities
This framework generalizes LogitMap Pairs to dyadic data, allowing both identifier-only and side-information-rich regimes. The approach enables learning well-calibrated, low-rank log-linear predictors for nominal and ordinal outcomes with discriminative objectives, maintaining scalability and resistance to sample-selection bias (Menon et al., 2010).
5. Extended LogitMap Pairs in RC Association Models
A further generalization is realized in the extended class of row-column (RC) association models for two-way tables. Here, arbitrary pairs of logit functions—local (), global/cumulative (), continuation (), and reverse-continuation ()—can be paired to define odds-ratio-type interactions on different scales via -divergence functions:
with the generalized odds-ratio at cutpoints and derived from a convex . Reconstruction theorems guarantee that, given all marginal logits and the interaction matrix (subject to a rank- constraint), the joint probability table is uniquely identified. This explicitly defines LogitMap Pairs between sets of marginal logit effects and pairs of association parameters, covering diverse logit types and scaling choices (Forcina et al., 2019).
6. Uniqueness, Structural Constraints, and Positive Association
Within the extended RC association framework, the LogitMap is a bijective correspondence: the pair of marginal logits and -scaled logit-pairwise interactions determine the full joint distribution uniquely. This is guaranteed by properties of the function , the additive Lagrange structure in the representation theorems, and supporting monotonicity arguments.
Rank constraints on the interaction matrix , implemented via specific algebraic operators, enable fitting parsimonious association structures (e.g., rank-1 for strong monotone dependence). Nonnegative induce positive association properties, with tractable stochastic ordering implications depending on the logit type pair—e.g., quadrant orderings for and strong dependence for (Forcina et al., 2019).
7. Applications and Modeling Implications
The LogitMap Pair correspondence is leveraged in classical contingency table analysis, dyadic prediction scenarios (collaborative filtering, link prediction), and ordinal association studies (e.g., social mobility). In each case, the LogitMap yields a structured mapping between different parameterizations and inferential frameworks, enabling:
- Transfer of MLEs and their asymptotic properties,
- Direct comparison of deviance and fit statistics,
- Efficient model selection and hypothesis testing,
- Robustness in the presence of sample selection or structural zeros.
Notable applications include the coronary-heart-disease cohort re-analysis, where the mapped estimates and deviance were shown to coincide exactly, and the British social mobility study, where extended RC models with selected logit-pairings and rank constraints provided robust, interpretable inferences (Jing et al., 2017, Forcina et al., 2019).
In summary, LogitMap Pairs formalize a robust, algebraically explicit, and practically valuable connection between diverse models for categorical data, enabling precise inferential translation across log-linear, logistic, latent-feature, and generalized association frameworks.