
Structured Mixture of Continuation-ratio Logits Models for Ordinal Regression (2211.04034v2)

Published 8 Nov 2022 in stat.ME

Abstract: We develop a nonparametric Bayesian modeling approach to ordinal regression based on priors placed directly on the discrete distribution of the ordinal responses. The prior probability models are built from a structured mixture of multinomial distributions. We leverage a continuation-ratio logits representation to formulate the mixture kernel, with mixture weights defined through the logit stick-breaking process that incorporates the covariates through a linear function. The implied regression functions for the response probabilities can be expressed as weighted sums of parametric regression functions, with covariate-dependent weights. Thus, the modeling approach achieves flexible ordinal regression relationships, avoiding linearity or additivity assumptions in the covariate effects. Model flexibility is formally explored through the Kullback-Leibler support of the prior probability model. A key model feature is that the parameters for both the mixture kernel and the mixture weights can be associated with a continuation-ratio logits regression structure. Hence, an efficient and relatively easy to implement posterior simulation method can be designed, using Pólya-Gamma data augmentation. Moreover, the model is built from a conditional independence structure for category-specific parameters, which results in additional computational efficiency gains through partial parallel sampling. In addition to the general mixture structure, we study simplified model versions that incorporate covariate dependence only in the mixture kernel parameters or only in the mixture weights. For all proposed models, we discuss approaches to prior specification and develop Markov chain Monte Carlo methods for posterior simulation. The methodology is illustrated with several synthetic and real data examples.
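
The structure described in the abstract, response probabilities written as a covariate-weighted sum of continuation-ratio logits kernels, can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes a finite truncation to L mixture components, linear predictors in the covariates for both kernel and weights, and hypothetical names (`stick_breaking`, `mixture_response_probs`, `beta`, `gamma`) and made-up coefficient values chosen only for illustration. The same sequential stick-breaking map is applied twice: once to turn kernel logits into category probabilities, and once to turn weight logits into mixture weights.

```python
import math


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    ez = math.exp(z)
    return ez / (1.0 + ez)


def stick_breaking(logits):
    """Map K-1 logits to K probabilities by sequential stick-breaking.

    Element j receives sigmoid(logit_j) times the mass left over by
    elements 1..j-1; the last element receives whatever mass remains.
    """
    probs, remaining = [], 1.0
    for z in logits:
        p = sigmoid(z)
        probs.append(remaining * p)
        remaining *= 1.0 - p
    probs.append(remaining)
    return probs


def dot(a, b):
    return sum(ai * bi for ai, bi in zip(a, b))


def mixture_response_probs(x, beta, gamma):
    """Response probabilities for covariate vector x (illustrative only).

    beta[l][j] : coefficients for the j-th continuation-ratio logit of
                 component l (C-1 vectors per component, L components).
    gamma[l]   : coefficients for the l-th weight logit (L-1 vectors);
                 the last component takes the remaining weight, which is
                 an assumed truncation.
    """
    weights = stick_breaking([dot(x, g) for g in gamma])
    n_categories = len(beta[0]) + 1
    probs = [0.0] * n_categories
    for w, beta_l in zip(weights, beta):
        kernel = stick_breaking([dot(x, b) for b in beta_l])
        for j, p in enumerate(kernel):
            probs[j] += w * p
    return probs


# Hypothetical setup: intercept plus one covariate, L = 3 components,
# C = 4 ordinal categories. All coefficient values are invented.
x = [1.0, 0.5]
beta = [
    [[0.2, -1.0], [0.0, 0.5], [-0.3, 0.8]],
    [[-1.0, 0.4], [0.6, -0.2], [0.1, 0.1]],
    [[0.5, 0.5], [-0.7, 1.2], [0.9, -0.4]],
]
gamma = [[0.3, -0.6], [-0.2, 0.9]]
probs = mixture_response_probs(x, beta, gamma)
```

Because both the weights and each kernel sum to one, `probs` is a valid probability vector over the four categories for any `x`. Fixing `gamma` to intercept-only vectors or sharing `beta` across covariate values would correspond, loosely, to the simplified versions mentioned in the abstract, in which covariates enter only the kernel or only the weights.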
