Papers
Topics
Authors
Recent
Assistant
AI Research Assistant
Well-researched responses based on relevant abstracts and paper content.
Custom Instructions Pro
Preferences or requirements that you'd like Emergent Mind to consider when generating responses.
Gemini 2.5 Flash
Gemini 2.5 Flash 76 tok/s
Gemini 2.5 Pro 58 tok/s Pro
GPT-5 Medium 26 tok/s Pro
GPT-5 High 25 tok/s Pro
GPT-4o 81 tok/s Pro
Kimi K2 206 tok/s Pro
GPT OSS 120B 465 tok/s Pro
Claude Sonnet 4 35 tok/s Pro
2000 character limit reached

Identification enhanced generalised linear model estimation with nonignorable missing outcomes (2204.10508v4)

Published 22 Apr 2022 in stat.ME, math.ST, and stat.TH

Abstract: Missing data often result in undesirable bias and loss of efficiency. These issues become substantial when the response mechanism is nonignorable, meaning that the response model depends on unobserved variables. To manage nonignorable nonresponse, it is necessary to estimate the joint distribution of unobserved variables and response indicators. However, model misspecification and identification issues can prevent robust estimates, even with careful estimation of the target joint distribution. In this study, we modeled the distribution of the observed parts and derived sufficient conditions for model identifiability, assuming a logistic regression model as the response mechanism and generalized linear models as the main outcome model of interest. More importantly, the derived sufficient conditions do not require any instrumental variables, which are often assumed to guarantee model identifiability but cannot be practically determined beforehand. To analyze missing data in applications, we propose practical guidelines and sensitivity analysis to determine the response mechanism. Furthermore, we present the performance of the proposed estimators in numerical studies and apply the proposed method to two sets of real data: exit polls from the 19th South Korean election and public data collected from the Korean Survey of Household Finances and Living Conditions.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (28)
  1. Baraldi, A. N., and Enders, C. K. (2010), “An introduction to modern missing data analyses,” Journal of school psychology, 48(1), 5–37.
  2. Cui, X., Guo, J., and Yang, G. (2017), “On the identifiability and estimation of generalized linear models with parametric nonignorable missing data mechanism,” Computational Statistics & Data Analysis, 107, 64–80.
  3. Franks, A. M., Airoldib, E. M., and Rubin, D. B. (2020), “Nonstandard conditionally specified models fornonignorable missing data,” Proceedings of the National Academy of Sciences, 117(32), 19045–19053.
  4. Hazewinkel, A.-D., Bowden, J., Wade, K. H., Palmer, T., Wiles, N. J., and Tilling, K. (2022), “Sensitivity to missing not at random dropout in clinical trials: Use and interpretation of the trimmed means estimator,” Statistics in Medicine, 41(8), 1462–1481.
  5. Hicks, S. C., Townes, F. W., Teng, M., and Irizarry, R. A. (2018), “Missing data and technical variability in single-cell RNA-sequencing experiments,” Biostatistics, 19(4), 562–578.
  6. Ibrahim, J., Chen, M.-H., Lipsitz, S., and Herring, A. (2005), “Missing-Data Methods for Generalized Linear Models,” Journal of the American Statistical Association, 100, 332–347.
  7. Im, J., Cho, I.-H., and Kim, J.-K. (2018), “FHDI: An R Package for Fractional Hot Deck Imputation,” The R Journal, pp. 140–154.
  8. Im, J., and Kim, S. (2017), “Multiple imputation for nonignorable missing data,” Journal of the Korean Statistical Society, 46(4), 583–592.
  9. Kim, J. K. (2011), “Parametric fractional imputation for missing data analysis,” Biometrika, 98(1), 119–132.
  10. Kim, J. K., and Yu, C. L. (2011), “A semiparametric estimation of mean functionals with nonignorable missing data,” Journal of the American Statistical Association, 106(493), 157–165.
  11. Leurent, B., Gomes, M., Faria, R., Morris, S., Grieve, R., and Carpenter, J. R. (2018), “Sensitivity analysis for not-at-random missing data in trial-based cost-effectiveness analysis: a tutorial,” Pharmacoeconomics, 36(8), 889–901.
  12. Li, M., Ma, Y., and Zhao, J. (2021), “Efficient estimation in a partially specified nonignorable propensity score model,” Computational Statistics & Data Analysis, p. 107322.
  13. Liu, C. (2004), “Robit regression: a simple robust alternative to logistic and probit regression,” Applied Bayesian Modeling and Casual Inference from Incomplete-Data Perspectives, pp. 227–238.
  14. Miao, W., Ding, P., and Geng, Z. (2016), “Identifiability of normal and normal mixture models with nonignorable missing data,” Journal of the American Statistical Association, 111(516), 1673–1683.
  15. Miao, W., and Tchetgen Tchetgen, E. J. (2016), “On varieties of doubly robust estimators under missingness not at random with a shadow variable,” Biometrika, 103(2), 475–482.
  16. Morikawa, K., and Kim, J. K. (2021), “Semiparametric optimal estimation with nonignorable nonresponse data,” The Annals of Statistics, 49(5), 2991–3014.
  17. Parent, M. C. (2013), “Handling item-level missing data: Simpler is just as good,” The Counseling Psychologist, 41(4), 568–600.
  18. Peng, J., Hahn, J., and Huang, K.-W. (2023), “Handling missing values in information systems research: A review of methods and assumptions,” Information Systems Research, 34(1), 5–26.
  19. Potdar, K., Pardawala, T. S., and Pai, C. D. (2017), “A comparative study of categorical variable encoding techniques for neural network classifiers,” International journal of computer applications, 175(4), 7–9.
  20. Riddles, M. K., Kim, J. K., and Im, J. (2016), “A propensity-score-adjustment method for nonignorable nonresponse,” Journal of Survey Statistics and Methodology, 4(2), 215–245.
  21. Sauerbrei, W., Abrahamowicz, M., Altman, D. G., le Cessie, S., Carpenter, J., and initiative, S. (2014), “STRengthening analytical thinking for observational studies: the STRATOS initiative,” Statistics in medicine, 33(30), 5413–5432.
  22. Shao, J., and Wang, L. (2016), “Semiparametric inverse propensity weighting for nonignorable missing data,” Biometrika, 103(1), 175–187.
  23. Shetty, S., Ma, Y., and Zhao, J. (2021), “Avoid Estimating the Unknown Function in a Semiparametric Nonignorable Propensity Model,” arXiv preprint arXiv:2108.04966, .
  24. Wang, H., Lu, Z., and Liu, Y. (2021), “Score test for missing at random or not,” arXiv preprint arXiv:2105.12921, .
  25. Wang, S., Shao, J., and Kim, J. K. (2014), “An instrumental variable approach for identification and estimation with nonignorable nonresponse,” Statistica Sinica, 24, 1097–1116.
  26. Zhao, J., and Ma, Y. (2018), “Optimal pseudolikelihood estimation in the analysis of multivariate missing data with nonignorable nonresponse,” Biometrika, 105(2), 479–486.
  27. Zhao, J., and Shao, J. (2015), “Semiparametric pseudo-likelihoods in generalized linear models with nonignorable missing data,” Journal of the American Statistical Association, 110(512), 1577–1590.
  28. Zou, H. (2006), “The adaptive lasso and its oracle properties,” Journal of the American statistical association, 101(476), 1418–1429.
Citations (1)

Summary

We haven't generated a summary for this paper yet.

Lightbulb On Streamline Icon: https://streamlinehq.com

Continue Learning

We haven't generated follow-up questions for this paper yet.

List To Do Tasks Checklist Streamline Icon: https://streamlinehq.com

Collections

Sign up for free to add this paper to one or more collections.

X Twitter Logo Streamline Icon: https://streamlinehq.com

Tweets

This paper has been mentioned in 1 post and received 0 likes.

Don't miss out on important new AI/ML research

See which papers are being discussed right now on X, Reddit, and more:

“Emergent Mind helps me see which AI papers have caught fire online.”

Philip

Philip

Creator, AI Explained on YouTube