
Robust Bayesian Inference for Berkson and Classical Measurement Error Models (2306.01468v2)

Published 2 Jun 2023 in stat.ME and stat.ML

Abstract: Measurement error occurs when a covariate influencing a response variable is corrupted by noise. This can lead to misleading inference outcomes, particularly in problems where accurately estimating the relationship between covariates and response variables is crucial, such as causal effect estimation. Existing methods for dealing with measurement error often rely on strong assumptions such as knowledge of the error distribution or its variance and availability of replicated measurements of the covariates. We propose a Bayesian Nonparametric Learning framework that is robust to mismeasured covariates, does not require the preceding assumptions, and can incorporate prior beliefs about the error distribution. This approach gives rise to a general framework that is suitable for both Classical and Berkson error models via the appropriate specification of the prior centering measure of a Dirichlet Process (DP). Moreover, it offers flexibility in the choice of loss function depending on the type of regression model. We provide bounds on the generalization error based on the Maximum Mean Discrepancy (MMD) loss which allows for generalization to non-Gaussian distributed errors and nonlinear covariate-response relationships. We showcase the effectiveness of the proposed framework versus prior art in real-world problems containing either Berkson or Classical measurement errors.
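To make the distinction between the two error models concrete, the following is a minimal simulation sketch, not the method proposed in the paper. It assumes a linear response, Gaussian covariates and Gaussian errors, and uses ordinary least squares in place of the Bayesian nonparametric procedure; the function names, sample size, slope, and noise scales are all illustrative choices. Under Classical error the observed covariate is the true covariate plus noise (W = X + U), whereas under Berkson error the true covariate is the observed one plus noise (X = W + U).

```python
import numpy as np


def ols_slope(x, y):
    """Least-squares slope of y regressed on x, with an intercept."""
    x_centered = x - x.mean()
    return float(x_centered @ (y - y.mean()) / (x_centered @ x_centered))


def simulate(n=20000, beta=2.0, sigma_u=1.0, seed=0):
    """Return naive OLS slopes under Classical and Berkson error.

    All settings here (n, beta, sigma_u, noise scale 0.5) are assumed
    values for illustration, not taken from the paper.
    """
    rng = np.random.default_rng(seed)

    # Classical error: the true covariate X is drawn first and we only
    # observe a noisy copy W = X + U. The response depends on X.
    x_true = rng.normal(0.0, 1.0, n)
    w_classical = x_true + rng.normal(0.0, sigma_u, n)
    y_classical = beta * x_true + rng.normal(0.0, 0.5, n)

    # Berkson error: the observed covariate W is drawn first and the
    # true covariate scatters around it, X = W + U.
    w_berkson = rng.normal(0.0, 1.0, n)
    x_berkson = w_berkson + rng.normal(0.0, sigma_u, n)
    y_berkson = beta * x_berkson + rng.normal(0.0, 0.5, n)

    return ols_slope(w_classical, y_classical), ols_slope(w_berkson, y_berkson)


if __name__ == "__main__":
    classical, berkson = simulate()
    print(f"true slope: 2.0, classical: {classical:.2f}, berkson: {berkson:.2f}")
```

With these settings the Classical slope is attenuated toward zero by roughly the factor Var(X) / (Var(X) + Var(U)) = 0.5, while the Berkson slope stays close to the true value but with inflated residual variance. This clean separation holds only for the linear case sketched here; in nonlinear models Berkson error generally introduces bias as well.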

