Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
169 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

A Marginal Maximum Likelihood Approach for Hierarchical Simultaneous Autoregressive Models with Missing Data (2403.17257v2)

Published 25 Mar 2024 in stat.ME and stat.CO

Abstract: Efficient estimation methods for simultaneous autoregressive (SAR) models with missing data in the response variable have been well-explored in the literature. A common practice is to introduce measurement error into SAR models to separate the noise component from the spatial process. However, prior research has not considered incorporating measurement error into SAR models with missing data. Maximum likelihood estimation for such models, especially with large datasets, poses significant computational challenges. This paper proposes an efficient likelihood-based estimation method, the marginal maximum likelihood (ML), for estimating SAR models on large datasets with measurement errors and a high percentage of missing data in the response variable. The spatial error model (SEM) and the spatial autoregressive model (SAM), two popular SAR model types, are considered. The missing data mechanism is assumed to follow a missing at random (MAR) pattern. We propose a fast method for marginal ML estimation with a computational complexity of $O(n{3/2})$, where $n$ is the total number of observations. This complexity applies when the spatial weight matrix is constructed based on a local neighbourhood structure. The effectiveness of the proposed methods is demonstrated through simulations and real-world data applications.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (53)
  1. Allison, P. D. (2001). Missing data. Sage publications.
  2. Peer effects in European primary schools: Evidence from the progress in international reading literacy study. Journal of Labor Economics, 27(3):315–348.
  3. Does school integration generate peer effects? evidence from Boston’s Metco program. American Economic Review, 94(5):1613–1634.
  4. Anselin, L. (1988). Spatial econometrics: methods and models, volume 4. Springer Science & Business Media.
  5. Hierarchical spatial models. Encyclopedia of GIS, 14(1):425–431.
  6. Matrix: Sparse and Dense Matrix Classes and Methods. R package version 1.5-3.
  7. Spatial auto-correlation and auto-regressive models estimation from sample survey data. Biometrical Journal, 62(6):1494–1507.
  8. Bishop, C. M. (1998). Latent variable models. In Learning in graphical models, pages 371–403. Springer.
  9. Bivand, R. (2010). Comparing estimation methods for spatial econometrics techniques using R. NHH Dept. of Economics Discussion Paper, (26).
  10. Spatial data analysis with R-INLA with some extensions. Journal of statistical software, 63:1–31.
  11. spData: Datasets for Spatial Analysis. R package version 2.2.2.
  12. The SAR model for very large datasets: a reduced rank approach. Econometrics, 3(2):317–338.
  13. Fixed rank kriging for very large spatial data sets. Journal of the Royal Statistical Society Series B: Statistical Methodology, 70(1):209–226.
  14. Davis, T. A. (2006). Direct Methods for Sparse Linear Systems. Society for Industrial and Applied Mathematics.
  15. Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society: Series B (Methodological), 39(1):1–22.
  16. Spatial autoregressive models for geographically hierarchical data structures. Geographical Analysis, 47(2):173–191.
  17. Enders, C. K. (2003). Using the Expectation Maximization algorithm to estimate coefficient alpha for scales with item-level missing data. Psychological Methods, 8(3):322.
  18. Ford, W. (2015). Chapter 11 - Gaussian elimination and the LU decomposition. In Ford, W., editor, Numerical Linear Algebra with Applications, pages 205–239. Academic Press, Boston.
  19. Crime and social interactions. The Quarterly Journal of Economics, 111(2):507–548.
  20. Estimating spatial econometrics models with Integrated Nested Laplace Approximation. Mathematics, 9(17):2044.
  21. Hepple, L. W. (1979). Bayesian analysis of the linear model with spatial dependence. In Exploratory and Explanatory Statistical Analysis of Spatial Data, pages 179–199. Springer.
  22. A generalized moments estimator for the autoregressive parameter in a spatial model. International Economic Review, 40(2):509–533.
  23. On the asymptotic distribution of the Moran I test statistic with applications. Journal of Econometrics, 104(2):219–257.
  24. Spatial models with spatially lagged dependent variables and incomplete data. Journal of Geographical Systems, 12(3):241–257.
  25. Fast cars. Journal of Statistical Computation and Simulation, 59(2):123–145.
  26. Lauritzen, S. L. (1995). The EM algorithm for graphical association models with missing data. Computational Statistics & Data Analysis, 19(2):191–201.
  27. Lee, L.-f. (2003). Best spatial two-stage least squares estimators for a spatial autoregressive model with autoregressive disturbances. Econometric Reviews, 22(4):307–335.
  28. Lee, L.-f. (2007). GMM and 2SLS estimation of mixed regressive, spatial autoregressive models. Journal of Econometrics, 137(2):489–514.
  29. Introduction to spatial econometrics. Chapman and Hall/CRC.
  30. LeSage, J. P. (1997). Bayesian estimation of spatial autoregressive models. International Regional Science Review, 20(1-2):113–129.
  31. Models for spatially dependent missing data. The Journal of Real Estate Finance and Economics, 29(2):233–254.
  32. One-step estimation of spatial dependence parameters: Properties and extensions of the APLE statistic. Journal of Multivariate Analysis, 105(1):68–84.
  33. Statistical analysis with missing data, volume 793. John Wiley & Sons.
  34. Longstaff, F. A. (2010). The subprime credit crisis and contagion in financial markets. Journal of Financial Economics, 97(3):436–450.
  35. IPW-based robust estimation of the SAR model with missing data. Statistics & Probability Letters, 172:109065.
  36. A numerically efficient implementation of the expectation maximization algorithm for state space models. Applied Mathematics and Computation, 241:222–232.
  37. ESDA (exploratory spatial data analysis) of vegetation cover in urban areas—recognition of vulnerabilities for the management of resources in urban green infrastructure. Sustainability, 12(5):1933.
  38. Spatially varying SAR models and Bayesian inference for high-resolution lattice data. Annals of the Institute of Statistical Mathematics, 66(3):473–494.
  39. Ord, K. (1975). Estimation methods for models of spatial interaction. Journal of the American Statistical Association, 70(349):120–126.
  40. Pace, R. K. (1997). Performing large spatial regressions and autoregressions. Economics Letters, 54(3):283–291.
  41. Sparse spatial autoregressions. Statistics & Probability Letters, 33(3):291–297.
  42. Chebyshev approximation of log-determinants of spatial weight matrices. Computational Statistics & Data Analysis, 45(2):179–196.
  43. The Matrix Cookbook. Technical University of Denmark, 7(15):510.
  44. Rubin, D. B. (1976). Inference and missing data. Biometrika, 63(3):581–592.
  45. Gaussian Markov Random Fields: Theory and Applications. Chapman and Hall/CRC.
  46. Approximate Bayesian inference for latent Gaussian models by using Integrated Nested Laplace Approximations. Journal of the Royal Statistical Society: Series B (statistical methodology), 71(2):319–392.
  47. Su, L. (2012). Semiparametric GMM estimation of spatial autoregressive models. Journal of Econometrics, 167(2):543–560.
  48. Suesse, T. (2018a). Estimation of spatial autoregressive models with measurement error for large data sets. Computational Statistics, 33(4):1627–1648.
  49. Suesse, T. (2018b). Marginal maximum likelihood estimation of SAR models with missing data. Computational Statistics & Data Analysis, 120:98–110.
  50. Computational aspects of the EM algorithm for spatial econometric models with missing data. Journal of Statistical Computation and Simulation, 87(9):1767–1786.
  51. Analysis of determinants of mammalian species richness in South America using spatial autoregressive models. Ecography, 27(4):427–436.
  52. Spatial autoregressive models for statistical inference from ecological data. Ecological Monographs, 88(1):36–59.
  53. Estimation of spatial autoregressive models with randomly missing data in the dependent variable. The Econometrics Journal, 16(1):73–102.

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com