Papers
Topics
Authors
Recent
Search
2000 character limit reached

Global Moran’s I in Spatial Analysis

Updated 2 April 2026
  • Global Moran’s I is a quadratic-form statistic that measures global spatial autocorrelation, indicating clustering when positive, randomness near zero, and dispersion when negative.
  • It is computed using a spatial weight matrix and standardized data, with its interpretation supported by spectral bounds and regression-based eigenvalue methods.
  • The index underpins spatial econometric modeling, offering applications in permutation testing, multi-scale analysis, and connections to information theory through entropy measures.

Global Moran’s I is a quadratic-form statistic that serves as the classical global index of spatial autocorrelation for a univariate attribute distributed over a spatial domain. Its rigorous mathematical construction, spectral properties, relationships to alternative spatial statistics, and role in spatial econometric modeling make it a foundational tool in modern spatial analysis.

1. Definition and Mathematical Structure

Let x=(x1,,xN)x = (x_1, \ldots, x_N)^\top be a real-valued variable observed on NN spatial units, with sample mean xˉ\bar x and (possibly sample or population) variance σx2=i=1N(xixˉ)2/N\sigma_x^2 = \sum_{i=1}^N (x_i - \bar x)^2/N. Fix a spatial-weight matrix W=[wij]W = [w_{ij}] with zero diagonal (wii=0w_{ii}=0) and i,jwij=1\sum_{i,j} w_{ij} = 1 (global normalization). The z-score standardized vector is z=(xxˉ1)/σxz = (x - \bar x \mathbf{1})/\sigma_x, satisfying z1=0z^\top \mathbf{1} = 0, zz=Nz^\top z = N.

The global Moran’s I is defined as

NN0

which, under global normalization and NN1, reduces to NN2 (Chen, 27 Aug 2025).

Moran’s I quantifies the degree of spatial correlation: NN3 signals clustering of similar values, NN4 anti-clustering (checkerboard-like alternation), and NN5 spatial randomness.

2. Spectral Properties, Bounds, and Interpretation

Moran’s I can be interpreted as a Rayleigh quotient, with attainable values determined by the spectral properties of NN6. For symmetric NN7, extremal values satisfy

NN8

where NN9 and xˉ\bar x0 are the smallest and largest eigenvalues of xˉ\bar x1 (projected onto the zero-mean subspace) (Maruyama, 2015, Chen, 2022). If xˉ\bar x2 is globally normalized, the “absolute” theoretical bounds are xˉ\bar x3, although in practice the achievable range is xˉ\bar x4, intersected with xˉ\bar x5 (Chen, 2022). Row-standardization of xˉ\bar x6 can enforce xˉ\bar x7, but general weighting schemes (including irregular degree graphs or distance-based weights) may yield I-values outside the unit interval (Duchin et al., 2021).

A monotone transformation of xˉ\bar x8 to xˉ\bar x9 rescales it into σx2=i=1N(xixˉ)2/N\sigma_x^2 = \sum_{i=1}^N (x_i - \bar x)^2/N0, where σx2=i=1N(xixˉ)2/N\sigma_x^2 = \sum_{i=1}^N (x_i - \bar x)^2/N1 is computed from the spectral bounds, yielding a direct interpretational comparability to Pearson’s correlation (Maruyama, 2015).

3. Structural Decomposition and Connection with Other Indices

Chen introduced an exact decomposition of global Moran’s I in terms of the Getis–Ord indices and the size correlation function (Chen, 27 Aug 2025):

σx2=i=1N(xixˉ)2/N\sigma_x^2 = \sum_{i=1}^N (x_i - \bar x)^2/N2

where

  • σx2=i=1N(xixˉ)2/N\sigma_x^2 = \sum_{i=1}^N (x_i - \bar x)^2/N3 is the normalized size vector,
  • σx2=i=1N(xixˉ)2/N\sigma_x^2 = \sum_{i=1}^N (x_i - \bar x)^2/N4 is the size-correlation function,
  • σx2=i=1N(xixˉ)2/N\sigma_x^2 = \sum_{i=1}^N (x_i - \bar x)^2/N5 is the global Getis–Ord index,
  • σx2=i=1N(xixˉ)2/N\sigma_x^2 = \sum_{i=1}^N (x_i - \bar x)^2/N6, σx2=i=1N(xixˉ)2/N\sigma_x^2 = \sum_{i=1}^N (x_i - \bar x)^2/N7 is the sum of local Getis–Ord indices.

This structural decomposition reveals that global spatial autocorrelation depends on four distinct components: the global Getis–Ord index, the sum over local Getis–Ord indices, the number of spatial elements σx2=i=1N(xixˉ)2/N\sigma_x^2 = \sum_{i=1}^N (x_i - \bar x)^2/N8, and σx2=i=1N(xixˉ)2/N\sigma_x^2 = \sum_{i=1}^N (x_i - \bar x)^2/N9—the correlation structure of the normalized attribute. The local Getis–Ord indices are equivalent to gravity-model–based potential indices, establishing the association of Moran’s I with spatial interaction: weak spatial interaction implies weak global autocorrelation (Chen, 27 Aug 2025). The relationship is strictly nonlinear, and the interplay between W=[wij]W = [w_{ij}]0 and W=[wij]W = [w_{ij}]1 (and, implicitly, the gravity model) governs the magnitude and sign of W=[wij]W = [w_{ij}]2.

4. Algebraic, Regression, and Eigenvalue Perspectives

Moran’s I admits multiple algebraic forms:

  • Quadratic form: W=[wij]W = [w_{ij}]3;
  • Outer-product eigen formulation: W=[wij]W = [w_{ij}]4 (so W=[wij]W = [w_{ij}]5 is a nonzero eigenvalue of W=[wij]W = [w_{ij}]6);
  • Inner-product formulation: W=[wij]W = [w_{ij}]7, providing regression-model–based interpretations (Chen, 2022, Chen, 2022).

Regression techniques can compute W=[wij]W = [w_{ij}]8 as the slope of the least-squares fit of W=[wij]W = [w_{ij}]9 on wii=0w_{ii}=00, either with or without intercept, and thus directly connect autocorrelation measurement with spatial autoregressive models. The inner-product equation wii=0w_{ii}=01 is formally the inverse of the simplest noise-free spatial autoregressive (SAR) model wii=0w_{ii}=02, with the explicit relationship wii=0w_{ii}=03 in that limiting case (Chen, 2022). This reconciliation aligns spatial autocorrelation estimation and the SAR coefficient within a unified algebraic framework and clarifies stability domains for SAR estimation via eigen-analysis (Chen, 2022, Chen, 2022).

5. Weight Matrix Choices and Their Implications

The choice and normalization of the weight matrix wii=0w_{ii}=04 are fundamental to the behavior of global Moran’s I (Chen, 2016, Duchin et al., 2021). Standard options include:

  • Binary adjacency (e.g., rook or queen contiguity in planar maps),
  • Row-standardized versions (wii=0w_{ii}=05) so that rows sum to one,
  • Inverse distance decay (wii=0w_{ii}=06 or wii=0w_{ii}=07),
  • Graph Laplacian (wii=0w_{ii}=08), which inverts the interpretation of I,
  • Doubly-stochastic matrices (wii=0w_{ii}=09), e.g. Metropolis–Hastings random-walk kernels for robust boundedness i,jwij=1\sum_{i,j} w_{ij} = 10 and interpretable Markov variance-retention properties (Duchin et al., 2021).

The spectrum of i,jwij=1\sum_{i,j} w_{ij} = 11 and its degree of regularity (homogeneity of neighbor counts) directly control the attainable range of i,jwij=1\sum_{i,j} w_{ij} = 12 and the stability of its interpretation (Duchin et al., 2021, Maruyama, 2015). Irregular graphs can yield arbitrarily extreme i,jwij=1\sum_{i,j} w_{ij} = 13 values for suitably concentrated attribute vectors.

6. Statistical Inference and Information-Theoretic Interpretation

Under the null hypothesis of spatial randomness (random permutations of spatial labels), the expected value is i,jwij=1\sum_{i,j} w_{ij} = 14 (Arthur, 2024, Tillé et al., 2017). Analytical variance expressions exist, depending on i,jwij=1\sum_{i,j} w_{ij} = 15 and higher moments of i,jwij=1\sum_{i,j} w_{ij} = 16. Both parametric (normal approximation) and permutation-based inference for i,jwij=1\sum_{i,j} w_{ij} = 17 are standard: the latter matches the exact reference distribution under exchangeability (Kashlak et al., 2020, Wang et al., 2021). Permutation-free closed-form p-value bounds using concentration inequalities have been developed for computational efficiency at scale (Kashlak et al., 2020, Wang et al., 2021).

Recent developments connect Moran’s I to self-information (surprisal) and entropy. The probability of observing a given value i,jwij=1\sum_{i,j} w_{ij} = 18 under random spatial arrangement defines i,jwij=1\sum_{i,j} w_{ij} = 19—yielding an information-theoretic measure of spatial surprisingness. High autocorrelation (high z=(xxˉ1)/σxz = (x - \bar x \mathbf{1})/\sigma_x0) corresponds to high self-information and low entropy, i.e., the data are compressible and “surprising” given the underlying spatial arrangement (Wang et al., 2024). This provides a bridge between spatial statistics and information theory, and motivates loss functions in spatial learning or ecological inference.

7. Extensions, Applications, and Analytical Generalizations

Analytical generalizations include scale-dependent spatial autocorrelation functions (SACF), constructed by parametrizing z=(xxˉ1)/σxz = (x - \bar x \mathbf{1})/\sigma_x1 with a spatial lag z=(xxˉ1)/σxz = (x - \bar x \mathbf{1})/\sigma_x2 (e.g., “relative step” function) (Chen, 2020). This yields cumulative and density SACFs, analogous to temporal autocorrelation functions, and admits partial SACF versions via spatial Yule–Walker equations. The resulting framework allows multi-scale and fractal analysis of spatial autocorrelation, with implications for identifying correlation dimensions in geographical systems (Chen, 2020).

Empirical studies demonstrate the versatility of global Moran’s I: from quantifying the tempo-spatial clustering of infectious disease (Deeb, 2020), measuring spatial balance in sample surveys (Tillé et al., 2017), to providing a null-preserving resampling algorithm that fixes observed autocorrelation for hypothesis testing (Arthur, 2024). The index is foundational for exploratory spatial data analysis, spatial panel model diagnostics, and for distinguishing clustering, randomness, and dispersion in heterogeneous spatial systems (Chen, 2016, Wang et al., 2021).


References

(Chen, 27 Aug 2025): "Structural Decomposition of Moran's Index by Getis-Ord's Indices" (Duchin et al., 2021): "Measuring Segregation via Analysis on Graphs" (Wang et al., 2024): "Probing the Information Theoretical Roots of Spatial Dependence Measures" (Arthur, 2024): "A General Method for Resampling Autocorrelated Spatial Data" (Chen, 2016): "New Approaches for Calculating Moran's Index of Spatial Autocorrelation" (Chen, 2020): "An Analytical Process of Spatial Autocorrelation Functions Based on Moran's Index" (Maruyama, 2015): "An alternative to Moran's I for spatial autocorrelation" (Deeb, 2020): "Spatial autocorrelation and the dynamics of the mean center of COVID-19 infections in Lebanon" (Chen, 2022): "Spatial autocorrelation equation based on Moran's index" (Chen, 2022): "Deriving two sets of bounds of Moran's index by conditional extremum method" (Kashlak et al., 2020): "Computation-free Nonparametric testing for Local and Global Spatial Autocorrelation with application to the Canadian Electorate" (Wang et al., 2021): "Local Statistics for Spatial Panel Models with Application to the US Electorate" (Chen, 2022): "Derivation of an Inverse Spatial Autoregressive Model for Estimating Moran's Index" (Tillé et al., 2017): "Measuring the spatial balance of a sample: A new measure based on the Moran's I index"

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Global Moran’s I.