Papers
Topics
Authors
Recent
Search
2000 character limit reached

Local Moran’s I (LISA)

Updated 2 April 2026
  • Local Moran’s I (LISA) is a statistical measure that decomposes spatial autocorrelation into local values to identify clusters (hot-spots and cold-spots) and spatial outliers.
  • It utilizes various normalization techniques, such as global and row-normalized weights along with population variance standardization, ensuring consistency with global Moran’s I.
  • Inference is based on permutation testing and advanced visualization methods, making it widely applicable in spatial econometrics, epidemiology, and urban studies.

Local Moran’s I (LISA) provides a formal, spatially explicit decomposition of spatial autocorrelation by quantifying the association between a value at a geographic location and the values observed at its spatial neighbors. As a local indicator of spatial association, it extends the global Moran’s I statistic, producing a separate measure for each spatial unit (“location”) that can detect local clusters (“hot-spots” and “cold-spots”) as well as local spatial outliers. The rigorous mathematical foundation, multiple forms of weights normalization, inferential permutation testing, and recent advances in visualization and normalization schemes underpin its broad uptake in geospatial modeling, spatial econometrics, epidemiology, and other fields.

1. Mathematical Definitions and Normalization Schemes

The classical (univariate) Local Moran’s I for an attribute vector x=(x1,,xn)x = (x_1,\ldots,x_n) over nn spatial units is defined, after mean-removal and (optionally) standardization, using a spatial weights matrix W=(wij)W = (w_{ij}) to encode the neighborhood structure. Mathematically:

Ii=j=1nwij(xixˉ)(xjxˉ)I_i = \sum_{j=1}^n w_{ij}\,(x_i - \bar{x})\,(x_j - \bar{x})

where xˉ\bar{x} is the global mean. This is formally a local “gamma index” of matrix similarity, and up to proportionality can be seen as the iith row "projection" of the global Moran’s I calculation (Wang et al., 2021).

Normalization of both weights and variables fundamentally alters the inferential and interpretive properties. Three prominent variants have been formally analyzed (Chen, 2022):

Index Weights Matrix Variable Type Formula Proportionality to Global II?
MI₁ un-normalized vijv_{ij} mean-centered yiy_i Ii(1)=jvijyiyjo2I_i^{(1)} = \frac{\sum_j v_{ij} y_i y_j}{o^2} Yes (factor nn0)
MI₂ row-normalized nn1 standardized nn2 nn3 No
MI₃ global-normalized nn4 standardized nn5 nn6 Yes (nn7)

Here, nn8, nn9 (where W=(wij)W = (w_{ij})0 is the population variance), W=(wij)W = (w_{ij})1 is the raw (un-normalized) adjacency, W=(wij)W = (w_{ij})2 with W=(wij)W = (w_{ij})3, and W=(wij)W = (w_{ij})4 with W=(wij)W = (w_{ij})5.

Of these, MI₃ (using global-normalized weights and standardization by population variance) uniquely satisfies Anselin’s second requirement that “the sum of LISAs for all observations is proportional to a global indicator of spatial association,” with constant W=(wij)W = (w_{ij})6 (Chen, 2022).

2. Panel and Multivariate Extensions

For spatial panel data—multiple outcomes or residuals W=(wij)W = (w_{ij})7 measured for each region W=(wij)W = (w_{ij})8 across time points W=(wij)W = (w_{ij})9—Local Moran’s I generalizes to a multivariate inner product form. Let Ii=j=1nwij(xixˉ)(xjxˉ)I_i = \sum_{j=1}^n w_{ij}\,(x_i - \bar{x})\,(x_j - \bar{x})0 denote the Ii=j=1nwij(xixˉ)(xjxˉ)I_i = \sum_{j=1}^n w_{ij}\,(x_i - \bar{x})\,(x_j - \bar{x})1-dimensional vector for region Ii=j=1nwij(xixˉ)(xjxˉ)I_i = \sum_{j=1}^n w_{ij}\,(x_i - \bar{x})\,(x_j - \bar{x})2, Ii=j=1nwij(xixˉ)(xjxˉ)I_i = \sum_{j=1}^n w_{ij}\,(x_i - \bar{x})\,(x_j - \bar{x})3 the across-region mean vector, and Ii=j=1nwij(xixˉ)(xjxˉ)I_i = \sum_{j=1}^n w_{ij}\,(x_i - \bar{x})\,(x_j - \bar{x})4 a positive-definite Ii=j=1nwij(xixˉ)(xjxˉ)I_i = \sum_{j=1}^n w_{ij}\,(x_i - \bar{x})\,(x_j - \bar{x})5 matrix (often the identity). The panel-data LISA is:

Ii=j=1nwij(xixˉ)(xjxˉ)I_i = \sum_{j=1}^n w_{ij}\,(x_i - \bar{x})\,(x_j - \bar{x})6

Matrix notation enables the simultaneous calculation of all Ii=j=1nwij(xixˉ)(xjxˉ)I_i = \sum_{j=1}^n w_{ij}\,(x_i - \bar{x})\,(x_j - \bar{x})7 entries as the diagonal of Ii=j=1nwij(xixˉ)(xjxˉ)I_i = \sum_{j=1}^n w_{ij}\,(x_i - \bar{x})\,(x_j - \bar{x})8, where Ii=j=1nwij(xixˉ)(xjxˉ)I_i = \sum_{j=1}^n w_{ij}\,(x_i - \bar{x})\,(x_j - \bar{x})9 is the xˉ\bar{x}0 data matrix (Wang et al., 2021).

A plausible implication is that this generalization retains theoretical properties (e.g., local/global decomposability under global-normalized weights) under appropriate centering and normalization, but with the local statistic now quantifying multivariate (“vector”) similarity.

3. Computational Algorithms and Inference

LISA values are not interpreted without significance testing. A standard workflow is:

  1. Standardize variables (usually via z-scores or mean-centering and population variance scaling).
  2. Construct an appropriate weights matrix (e.g., binary adjacency, inverse distance, with possible normalizations).
  3. For each region xˉ\bar{x}1, compute xˉ\bar{x}2.
  4. Assess significance via permutation: with region xˉ\bar{x}3’s value fixed, permute neighbor labels xˉ\bar{x}4 times, recompute xˉ\bar{x}5 for each, and estimate a pseudo xˉ\bar{x}6-value

xˉ\bar{x}7

  1. Apply multiple-testing correction, e.g., Benjamini–Hochberg FDR, to control for family-wise error (Mason et al., 2024, Wang et al., 2021).

For spatial panel models, analytic (non-sampling) bounds based on concentration inequalities can provide fast xˉ\bar{x}8 computation of xˉ\bar{x}9-values, avoiding the cost of ii0 enumerations or Monte Carlo approximation: ii1 where ii2 is the upper incomplete gamma, and the constants are calculated from the weight structure and similarity function (Wang et al., 2021).

4. Interpretation and Classification of Local Patterns

The sign and magnitude of each ii3 are interpreted in conjunction with the value at ii4 and the spatial lag ii5. Classical quadrant-based classification yields four cluster/outlier types:

  • High–High (HH): ii6, ii7, ii8—localized hot-spot (high value surrounded by highs)
  • Low–Low (LL): ii9, II0, II1—cold-spot (low value among lows)
  • High–Low (HL): II2, II3, II4—high-value outlier amidst lows
  • Low–High (LH): II5, II6, II7—low outlier among highs

A significant positive II8 indicates the attribute at II9 moves with its neighbors; a significant negative vijv_{ij}0 marks a spatial outlier. Final map labeling should always consider both the sign of vijv_{ij}1 and the accompanying vijv_{ij}2/vijv_{ij}3 values (Mason et al., 2024).

5. Visualization Techniques

Advanced interactive visualizations have been proposed to facilitate exploration and interpretation of Local Moran’s I statistics. Three systems highlighted include (Mason et al., 2024):

  • Moran Dual-Density Plot: Shows the distribution of vijv_{ij}4-scores for the whole study region (with neighbors marked), the spatial lag, the computed vijv_{ij}5, and the permutation-based significance.
  • Moran Network Scatterplot: Displays the Moran scatterplot (vijv_{ij}6 vs. vijv_{ij}7), with neighbor relations rendered interactively (edges colored and scaled by weight, neighbor cluster labels).
  • Spatial Lag Radial Plot: Radially arranges neighbors around vijv_{ij}8 by bearing, with distance set by vijv_{ij}9, and marker size by yiy_i0; a dashed circle indicates yiy_i1.

Brushing and linking allow coordinated updates across views, with map-based, scatterplot, and spatial context jointly updating as the user examines different locations, promoting holistic understanding of spatial patterns.

6. Comparative Properties and Best Practices

Key methodological considerations and comparisons include:

  • Global-normalized weights and population variance standardization (MI₃) uniquely ensure that LISA values sum exactly to the global Moran’s I (Chen, 2022).
  • Row normalization is often used in practical software and visualization for interpretability, but such MI₂ LISA values do not decompose the global measure.
  • Choice of weights (contiguity, distance, or inverse distance) and normalization method critically impact detected patterns.
  • Multiple testing is inherent: reporting cluster significance demands Type I error control; FDR procedures are standard (Wang et al., 2021, Mason et al., 2024).
  • Caution is warranted for sparse neighborhoods: unstable significance inference and wide null distributions can result for locations with very few neighbors.
  • For panel models, LISA can be directly applied to model residuals, supporting model checking for spatial dependence in spatial econometric regression (Wang et al., 2021).

7. Practical Applications and Case Studies

Local Moran’s I (LISA) is applied to diverse spatial problems. Notable recent implementations include:

  • Electoral Panel Model: Over 3,100 U.S. counties across five presidential elections (2000–2016). Covariates included population density, median income, and racial composition. LISA identified statistically significant Republican “corridors” and Democratic clusters, guiding substantive political analysis (Wang et al., 2021).
  • Cancer Mortality Studies: Age-adjusted cancer deaths in U.S. counties (2011–2020), with HH and LL clusters mapped at fine spatial scale. Visualization dashboards (available as open-source moranplot library) bring transparency and contextualization to spatial “hot-spot” findings (Mason et al., 2024).
  • Urban Demography: Empirical verification on Beijing–Tianjin–Hebei regional data demonstrates the exactness of MI₃ sum-to-global proportionality and illustrates how alternate weightings affect the cluster identification (Chen, 2022).

These studies exemplify the broad applicability and rigorous interpretability afforded by formally constructed LISA statistics, in both univariate and panel settings.

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Local Moran’s I (LISA).