Local Indicators of Spatial Association (LISA)
- Local Indicators of Spatial Association (LISA) are spatial statistical tools that decompose global measures like Moran's I to quantify local autocorrelation.
- They classify observations into distinct clusters (high-high, low-low) or outliers (high-low, low-high) to facilitate local spatial analysis.
- LISA methods use spatial weights and permutation tests to assess significance, with applications spanning economics, epidemiology, ecology, and medical imaging.
Local Indicators of Spatial Association (LISA) are a class of spatial statistics designed to measure the degree, nature, and significance of spatial autocorrelation at local, rather than solely global, scales. LISA statistics decompose global indices of spatial association, such as Moran’s I, providing spatially explicit diagnostics that distinguish between local spatial clusters, outliers, and randomness. Their adoption is foundational in exploratory spatial data analysis (ESDA), spatial econometrics, and spatial epidemiology, among other domains.
1. Mathematical Formulation and Key Principles
Local Moran’s I, the canonical LISA, quantifies the contribution of each observation to the overall spatial autocorrelation diagnosed by global Moran’s I. For a univariate variable with mean , and a spatial weight matrix , the local statistic is most commonly given as:
where:
- (z-score normalization, with )
- : second moment
- : spatial weights that encode proximity or adjacency (Mason et al., 2024).
With row-normalized weights (), an equivalent, widely-used formulation is:
This normalization reflects the effective degrees of freedom at each site.
2. Relationship to Global Moran’s I
Global Moran’s I measures overall spatial autocorrelation by aggregating cross-products over all pairs:
LISA statistics "disaggregate" this quantity spatially. The global index can be recovered as a weighted average (or sum) of the local indicators:
when weights and normalization are compatible (Mason et al., 2024, Chen, 2016). Each quantifies the extent to which the value at location and the values at neighboring sites jointly contribute to the overall spatial autocorrelation.
3. Interpretation, Classification, and Statistical Testing
The sign and magnitude of local Moran’s are diagnostic:
- : observation and its neighbors exhibit similar z-signs (local clusters: high-high or low-low)
- : observation is an outlier relative to its neighbors (high-low or low-high, i.e., spatial outlier)
Local LISA diagnostics thus classify each observation into one of four categories:
- High-High (HH): , : local hot spot
- Low-Low (LL): , : local cold spot
- High-Low (HL): , : high-value spatial outlier
- Low-High (LH): , : low-value spatial outlier (Mason et al., 2024)
Empirical significance of each is established via conditional random permutation tests. For each location , the neighborhood is randomly permuted many times (typically 999 or more), and the pseudo-p-value is the fraction of permuted values exceeding the observed magnitude. Multiple testing adjustments are critical for interpreting statistical significance due to the large number of local tests (Mason et al., 2024).
4. The Construction and Role of Spatial Weights
The spatial weights matrix is foundational to LISA statistics:
- It is typically symmetric (), with zero diagonal (), and constrained so that each row sums to unity (row-standardization) or the total weights sum to .
- Common choices include binary adjacency (rook/queen contiguity), k-nearest neighbors, or inverse distance (Mason et al., 2024).
The sensitivity of LISA statistics to the specification of is substantial; results may vary markedly based on neighborhood definitions, with direct implications for the spatial clusters and outliers detected.
5. Visualization and Diagnostic Tools
Advanced visual analytic techniques have been developed for LISA, reflecting their complexity and interpretive richness:
- Moran Scatterplot: Plots standardized value vs. the spatial lag . Trend-line slope approximates the global index. The quadrant location provides immediate categorization (HH, LL, HL, LH) (Chen, 2016, Mason et al., 2024).
- Dual-Density Plot: Visualizes the distribution of z-scores and local neighbor values, spatial lags, permutation distributions of , and empirical significance. Encodings (point size, color) communicate weight and cluster type (Mason et al., 2024).
- Network Scatterplot and Lag Radial Plot: Explore the relational network among local statistics and the geographic orientation of local clusters, augmenting traditional spatial maps.
Integrated dashboards linking these displays via interactive brushing and detail-on-demand facilitate multiscale diagnosis of spatial association structure within large geostatistical datasets (Mason et al., 2024).
6. Theoretical Properties, Extensions, and Limitations
- Quadratic Form and Eigenstructure: Local Moran’s indices arise as diagonal entries of . The eigenstructure of underpins both global and local statistics (Chen, 2016).
- Distribution Under Spatial Randomness: Under the null hypothesis, (after standardization and suitable normalization), but in finite samples (Mason et al., 2024).
- Limitations and Considerations:
- The choice of heavily impacts results—artifacts can arise with poorly matched neighborhood definitions (Mason et al., 2024).
- Non-Gaussianity in X, low neighborhood variance, or sparse graphs complicate the interpretation of .
- Multiple testing issues are acute—naively interpreting clusters without controlling Type I error can be misleading.
- LISA is strictly first-moment based; it detects only linear association in values across space.
Extensions of LISA include bivariate forms, multiple scales, and separate treatment for categorical variables, though the basic local cross-product structure remains foundational.
7. Applications and Empirical Paradigms
LISA is operationalized in diverse settings:
- Regional science: Identification of economic hot-spots and cold-spots.
- Epidemiology: Detection of disease clusters beyond environmental randomness.
- Ecology: Mapping of biodiversity hot-spots.
- Spatial machine learning: Auxiliary-task learning using multi-resolution LISA to enhance neural networks with inductive biases for spatial autocorrelation (Klemmer et al., 2020).
- Radiomics and medical imaging: Segmentation of pathological tissue via slice-wise LISA calculations (Ryan et al., 2018).
Empirical workflows typically consist of exploratory spatial data transformation, construction and standardization of , computation of LISA statistics, permutation-based significance testing, and visualization using scatterplots, mapped clusters, and interactive diagnostics (Mason et al., 2024).
LISA statistics provide a rigorous, mathematically grounded mechanism to analyze spatial autocorrelation at local scales. They extend the interpretive range and resolution of global autocorrelation indices, are sensitive to spatial weight formulations, and require careful statistical interpretation, especially when assessing local significance. Their development and integration with modern geostatistical visualization and modeling paradigms have made them core instruments in spatial quantitative analysis (Mason et al., 2024, Chen, 2016).