Spatial Durbin Model (SDM) Overview
- Spatial Durbin Models (SDM) are econometric frameworks that integrate lagged outcomes and covariates to capture both direct and spillover effects.
- They employ a K-nearest neighbor spatial weights matrix to standardize neighbor interactions and enhance model robustness.
- Implementation via maximum likelihood estimation in R and Python facilitates the decomposition of direct, indirect, and total effects for policy insights.
The Spatial Durbin Model (SDM) is a prominent specification in spatial econometrics and network analysis for modeling regional or networked data where an outcome in one location is affected not only by its own characteristics but also by interactions with neighboring units. The SDM explicitly incorporates endogenous spatial dependence (via lagged outcomes) and exogenous contextual dependence (via lagged covariates), enabling a rigorous decomposition of direct and spillover effects. Applied notably in the spatial analysis of morbidity and poverty in Thailand’s provinces, the SDM provides a framework to quantify the diffusion and contextual influence processes on spatial networks (Kukieattikool et al., 6 Jan 2026).
1. Mathematical Specification
The SDM is defined in matrix notation as: where:
- is an vector of dependent variable observations (e.g., province-level disease ratios).
- is an row-standardized spatial weights matrix.
- is a scalar autoregressive coefficient capturing “outcome contagion” via .
- is an matrix of explanatory variables (e.g., poverty indicators).
- is a vector of direct effects.
- is an matrix of spatially lagged covariates (neighbor characteristics).
- is a vector of spillover effects.
- is an vector of disturbances, .
This model collapses to standard spatial autoregressive or spatial error models under restricted parameterizations but is unique in jointly representing endogenous and exogenous spatial interactions. Under regularity conditions (), the reduced form is given by: This form is essential for effect decomposition and inference (Kukieattikool et al., 6 Jan 2026).
2. Construction of the Spatial Weights Matrix
In the referenced application, the weights matrix encodes a fixed-degree undirected network among Thai provinces (excluding Bangkok). Its construction involves:
- Geographical centroid computation for each province.
- For each province , identification of its nearest neighbors () by Euclidean centroid-to-centroid distance.
- Raw adjacency matrix where if , else $0$.
- Row standardization: if , $0$ otherwise.
This -nearest neighbor (NN) schema ensures homogeneity in neighbor set cardinality, avoiding the unevenness inherent in contiguity-based schemes (Kukieattikool et al., 6 Jan 2026).
| Step | Description | Resultant Matrix/Operation |
|---|---|---|
| Centroid Calculation | Each province | Geographical coordinates |
| -NN Selection | 7 nearest centroids per province | Adjacency indicator matrix () |
| Row-Standardization | Scale rows to sum to 1 | Final : all neighbors |
3. Estimation, Identification, and Assumptions
Estimation employs maximum likelihood (MLE) for the SDM with Gaussian errors, implemented (in R) via the lagsarlm function from the spatialreg package, type="mixed" indicating the SDM. Key assumptions include:
- is independently, identically distributed, normal with constant variance.
- and are exogenous and uncorrelated with (no simultaneity).
- is exogenously specified and such that is nonsingular ().
- Identification requires sufficient non-collinearity between and for distinct estimation of and . Strong collinearity impairs identification, but with distinct spatial and local poverty indicators this is mitigated (Kukieattikool et al., 6 Jan 2026).
4. Decomposition of Effects
In SDMs, feedback through generates local and propagated effects. For each covariate :
- Direct effect: Average of the diagonal elements—average own-unit response to a covariate increase.
- Indirect (spillover) effect: Average of off-diagonal row sums—average effect on other units from a covariate change in one unit.
- Total effect: Sum of direct and indirect effects.
For example, in modeling digestive disease morbidity ("C2"), direct effects identified living deprivation (). Indirect effects included health deprivation (), accessibility deprivation (), and poor-household count (). A one-unit increase in living deprivation increased local morbidity, while increased neighbor health deprivation reduced it, and increased neighboring poor households raised local morbidity (Kukieattikool et al., 6 Jan 2026).
5. Diagnostics and Spatial Dependence Tests
Spatial diagnostics prior to, during, and post-modeling are critical. The following were employed:
- Global Moran’s I: Detects overall spatial autocorrelation in :
- Monte Carlo permutation testing: Assesses significance of spatial indices.
- Local Moran’s I (LISA): Identifies spatial clusters (HH, LL, HL, LH) with corresponding p-values.
- LM tests on SDM residuals: Confirms adequacy by checking for absence of remaining spatial autocorrelation.
- Model selection criteria: Akaike Information Criterion (AIC), log-likelihood, and significance of test for improved fit and presence of spatial dependence (Kukieattikool et al., 6 Jan 2026).
6. Practical Implementation (R and Python)
The SDM and diagnostics are operationalized as follows:
R (spatialreg/spdep):
1 2 3 4 5 6 7 8 9 10 11 |
library(spdep)
coords <- cbind(province_data%%%%56%%%%lat)
knn7 <- knearneigh(coords, k=7)
nb7 <- knn2nb(knn7)
listw7 <- nb2listw(nb7, style="W")
y <- province_data$ICD10_ratio_C2
X <- as.matrix(province_data[, c("pov.rate","CNT","living","health","education","income","accessibility")])
sdm_C2 <- lagsarlm(y ~ pov.rate + CNT + living + health + education + income + accessibility,
data=province_data, listw=listw7, type="mixed")
imp <- impacts(sdm_C2, listw=listw7, R=1000)
lm.morantest(sdm_C2, listw7) |
Python (PySAL spreg/libpysal):
1 2 3 4 5 6 7 8 |
import libpysal from libpysal.weights import KNN from spreg import ML_Durbin w = KNN.from_dataframe(df, k=7) w.transform = 'R' y = df['ICD10_ratio_C2'].values.reshape((-1,1)) X = df[['pov.rate','CNT','living','health','education','income','accessibility']].values sdm = ML_Durbin(y=y, x=X, w=w, name_y='C2', name_x=list_of_X_names, name_w='KNN7') |
impacts() or attribute accessors (Kukieattikool et al., 6 Jan 2026).
7. Applications and Implications
Application of the SDM to Thailand’s provincial morbidity and poverty data revealed strong spatial clustering in health outcomes, with neighboring influences often dominating local effects. These results substantiate processes such as contagion, contextual influence, and structural diffusion. The framework underscores the necessity of inter-jurisdictional policy responses, as spillovers cross administrative boundaries. More broadly, the SDM provides a statistical basis for assessing spatial network effects within the study of health inequality, regional vulnerability, and multi-attribute social phenomena (Kukieattikool et al., 6 Jan 2026).