Linear Structural Equations in SEMs
- Linear Structural Equations are mathematical formulations in SEMs that model linear functional relationships among observed variables using mixed graphs and structured matrices.
- The framework utilizes structural coefficient matrices and error covariance representations to capture directed effects and latent confounding in complex data.
- Global identifiability conditions, derived from avoiding specific subgraph structures, ensure unique parameter recovery and reliable statistical inference.
Linear structural equations constitute the foundation of a broad class of multivariate statistical models known as linear structural equation models (SEMs). These models describe noisy linear functional relationships among observed variables, are encoded by mixed graphs (comprising both directed and bidirected edges), and are parametrized via matrices representing structural coefficients and error covariances. Their mathematical, graphical, and inferential properties have been the subject of substantial theoretical development, particularly in the context of identifiability, algorithmic recovery, and algebraic characterization.
1. Formal Definition and Model Parametrization
A linear SEM is built upon a mixed graph where indexes observed variables, is the set of directed edges specifying linear effects (e.g., interpreted as ), and is the set of bidirected edges encoding potential correlation between exogenous errors (reflecting, for instance, latent confounding). The model, under Gaussian noise, is parametrized by :
- is the matrix of structural coefficients, with whenever .
- is a positive definite error covariance matrix, with off-diagonal entries unless .
The induced covariance matrix over is
and the parametrization map is
The invertibility of is ensured by requiring acyclicity in the directed part of . The explicit rational form for is central to both identification theory and statistical estimation.
2. Global Identifiability: Injectivity and Graphical Characterization
Global identifiability is defined as the injectivity of the parametrization map , i.e., the property that implies . This property is essential for valid statistical inference: when injectivity fails, parameter estimates may not be unique and the likelihood surface may possess singularities, resulting in nonstandard asymptotic properties (e.g., likelihood ratio test statistics deviating from behavior).
The main graphical result is a necessary and sufficient condition: for an acyclic mixed graph fails to be injective if and only if contains an induced subgraph (with ) such that
- its directed part contains a converging arborescence (i.e., a directed tree spanning and converging at some node ),
- and the bidirected part is connected.
This dual condition (combining directed flow convergence and error correlation connectedness) pinpoints the precise structural situations in which non-identifiability arises.
3. Statistical Implications and Pathological Scenarios
When the global identifiability criterion is violated, inference procedures—such as maximum likelihood estimation and the assessment of significance via likelihood ratio tests—become unreliable. Simulations (see Figure 1 and 2 in (Drton et al., 2010)) demonstrate that in the non-injective case, the distribution of -values under the null hypothesis is nonuniform (e.g., overly conservative), reflecting an irregular model geometry. For example, in a model where , , and specific off-diagonal error covariances, the map is not injective and the resulting hypothesis test does not behave as expected. Modifying the parameters (e.g., ) can restore injectivity and standard asymptotics.
4. Algebraic and Combinatorial Structure
The image of is an algebraic variety, whose structure is shaped by the combinatorics of . For acyclic graphs, inversion of can be achieved via recursive (block-LDL) decomposition; for general graphs (especially with cycles), 's entries remain rational functions of the parameters. The algebraic viewpoint reveals that satisfies polynomial constraints—many of which correspond to conditional independence and more intricate (non-determinantal) relations. These constraints are critical in model testing and structure learning.
A related combinatorial result is the relationship between treks (paths without colliding arrowheads) and matrix entries: each decomposes as a sum over treks between and , with each trek monomial defined by products of 's and 's. This "trek rule" facilitates both explicit parameter recovery and the computation of determinantal (rank) constraints on submatrices of .
5. Practical and Methodological Considerations
Ensuring global identifiability at the model specification stage is paramount for reliable inference. The absence of the problematic induced subgraphs described above is a necessary and sufficient graphical criterion. In practical terms, the diagrammatic structure of the SEM must avoid any set of nodes for which the directed structure supports a converging arborescence and the error terms are fully connected via bidirected edges.
Applied researchers are thereby advised to inspect their mixed graph models for such substructures. Failure to do so risks inference anomalies, including nonuniqueness of parameter solutions, degeneracy of the likelihood function, and invalid asymptotics for classical test statistics.
6. Summary Table: Identifiability Condition
Graph Structural Feature | Identifiability Consequence | Inferential Implication |
---|---|---|
No induced subgraph with both converging | Global identifiability (injective) | Unique parameters; regular likelihood |
arborescence and connected bidirected part | correct χ² inference | |
Some induced subgraph with both | Not globally identifiable | Nonunique parameters; irregular model |
Possibly nonstandard -values |
7. Conclusion
Linear structural equations, as formalized by mixed graph models and parametrized via explicit matrix functions, are globally identifiable if and only if their underlying graphs avoid specific substructures that couple directed arborescences and error covariance connectivity. This global identifiability is requisite for classical statistical inference, and its graphical characterization affords an effective and essential diagnostic for SEM construction and application. The interplay of linear algebra, combinatorics, and statistical theory thereby underpins both the power and the delicacy of inference in linear SEM frameworks (Drton et al., 2010).