Learning Causally Predictable Outcomes from Psychiatric Longitudinal Data (2506.16629v2)

Published 19 Jun 2025 in cs.LG, q-bio.QM, and stat.ML

Abstract: Causal inference in longitudinal biomedical data remains a central challenge, especially in psychiatry, where symptom heterogeneity and latent confounding frequently undermine classical estimators. Most existing methods for treatment effect estimation presuppose a fixed outcome variable and address confounding through observed covariate adjustment. However, the assumption of unconfoundedness may not hold for a fixed outcome in practice. To address this foundational limitation, we directly optimize the outcome definition to maximize causal identifiability. Our DEBIAS (Durable Effects with Backdoor-Invariant Aggregated Symptoms) algorithm learns non-negative, clinically interpretable weights for outcome aggregation, maximizing durable treatment effects and empirically minimizing both observed and latent confounding by leveraging the time-limited direct effects of prior treatments in psychiatric longitudinal data. The algorithm also furnishes an empirically verifiable test for outcome unconfoundedness. DEBIAS consistently outperforms state-of-the-art methods in recovering causal effects for clinically interpretable composite outcomes across comprehensive experiments in depression and schizophrenia.

Collections

Sign up for free to add this paper to one or more collections.

Sign Up

Summary

The paper introduces the DEBIAS algorithm, which optimizes the aggregation of psychiatric symptom items to enhance causal identifiability in longitudinal studies.
It employs partial correlation maximization, latent confounding penalties, and orthogonality constraints to derive clinically interpretable outcome scores.
Empirical results on depression and schizophrenia datasets demonstrate superior treatment correlation and confounding control compared to state-of-the-art methods.

Causal Outcome Learning in Psychiatric Longitudinal Data: The DEBIAS Algorithm

This work presents a methodologically rigorous and empirically validated framework for outcome learning in psychiatric longitudinal studies, focusing on causally predictable outcomes in the presence of latent confounding. The DEBIAS (Durable Effects with Backdoor-Invariant Aggregated Symptoms) algorithm addresses foundational challenges in longitudinal causal inference by optimizing clinically interpretable aggregations of symptom items to achieve robust, durable treatment effect estimation.

Problem Context and Conceptual Advance

Observational longitudinal data in psychiatry often entail high symptom heterogeneity and pervasive latent confounding, severely undermining the validity of classical treatment effect estimators. Existing methods presume a fixed outcome variable and rely on observed covariate adjustment to address confounding, yet unobserved confounders frequently bias effect estimates. Traditional approaches assume unconfoundedness for the fixed outcome, a condition rarely testable or satisfied in practice — especially for composite measures like total severity scores. Moreover, outcomes are typically not optimized for causal identifiability, but rather for predictivity or group discrimination.

In contrast, this work conceptualizes outcome learning as the empirical search for aggregations of outcome items (via non-negative, interpretable weights) that maximize causal identifiability and minimize both observed and latent confounding. The central innovation is to algorithmically construct these outcome aggregations so that causal inference on the learned outcomes becomes empirically possible and statistically testable.

DEBIAS Algorithm: Formulation and Optimization

DEBIAS operationalizes this outcome-centric paradigm by targeting the scenario where short-term (historical) treatments have only restricted direct effects in time, as is realistic for many psychiatric medications. The algorithm learns weights $\alpha$ for outcome items such that, after adjustment for current treatment and covariates, the aggregated outcome is independent of previous treatments. This blocking of backdoor paths empirically eliminates confounding by exploiting the temporal structure inherent in psychiatric treatment.

The main optimization problem involves maximizing the (partial) correlation between treatment and the learned outcome at all post-treatment timepoints, penalized by partial correlations with historical treatments (to minimize confounding), and further regularized to ensure orthogonality among multiple derived scores. The constraints enforce non-negativity and normalization for clinical and statistical interpretability.

Key Algorithmic Steps

Partial Correlation Maximization: The primary objective is to maximize partial correlations (or equivalently, standardized group separability) between treatment and aggregated outcome at future timepoints.
Latent Confounding Penalty: Partial correlations with historical treatments, adjusting for current treatment and covariates, are penalized to minimize effects from latent confounders.
Orthogonality Constraint: When extracting multiple scores, a Mahalanobis cosine similarity penalty promotes non-redundant severity dimensions.
Projected Gradient Procedure: Projected gradient ascent with Armijo backtracking is used for optimization, with explicit projection onto the feasible non-negative, normalized domain.
Hyperparameter Selection: The regularization parameter is selected via cross-validation, constrained by statistical tests for confounding (i.e., requiring adequate p-values for confounding metrics).

The computational complexity scales linearly with the number of subjects and time points, and quadratically with the number of outcome items, covariates, and scores—permitting application to sizeable psychiatric datasets.

Comparative and Empirical Results

DEBIAS is thoroughly evaluated against state-of-the-art comparators, including IPTW-augmented NNCCA, R-Learner with XGBoost, and Causal Forests. The key distinguishing features of DEBIAS include:

Joint adjustment for both observed and latent confounding via leveraging temporal information.
Extraction of multiple, interpretable aggregated outcomes even under binary treatments.
Borrowing statistical strength across timepoints for more stable outcome construction.

On both the TADS depression and CATIE schizophrenia datasets (with artificially induced confounding for ground-truth assessment), DEBIAS demonstrates:

Highest correlations between treatment and learned outcome scores at clinically meaningful timepoints.
Largest p-values for residual confounding associations, indicating superior confounding control.
Lowest total weight indexed on confounded outcome items, reflecting selectivity and robustness.
Efficient computation (within 10–20 seconds on datasets of several hundred subjects).

In ablation analyses, omitting either the correlation maximization or confounding penalty consistently resulted in inferior performance, confirming their necessity.

Theoretical Implications

The empirical restoration of unconfoundedness for the learned composite outcomes — as opposed to a priori defined total scores — demonstrates that the identifiability of causal effects can depend as much on outcome definition as on predictor selection or adjustment. This insight generalizes the pursuit of causal estimands to an outcome-learning axis, providing provable guarantees (to the extent of conditional independence approximated by partial correlations) for a subset of outcome spaces where causal inference is tractable.

The method departs from the conventional paradigm by algorithmically selecting outcomes (rather than predictors) to be causally estimable, representing a substantive extension of the potential outcomes framework and of empirical causal modeling.

Practical and Future Implications

Practically, DEBIAS enables the derivation of composite psychiatric outcomes that are maximally responsive to intervention, empirically unconfounded, and interpretable. This offers substantial advantages for longitudinal cohort studies, real-world evidence generation, and clinical trial analyses—especially where randomized assignment is infeasible and covariate data is incomplete. Its non-negativity and sparsity constraints ensure derived outcome scores are compatible with clinical reasoning.

Potential practical extensions include:

Application to other medical specialties, e.g., multi-domain composite endpoints in cardiovascular or metabolic diseases.
Generalization to nonlinear outcome transformations and kernelized versions for settings with complex dependencies.
Incorporation into multitask or federated learning frameworks for multi-site or privacy-preserving causal effect estimation.

Future research should investigate non-linear extensions, evaluate impact in high-dimensional phenotypic settings (e.g., multi-omics), and more deeply connect algorithmic outcome learning with graphical causal structure discovery.

Conclusion

This paper establishes outcome learning as an essential axis for robust causal inference in longitudinal observational studies, formally articulating and empirically validating the construction of causally predictable composite outcomes via the DEBIAS algorithm. The approach provides both theoretical rigor and measurable applied advantages, and is well-positioned for broader adoption across both methodological research and applied clinical data science in psychiatry and beyond.

PDF Markdown

Follow-up Questions

We haven't generated follow-up questions for this paper yet.

Generate Now

Related Papers

Authors (1)

Eric V. Strobl

Tweets

https://twitter.com/XTXI/status/1937127284344758500