Modeling Interconnected Modules in Multivariate Outcomes: Evaluating the Impact of Alcohol Intake on Plasma Metabolomics (2404.10884v1)
Abstract: Alcohol consumption has been shown to influence cardiovascular mechanisms in humans, leading to observable alterations in the plasma metabolomic profile. Regression models are commonly employed to investigate these effects, treating metabolomics features as the outcomes and alcohol intake as the exposure. Given the latent dependence structure among the numerous metabolomic features (e.g., co-expression networks with interconnected modules), modeling this structure is crucial for accurately identifying metabolomic features associated with alcohol intake. However, integrating dependence structures into regression models remains difficult in both estimation and inference procedures due to their large or high dimensionality. To bridge this gap, we propose an innovative multivariate regression model that accounts for correlations among outcome features by incorporating an interconnected community structure. Furthermore, we derive closed-form and likelihood-based estimators, accompanied by explicit exact and explicit asymptotic covariance matrix estimators, respectively. Simulation analysis demonstrates that our approach provides accurate estimation of both dependence and regression coefficients, and enhances sensitivity while maintaining a well-controlled discovery rate, as evidenced through benchmarking against existing regression models. Finally, we apply our approach to assess the impact of alcohol intake on $249$ metabolomic biomarkers measured using nuclear magnetic resonance spectroscopy. The results indicate that alcohol intake can elevate high-density lipoprotein levels by enhancing the transport rate of Apolipoproteins A1.
- Matrix Riccati equations in control and systems theory. Birkhäuser.
- Controlling the false discovery rate: a practical and powerful approach to multiple testing. Journal of the Royal statistical society: series B (Methodological) 57, 289–300.
- Advanced econometric methods. Springer, New York, NY.
- On seemingly unrelated regressions with uniform correlation error. Communications in Statistics - Theory and Methods 51, 5714–5727.
- Conditions for the numerical equality of the ols, gls and amemiya–cragg estimators. Economics Letters 116, 538–540.
- Magnus, J. R. (1978). Maximum likelihood estimation of the gls model with unknown parameters in the disturbance covariance matrix. Journal of Econometrics 7, 281–312.
- The equality of the ordinary least squares estimator and the best linear unbiased estimator. The American Statistician 43, 153–161.
- The algebraic matrix riccati equation. In Topics in Operator Theory Systems and Networks, pages 351–381. Springer.
- Schmidt, P. (2020). Econometrics. CRC Press.
- Covariance matrix estimation for high-throughput biomedical data with interconnected communities. The American Statistician .