Scalable Scalar-on-Image Cortical Surface Regression with a Relaxed-Thresholded Gaussian Process Prior (2403.13628v1)
Abstract: In addressing the challenge of analysing the large-scale Adolescent Brain Cognition Development (ABCD) fMRI dataset, involving over 5,000 subjects and extensive neuroimaging data, we propose a scalable Bayesian scalar-on-image regression model for computational feasibility and efficiency. Our model employs a relaxed-thresholded Gaussian process (RTGP), integrating piecewise-smooth, sparse, and continuous functions capable of both hard- and soft-thresholding. This approach introduces additional flexibility in feature selection in scalar-on-image regression and leads to scalable posterior computation by adopting a variational approximation and utilising the Karhunen-Lo`eve expansion for Gaussian processes. This advancement substantially reduces the computational costs in vertex-wise analysis of cortical surface data in large-scale Bayesian spatial models. The model's parameter estimation and prediction accuracy and feature selection performance are validated through extensive simulation studies and an application to the ABCD study. Here, we perform regression analysis correlating intelligence scores with task-based functional MRI data, taking into account confounding factors including age, sex, and parental education level. This validation highlights our model's capability to handle large-scale neuroimaging data while maintaining computational feasibility and accuracy.
- Optimal predictive model selection. Annals of Statistics 32, 870–897.
- Berlucchi, G. (1983). Two hemispheres but one brain. Behavioral and Brain Sciences 6, 171–172.
- Bishop, C. M. (2006). Pattern Recognition and Machine Learning. Springer New York.
- Variational inference: A review for statisticians. JASA 112, 859–877.
- The Adolescent Brain Cognitive Development (ABCD) study: imaging acquisition across 21 sites. Developmental Cognitive Neuroscience 32, 43–54.
- Cressie, N. A. (1993). Statistics for Spatial Data.
- Hierarchical nearest-neighbor gaussian process models for large geostatistical datasets. JASA 111, 800–812.
- Cortical surface-based analysis: Ii: inflation, flattening, and a surface-based coordinate system. NeuroImage 9, 195–207.
- Non-stationary spatial modeling. In Bayesian Statistics 6 – Proceedings of the Sixth Valencia Meeting, pages 761–768.
- Introduction to variational methods for graphical models. Machine Learning 37, 183–233.
- Geodesic squared exponential kernel for non-rigid shape registration. pages 1–8. IEEE.
- Scalar-on-image regression via the soft-thresholded Gaussian process. Biometrika 105, 165–184.
- Cortical thickness correlates of specific cognitive performance accounted for by the general factor of intelligence in healthy children aged 6 to 18. NeuroImage 55, 1443–1453.
- A general framework for Vecchia approximations of Gaussian Processes. Statistical Science 36, 124–141.
- Li, M. (2023). Statistical inference on large-scale and complex data via Gaussian Process. PhD thesis.
- Adolescent neurocognitive development and impacts of substance use: Overview of the Adolescent Brain Cognitive Development (ABCD) baseline neurocognition battery. Developmental Cognitive Neuroscience 32, 67–79.
- Reports of the death of brain-behavior associations have been greatly exaggerated. bioRxiv page 2023.06.16.545340.
- Reproducible brain-wide association studies require thousands of individuals. Nature 2022 603:7902 603, 654–660.
- Matérn, B. (2013). Spatial Variation: Stochastic models and their application to some problems in forest surveys and other sampling investigations. PhD thesis.
- A bayesian general linear modeling approach to cortical surface fmri data analysis. JASA 115, 501–520.
- Bayesian lesion estimation with a structured spike-and-slab prior. JASA 0, 1–23.
- White matter integrity and cognitive performance in school-age children: A population-based neuroimaging study. NeuroImage 119, 119–128.
- Nelsen, R. B. (1999). An Introduction to Copulas. An Introduction to Copulas .
- ABCD Neurocognitive Prediction Challenge 2019: Predicting individual residual fluid intelligence scores from cortical grey matter morphology. Lecture Notes in Computer Science pages 114–123.
- Spatial shrinkage via the product independent Gaussian process prior. JCGS 30, 1068–1080.
- Thresholded Multiscale Gaussian Processes with Application to Bayesian Feature Selection for Massive Neuroimaging Data. arXiv .
- Stein, M. L. (1999). Interpolation of spatial data : some theory for kriging. Springer.
- Stein, M. L. (2014). Limitations on low rank approximations for covariance matrices of spatial data. Spatial Statistics 8, 1–19.
- Vecchia, A. V. (1988). Estimation and model identification for continuous spatial processes. J. R. Stat., Series B 50, 297–312.
- Bayesian inference for group-level cortical surface image-on-scalar-regression with Gaussian process priors. arXiv .