Heterogeneous Image-based Classification Using Distributional Data Analysis (2403.07126v1)
Abstract: Diagnostic imaging has gained prominence as potential biomarkers for early detection and diagnosis in a diverse array of disorders including cancer. However, existing methods routinely face challenges arising from various factors such as image heterogeneity. We develop a novel imaging-based distributional data analysis (DDA) approach that incorporates the probability (quantile) distribution of the pixel-level features as covariates. The proposed approach uses a smoothed quantile distribution (via a suitable basis representation) as functional predictors in a scalar-on-functional quantile regression model. Some distinctive features of the proposed approach include the ability to: (i) account for heterogeneity within the image; (ii) incorporate granular information spanning the entire distribution; and (iii) tackle variability in image sizes for unregistered images in cancer applications. Our primary goal is risk prediction in Hepatocellular carcinoma that is achieved via predicting the change in tumor grades at post-diagnostic visits using pre-diagnostic enhancement pattern mapping (EPM) images of the liver. Along the way, the proposed DDA approach is also used for case versus control diagnosis and risk stratification objectives. Our analysis reveals that when coupled with global structural radiomics features derived from the corresponding T1-MRI scans, the proposed smoothed quantile distributions derived from EPM images showed considerable improvements in sensitivity and comparable specificity in contrast to classification based on routinely used summary measures that do not account for image heterogeneity. Given that there are limited predictive modeling approaches based on heterogeneous images in cancer, the proposed method is expected to provide considerable advantages in image-based early detection and risk prediction.
- “Decoding tumour phenotype by noninvasive imaging using a quantitative radiomics approach” In Nat Comm 5 Nature Publishing Group UK London, 2014, pp. 4006
- “Modelling a response as a function of high-frequency count data: the association between physical activity and fat mass” In Stat Meth in Med Res 26.5 Sage Publications Sage UK: London, England, 2017, pp. 2210–2226
- Madhuri Avula, Narasimha Prasad Lakkakula and Murali Prasad Raja “Bone Cancer Detection from MRI Scan Imagery Using Mean Pixel Intensity” In 2014 8th Asia Modelling Symposium, 2014, pp. 141–146 DOI: 10.1109/AMS.2014.36
- “Liver Imaging Reporting and Data System (LI-RADS) version 2018: imaging of hepatocellular carcinoma in at-risk patients” In Radiology 289.3 Radiological Society of North America, 2018, pp. 816–830
- Ibiayi Dagogo-Jack and Alice T Shaw “Tumour heterogeneity and resistance to cancer therapies” In Nat Rev Clin Onc 15.2 Nature Publishing Group UK London, 2018, pp. 81–94
- “Wavelet shrinkage: asymptopia?” In J of the Royal Stat Soc: Series B 57.2 Wiley Online Library, 1995, pp. 301–337
- “Compositional data analysis in time-use epidemiology: what, why, how” In Int J of Env Res and Pub Health 17.7 MDPI, 2020, pp. 2220
- “Bayesian scalar on image regression with nonignorable nonresponse” In J Am Stat Assoc 115.532 Taylor & Francis, 2020, pp. 1574–1597
- “Characterizing evoked hemodynamics with fMRI” In Neuroimage 2.2 Elsevier, 1995, pp. 157–165
- Laya Ghodrati and Victor M Panaretos “Distribution-on-distribution regression via optimal transport maps” In Biometrika 109.4 Oxford University Press, 2022, pp. 957–974
- “Distributional data analysis via quantile functions and its application to modeling digital biomarkers of gait in Alzheimer’s disease” In Biostatistics 24.3 Oxford University Press, 2023, pp. 539–561
- Rajarshi Guhaniyogi, Shaan Qamar and David B Dunson “Bayesian tensor regression” In J Mach Learn Res 18.1 JMLR. org, 2017, pp. 2733–2763
- Jonathan RM Hosking “L-moments: analysis and estimation of distributions using linear combinations of order statistics” In J of the Royal Stat Soc, Series B 52.1 Oxford University Press, 1990, pp. 105–124
- The MathWorks Inc. “MATLAB version: 9.13.0 (R2022b)” Natick, Massachusetts, United States: The MathWorks Inc., 2022 URL: https://www.mathworks.com
- “Bayesian longitudinal tensor response regression for modeling neuroplasticity” In Human Brain Mapping Wiley Online Library, 2023
- “Radiomics: the bridge between medical imaging and personalized medicine” In Nat Rev Clin Onc 14.12 Nature Publishing Group UK London, 2017, pp. 749–762
- “Radiomics: extracting more information from medical images using advanced feature analysis” In Euro. J. of Cancer 48.4 Elsevier, 2012, pp. 441–446
- “Quantitative radiomics studies for tissue characterization: a review of technology and methodological procedures” In The British J of Rad 90.1070 The British Institute of Radiology., 2017, pp. 20160665
- “Spatial Bayesian variable selection models on functional magnetic resonance imaging time-series data” In Bayesian Analysis 9.3 NIH Public Access, 2014, pp. 699
- Xin Ma, Suprateek Kundu and Alzheimer’s Disease Neuroimaging Initiative “Multi-task learning with high-dimensional noisy images” In J Am Stat Assoc Taylor & Francis, 2022, pp. 1–14
- Nadim Mahmud, Maarouf A Hoteit and David S Goldberg “Risk Factors and Center-Level Variation in Hepatocellular Carcinoma Under-Staging for Liver Transplantation” In Liver Trans 26.8 Wiley Online Library, 2020, pp. 977–988
- “Distributional data analysis with accelerometer data in a nhanes database with nonparametric survey regression models.” In arXiv:2104.01165, 2021
- Jeffrey S Morris “Functional regression” In Ann. Rev Stat and Its Appl 2 Annual Reviews, 2015, pp. 321–359
- “Characterization of the Imaging Signature of Hepatocellular Carcinoma with Enhancement Pattern Mapping” In HEPATOLOGY 76, 2022, pp. S1356–S1356 WILEY 111 RIVER ST, NJ USA
- “A Safe Feature Elimination Rule for L_𝐿_L\_italic_L _{1111} L 1-Regularized Logistic Regression” In IEEE Trans on Pat Anal and Mach Intel 44.9 IEEE, 2021, pp. 4544–4554
- “Enhancement pattern mapping technique for improving contrast-to-noise ratios and detectability of hepatobiliary tumors on multiphase computed tomography” In Med Phys 47.1 Wiley Online Library, 2020, pp. 64–74
- James O Ramsay and CJ1125714 Dalzell “Some tools for functional data analysis” In J. of the Royal Stat Soc Series B 53.3 Oxford University Press, 1991, pp. 539–561
- “Wavelet-domain regression and predictive inference in psychiatric neuroimaging” In The Ann of Appl Stat 9.2 NIH Public Access, 2015, pp. 1076
- “Spatial Bayesian variable selection with application to functional magnetic resonance imaging” In J. Am. Stat. Assoc. 102.478 Taylor & Francis, 2007, pp. 417–431
- Renáta Talská, Karel Hron and Tomáš Matys Grygar “Compositional scalar-on-function regression with application to sediment particle size distributions” In Math Geosciences 53.7 Springer, 2021, pp. 1667–1695
- Robert Tibshirani “Regression shrinkage and selection via the lasso” In J of the Roy Stat Soc: Series B 58.1 Wiley Online Library, 1996, pp. 267–288
- “Surveillance imaging and alpha fetoprotein for early detection of hepatocellular carcinoma in patients with cirrhosis: a meta-analysis” In Gastroenterology 154.6 Elsevier, 2018, pp. 1706–1718
- “Computational radiomics system to decode the radiographic phenotype” In Can Res 77.21 AACR, 2017, pp. e104–e107
- Xiao Wang, Hongtu Zhu and Alzheimer’s Disease Neuroimaging Initiative “Generalized scalar-on-image regression models via total variation” In J Am Stat Assoc 112.519 Taylor & Francis, 2017, pp. 1156–1168
- “Quantile function on scalar regression analysis for distributional data” In J Am Stat Assoc 115.529 Taylor & Francis, 2020, pp. 90–106
- “Advances in the Early Detection of Hepatobiliary Cancers” In Cancers 15.15 MDPI, 2023, pp. 3880
- “Functional principal component model for high-dimensional brain imaging” In NeuroImage 58 Elsevier, 2011, pp. 772–784