
Functional principal component analysis as an alternative to mixed-effect models for describing sparse repeated measures in presence of missing data (2402.10624v2)

Published 16 Feb 2024 in stat.ME

Abstract: Analyzing longitudinal data in health studies is challenging due to sparse and error-prone measurements, strong within-individual correlation, missing data and various trajectory shapes. While mixed-effect models (MM) effectively address these challenges, they remain parametric models and may incur computational costs. In contrast, Functional Principal Component Analysis (FPCA) is a non-parametric approach developed for regular and dense functional data that flexibly describes temporal trajectories at a potentially lower computational cost. This paper presents an empirical simulation study evaluating the behaviour of FPCA with sparse and error-prone repeated measures and its robustness under different missing data schemes in comparison with MM. The results show that FPCA is well-suited to data missing at random because of dropout, except in the scenarios with the most frequent and systematic dropout. Like MM, FPCA fails under a missing-not-at-random mechanism. FPCA was applied to describe the trajectories of four cognitive functions before clinical dementia and contrast them with those of matched controls in a case-control study nested in a population-based aging cohort. The average cognitive declines of future dementia cases showed a sudden divergence from those of their matched controls with a sharp acceleration 5 to 2.5 years prior to diagnosis.
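
As an illustration of the basic idea behind FPCA in the regular, dense setting the abstract mentions, the following is a minimal sketch and not the authors' procedure. It assumes every subject is observed on the same time grid, and it uses a plain eigendecomposition of the sample covariance matrix. The function name `fpca_dense` and the simulated trajectories are hypothetical choices made for this example. Sparse, irregular measurements and missing data, which are the paper's actual focus, would additionally require a smoothed covariance estimate and are not handled here.

```python
import numpy as np

def fpca_dense(Y, n_components=2):
    """Discrete FPCA sketch for curves observed on a common, dense grid.

    Y has shape (n_subjects, n_timepoints). Returns the mean curve,
    the leading eigenvectors (discretized eigenfunctions), the
    subject-level scores, and the share of variance each one explains.
    """
    mean = Y.mean(axis=0)
    centered = Y - mean
    # Sample covariance between time points.
    cov = centered.T @ centered / (Y.shape[0] - 1)
    # eigh returns eigenvalues in ascending order for symmetric matrices.
    eigvals, eigvecs = np.linalg.eigh(cov)
    order = np.argsort(eigvals)[::-1][:n_components]
    components = eigvecs[:, order]
    # Scores are projections of the centered curves on the components.
    scores = centered @ components
    explained = eigvals[order] / eigvals.sum()
    return mean, components, scores, explained

# Hypothetical simulated data: a common declining trend plus a random
# intercept and a random slope per subject, with small measurement error.
rng = np.random.default_rng(0)
t = np.linspace(0.0, 1.0, 50)
n_subjects = 200
intercepts = rng.normal(0.0, 2.0, n_subjects)
slopes = rng.normal(0.0, 1.0, n_subjects)
noise = rng.normal(0.0, 0.1, (n_subjects, t.size))
Y = 30.0 - 5.0 * t**2 + intercepts[:, None] + slopes[:, None] * t + noise

mean, components, scores, explained = fpca_dense(Y, n_components=2)
print("share of variance explained:", explained.round(3))
```

Because the simulated deviations from the mean are driven by two random effects only, two components are expected to capture nearly all of the variance. This mirrors how a small number of FPCA scores can play a role comparable to random effects in a mixed model.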
