Adaptive functional principal components analysis (2306.16091v4)
Abstract: Functional data analysis almost always involves smoothing discrete observations into curves, because they are never observed in continuous time and rarely without error. Although smoothing parameters affect the subsequent inference, data-driven methods for selecting these parameters are not well-developed, frustrated by the difficulty of using all the information shared by curves while being computationally efficient. On the one hand, smoothing individual curves in an isolated, albeit sophisticated way, ignores useful signals present in other curves. On the other hand, bandwidth selection by automatic procedures such as cross-validation after pooling all the curves together quickly become computationally unfeasible due to the large number of data points. In this paper we propose a new data-driven, adaptive kernel smoothing, specifically tailored for functional principal components analysis through the derivation of sharp, explicit risk bounds for the eigen-elements. The minimization of these quadratic risk bounds provide refined, yet computationally efficient bandwidth rules for each eigen-element separately. Both common and independent design cases are allowed. Rates of convergence for the estimators are derived. An extensive simulation study, designed in a versatile manner to closely mimic the characteristics of real data sets supports our methodological contribution. An illustration on a real data application is provided.
- Balança, P. (2015). Some sample path properties of multifractional Brownian motion. Stochastic Processes Appl., 125(10):3823–3850.
- Some new asymptotic theory for least squares series: pointwise and uniform results. J. Econometrics, 186(2):345–366.
- Common functional principal components. Ann. Statist., 37(1):1–34.
- Bosq, D. (2000). Linear processes in function spaces, volume 149 of Lecture Notes in Statistics. Springer-Verlag, New York. Theory and applications.
- Unexpected properties of bandwidth choice when smoothing discrete data for constructing a functional data classifier. Ann. Statist., 41(6):2739–2767.
- Gaïffas, S. (2007). On pointwise adaptive curve estimation based on inhomogeneous data. ESAIM: Probability and Statistics, 11:344–364.
- Learning the smoothness of noisy curves with application to online curve estimation. Electronic Journal of Statistics, 16(1):1485–1560.
- Adaptive optimal estimation of irregular mean and covariance functions. arxiv:2108.06507v2.
- On properties of functional principal components analysis. Journal of the Royal Statistical Society. Series B: Statistical Methodology, 68(1):109–126.
- Theory for high-order bounds in functional principal components analysis. Math. Proc. Cambridge Philos. Soc., 146(1):225–256.
- Properties of principal component methods for functional and longitudinal data analysis. Annals of Statistics, 34(3):1493–1517.
- Inference for functional data with applications. Springer Series in Statistics. Springer, New York.
- Local intrinsic stationarity and its inference. Ann. Statist., 44(5):2058 – 2088.
- Relative perturbation bounds with applications to empirical covariance operators. Advances in Mathematics, 412:108808.
- Uniform convergence rates for nonparametric regression and principal component analysis in functional/longitudinal data. Ann. Statist., 38(6):3321–3351.
- Functional data analysis with rough sample paths? Journal of Nonparametric Statistics, 36(1):4–22.
- Nonparametric estimation for SDE with sparsely sampled paths: An FDA perspective. Stochastic Processes and their Applications, 167:104239.
- Estimation of Heteroscedasticity in Regression Analysis. The Annals of Statistics, 15(2):610 – 625.
- Superconsistent Estimation of Points of Impact in Non-Parametric Regression with Functional Predictors. Journal of the Royal Statistical Society Series B: Statistical Methodology, 82(4):1115–1140.
- Functional data analysis. Springer Series in Statistics. Springer, New York, second edition.
- Sparsely observed functional time series: estimation and prediction. Electronic Journal of Statistics, 14(1):1137 – 1210.
- Tsybakov, A. B. (2009). Introduction to nonparametric estimation. Springer Series in Statistics. Springer, New York.
- From sparse to dense functional data and beyond. Annals of Statistics, 44(5):2281–2321.
- fdapace: Functional Data Analysis and Empirical Dynamics. R package version 0.5.9.