Shrinkage for Extreme Partial Least-Squares (2403.09503v2)
Abstract: This work focuses on dimension-reduction techniques for modelling conditional extreme values. Specifically, we investigate the idea that extreme values of a response variable can be explained by nonlinear functions derived from linear projections of an input random vector. In this context, the estimation of projection directions is examined, as approached by the Extreme Partial Least Squares (EPLS) method--an adaptation of the original Partial Least Squares (PLS) method tailored to the extreme-value framework. Further, a novel interpretation of EPLS directions as maximum likelihood estimators is introduced, utilizing the von Mises-Fisher distribution applied to hyperballs. The dimension reduction process is enhanced through the Bayesian paradigm, enabling the incorporation of prior information into the projection direction estimation. The maximum a posteriori estimator is derived in two specific cases, elucidating it as a regularization or shrinkage of the EPLS estimator. We also establish its asymptotic behavior as the sample size approaches infinity. A simulation data study is conducted in order to assess the practical utility of our proposed method. This clearly demonstrates its effectiveness even in moderate data problems within high-dimensional settings. Furthermore, we provide an illustrative example of the method's applicability using French farm income data, highlighting its efficacy in real-world scenarios.
- Handbook of Mathematical Functions: With Formulas, Graphs, and Mathematical Tables. Dover Publications.
- Tail inverse regression: Dimension reduction for prediction of extremes. Bernoulli, 30(1):503–533.
- Statistics of Extremes: Theory and Applications. Wiley, Chichester, England, UK.
- Gaussian Regularized Sliced Inverse Regression. Statistics and Computing, 19(1):85–98.
- Regular Variation. Encyclopedia of Mathematics and its Applications. Cambridge University Press.
- Extreme Partial Least-Squares. Journal of Multivariate Analysis, 194:105101.
- Bayesian inverse regression for supervised dimension reduction with small datasets. Journal of Statistical Computation and Simulation, 91(14):2817–2832.
- Student Sliced Inverse Regression. Computational Statistics & Data Analysis, 113:441–456.
- Sparse Partial Least Squares Regression for Simultaneous Dimension Reduction and Variable Selection. Journal of the Royal Statistical Society: Series B, 72(1):3–25.
- Cook, R. D. (2007). Fisher Lecture: Dimension Reduction in Regression. Statistical Science, 22(1):1–26.
- Envelopes and Partial Least Squares Regression. Journal of the Royal Statistical Society: Series B, 75(5):851–877.
- A new sliced inverse regression method for multivariate response. Computational Statistics & Data Analysis, 77:285–299.
- On kernel smoothing for extremal quantile regression. Bernoulli, 19(5B):2557–2589.
- Inference for extremal regression with dependent heavy-tailed data. The Annals of Statistics, 51(5):2040–2066.
- Gardes, L. (2018). Tail dimension reduction for extreme quantile estimation. Extremes, 21(1):57–95.
- Geenens, G. (2011). Curse of dimensionality and related issues in nonparametric functional regression. Statistics Surveys, 5:30–43.
- Advanced topics in Sliced Inverse Regression. Journal of Multivariate Analysis, 188:104852.
- Functional Extreme Partial Least-Squares. hal-04488561.
- Extreme conditional expectile estimation in heavy-tailed heteroscedastic regression models. The Annals of Statistics, 49(6):3358–3382.
- Extreme Value Theory. Springer, New York, NY, USA.
- Harold, J. (1946). An invariant form for the prior probability in estimation problems. Proceedings of the Royal Society of London: Series A, 186(1007):453–461.
- Hill, B. M. (1975). A Simple General Approach to Inference About the Tail of a Distribution. The Annals of Statistics, 3(5):1163–1174.
- Factor copula models for multivariate data. Journal of Multivariate Analysis, 120:85–101.
- Li, K.-C. (1991). Sliced Inverse Regression for Dimension Reduction. Journal of the American Statistical Association, 86(414):316–327.
- Partial inverse regression. Biometrika, 94(3):615–625.
- SEPaLS: Shrinkage for Extreme Partial Least-Squares in R. R package.
- Supervised Dimension Reduction Using Bayesian Mixture Modeling. In Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, pages 501–508.
- Mardia, K. V. (1975). Distribution Theory for the Von Mises-Fisher Distribution and Its Application. In A Modern Course on Statistical Distributions in Scientific Work, pages 113–130. Springer Netherlands, Dordrecht.
- Directional Statistics. John Wiley & Sons, Chichester, England, UK.
- Multivariate Calibration. Wiley, Hoboken, NJ, USA.
- Partial Least Squares Estimator for Single-Index Models. Journal of the Royal Statistical Society: Series B, 62(4):763–771.
- Nelsen, R. B. (2007). An Introduction to Copulas. Springer, New York, NY, USA.
- A Bayesian analysis of directional data using the von Mises-Fisher distribution. Communications in Statistics-Simulation and Computation, 34(4):989–999.
- Portier, F. (2016). An Empirical Process View of Inverse Regression. Scandinavian Journal of Statistics, 43(3):827–844.
- Sufficient Dimension Reduction via Bayesian Mixture Modeling. Biometrics, 67(3):886–895.
- Bayesian estimation of the von-Mises Fisher mixture model with variational inference. IEEE Transactions on Pattern Analysis and Machine Intelligence, 36(9):1701–1715.
- Tibshirani, R. (1996). Regression Shrinkage and Selection via the Lasso. Journal of the Royal Statistical Society: Series B, 58(1):267–288.
- Shrinkage priors for Bayesian penalized regression. Journal of Mathematical Psychology, 89:31–50.
- Bayesian Sparse Partial Least Squares. Neural Computation, 25(12):3318–3339.
- On the Construction of Significance Tests on the Circle and the Sphere. Biometrika, 43(3):344–352.
- Wold, H. (1975). Soft Modelling by Latent Variables: The Non-Linear Iterative Partial Least Squares (NIPALS) Approach. Journal of Applied Probability, 12:117–142.
- Extreme Quantile Estimation Based on the Tail Single-index Model. Statistica Sinica, 32(2):893–914.