Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
156 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Empirical Bayes Covariance Decomposition, and a solution to the Multiple Tuning Problem in Sparse PCA (2312.03274v1)

Published 6 Dec 2023 in stat.ME and stat.ML

Abstract: Sparse Principal Components Analysis (PCA) has been proposed as a way to improve both interpretability and reliability of PCA. However, use of sparse PCA in practice is hindered by the difficulty of tuning the multiple hyperparameters that control the sparsity of different PCs (the "multiple tuning problem", MTP). Here we present a solution to the MTP using Empirical Bayes methods. We first introduce a general formulation for penalized PCA of a data matrix $\mathbf{X}$, which includes some existing sparse PCA methods as special cases. We show that this formulation also leads to a penalized decomposition of the covariance (or Gram) matrix, $\mathbf{X}T\mathbf{X}$. We introduce empirical Bayes versions of these penalized problems, in which the penalties are determined by prior distributions that are estimated from the data by maximum likelihood rather than cross-validation. The resulting "Empirical Bayes Covariance Decomposition" provides a principled and efficient solution to the MTP in sparse PCA, and one that can be immediately extended to incorporate other structural assumptions (e.g. non-negative PCA). We illustrate the effectiveness of this approach on both simulated and real data examples.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (39)
  1. Adachi, K. and N. T. Trendafilov (2016). Sparse principal component analysis subject to prespecified cardinality of loadings. Computational Statistics 31(4), 1403–1427.
  2. Bauer, F. L. (1957). Das verfahren der treppeniteration und verwandte verfahren zur lösung algebraischer eigenwertprobleme. Zeitschrift für angewandte Mathematik und Physik ZAMP 8(3), 214–235.
  3. On the Bures–Wasserstein distance between positive definite matrices. Expositiones Mathematicae 37(2), 165–191.
  4. Spectral Methods for Data Science: A Statistical Perspective. Foundations and Trends® in Machine Learning 14(5), 566–806.
  5. A Direct Formulation for Sparse PCA Using Semidefinite Programming. In Advances in Neural Information Processing Systems, Volume 17. MIT Press.
  6. Convex and Semi-Nonnegative Matrix Factorizations. IEEE Transactions on Pattern Analysis and Machine Intelligence 32(1), 45–55.
  7. Economist (2022, December). America’s best firms…and the rest: New winners and losers are emerging after three turbulent years. The Economist 445(9324).
  8. Fama, E. F. and K. R. French (1993). Common risk factors in the returns on stocks and bonds. Journal of financial economics 33(1), 3–56.
  9. Golub, G. H. and C. F. Van Loan (2013). Matrix Computations (Fourth edition ed.). Johns Hopkins Studies in the Mathematical Sciences. Baltimore: The Johns Hopkins University Press.
  10. A Guide for Sparse PCA: Model Comparison and Applications. Psychometrika.
  11. Jolliffe, I. T. (2002). Principal Component Analysis (2nd ed ed.). Springer Series in Statistics. New York: Springer.
  12. Generalized power method for sparse principal component analysis. Journal of Machine Learning Research 11(15), 517–553.
  13. A flexible empirical Bayes approach to multiple linear regression and connections with penalized regression. arXiv:2208.10910.
  14. Robust binary component decompositions. In ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.  1–5. IEEE.
  15. Kueng, R. and J. A. Tropp (2021). Binary component decomposition part i: the positive-semidefinite case. SIAM Journal on Mathematics of Data Science 3(2), 544–572.
  16. Topic modeling on triage notes with semiorthogonal nonnegative matrix factorization. Journal of the American Statistical Association 116(536), 1609–1624.
  17. Dissecting tumor transcriptional heterogeneity from single-cell rna-seq data by generalized binary covariance decomposition. bioRxiv, 2023–08.
  18. Sparse principal component analysis and iterative thresholding. The Annals of Statistics 41(2).
  19. Mackey, L. (2008). Deflation Methods for Sparse PCA. In Advances in Neural Information Processing Systems, Volume 21. Curran Associates, Inc.
  20. Proximal algorithms. Foundations and trends® in Optimization 1(3), 127–239.
  21. LIII. On lines and planes of closest fit to systems of points in space. The London, Edinburgh, and Dublin Philosophical Magazine and Journal of Science 2(11), 559–572.
  22. Principal components analysis for functional data. Functional data analysis, 147–172.
  23. Vintage factor analysis with Varimax performs statistical inference. Journal of the Royal Statistical Society Series B: Statistical Methodology 85(4), 1037–1060.
  24. Sparse principal component analysis via regularized low rank matrix approximation. Journal of Multivariate Analysis 99(6), 1015–1034.
  25. Additive clustering: Representation of similarities as combinations of discrete overlapping properties. Psychological Review 86(2), 87–123.
  26. Overlapping community detection via semi-binary matrix factorization: Identifiability and algorithms. IEEE Transactions on Signal Processing 70, 4321–4336.
  27. A flexible framework for sparse simultaneous component based data integration. BMC bioinformatics 12(1), 1–17.
  28. A simple new approach to variable selection in regression, with application to genetic fine mapping. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 82(5), 1273–1300.
  29. Empirical bayes matrix factorization. Journal of Machine Learning Research 22(120), 1–40.
  30. Wilkinson, J. H. J. H. (1965). The Algebraic Eigenvalue Problem,. Oxford,: Clarendon Press.
  31. Willwerscheid, J. (2021). Empirical Bayes Matrix Factorization: Methods and Applications. Ph. D. thesis, The University of Chicago.
  32. ebnm: An r package for solving the empirical bayes normal means problem using a variety of prior families. arXiv preprint arXiv:2110.00152.
  33. A penalized matrix decomposition, with applications to sparse principal components and canonical correlation analysis. Biostatistics 10(3), 515–534.
  34. Flexible signal denoising via flexible empirical bayes shrinkage. The Journal of Machine Learning Research 22(1), 4153–4180.
  35. Nonnegative sparse pca. Advances in neural information processing systems 19.
  36. Empirical Bayes PCA in high dimensions. Journal of the Royal Statistical Society: Series B (Statistical Methodology), rssb.12490.
  37. Regularization and variable selection via the elastic net. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 67(2), 301–320.
  38. Sparse Principal Component Analysis. Journal of Computational and Graphical Statistics 15(2), 265–286.
  39. A Selective Overview of Sparse Principal Component Analysis. Proceedings of the IEEE 106(8), 1311–1320.

Summary

We haven't generated a summary for this paper yet.