Efficient Sparse PCA via Block-Diagonalization (2410.14092v2)
Abstract: Sparse Principal Component Analysis (Sparse PCA) is a pivotal tool in data analysis and dimensionality reduction. However, Sparse PCA is a challenging problem in both theory and practice: it is known to be NP-hard and current exact methods generally require exponential runtime. In this paper, we propose a novel framework to efficiently approximate Sparse PCA by (i) approximating the general input covariance matrix with a re-sorted block-diagonal matrix, (ii) solving the Sparse PCA sub-problem in each block, and (iii) reconstructing the solution to the original problem. Our framework is simple and powerful: it can leverage any off-the-shelf Sparse PCA algorithm and achieve significant computational speedups, with a minor additive error that is linear in the approximation error of the block-diagonal matrix. Suppose $g(k, d)$ is the runtime of an algorithm (approximately) solving Sparse PCA in dimension $d$ and with sparsity constant $k$. Our framework, when integrated with this algorithm, reduces the runtime to $\mathcal{O}\left(\frac{d}{d\star} \cdot g(k, d\star) + d2\right)$, where $d\star \leq d$ is the largest block size of the block-diagonal matrix. For instance, integrating our framework with the Branch-and-Bound algorithm reduces the complexity from $g(k, d) = \mathcal{O}(k3\cdot dk)$ to $\mathcal{O}(k3\cdot d \cdot (d\star){k-1})$, demonstrating exponential speedups if $d\star$ is small. We perform large-scale evaluations on many real-world datasets: for exact Sparse PCA algorithm, our method achieves an average speedup factor of 100.50, while maintaining an average approximation error of 0.61%; for approximate Sparse PCA algorithm, our method achieves an average speedup factor of 6.00 and an average approximation error of -0.91%, meaning that our method oftentimes finds better solutions.
- Primal-dual interior-point methods for semidefinite programming: convergence rates, stability and numerical results. SIAM Journal on Optimization, 8(3):746–768, 1998.
- Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays. Proceedings of the National Academy of Sciences, 96(12):6745–6750, 1999.
- High-dimensional analysis of semidefinite relaxations for sparse principal components. In IEEE International Symposium on Information Theory, pages 2454–2458, 2008.
- Sparse pca via bipartite matchings. Advances in Neural Information Processing Systems, 28, 2015.
- Certifiably optimal sparse principal component analysis. Mathematical Programming Computation, 11:381–420, 2019.
- Solving large-scale sparse pca to certifiable (near) optimality. Journal of Machine Learning Research, 23(13):1–35, 2022.
- On the worst-case approximability of sparse pca. arXiv preprint arXiv:1507.05950, 2015.
- Selective inference for k-means clustering. arXiv preprint arXiv:2203.15267, 2022.
- Approximation algorithms for sparse principal component analysis. arXiv preprint arXiv:2006.12748, 2020.
- Adaptive robust estimation in sparse vector model. The Annals of Statistics, 49(3):1347–1377, 2021.
- Sparse pca with multiple components. arXiv preprint arXiv:2209.14790, 2022.
- Alberto Del Pia. Sparse pca on fixed-rank matrices. Mathematical Programming, pages 1–19, 2022.
- Sparse pca via covariance thresholding. Journal of Machine Learning Research, 17(141):1–41, 2016.
- Marcel Dettling. Bagboosting for tumor classification with gene expression data. Bioinformatics, 20(18):3583–3593, 2004.
- Block-diagonal covariance selection for high-dimensional gaussian graphical models. Journal of the American Statistical Association, 113(521):306–314, 2018.
- Using ℓ1subscriptℓ1\ell_{1}roman_ℓ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT-relaxation and integer programming to obtain dual bounds for sparse pca. Operations Research, 70(3):1914–1932, 2022a.
- Solving sparse principal component analysis with global support. Mathematical Programming, pages 1–39, 2022b.
- Subexponential-time algorithms for sparse pca. Foundations of Computational Mathematics, pages 1–50, 2023.
- Sparse pca: algorithms, adversarial perturbations and certificates. In 2020 IEEE 61st Annual Symposium on Foundations of Computer Science (FOCS), pages 553–564. IEEE, 2020.
- Robust subspace segmentation with block-diagonal prior. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 3818–3825, 2014.
- Competitive baseline methods set new standards for the nips 2003 feature selection benchmark. Pattern recognition letters, 28(12):1438–1444, 2007.
- Projection algorithms for nonconvex minimization with application to sparse principal component analysis. Journal of Global Optimization, 65:657–676, 2016.
- Hyperattention: Long-context attention in near-linear time. In The Twelfth International Conference on Learning Representations, 2023.
- Sparse principal component analysis in cancer research. Translational cancer research, 3(3):182, 2014.
- Sparsity meets correlation in gaussian sequence model, 2023.
- Optimal estimation of the null distribution in large-scale inference, 2024.
- Jundong Li. Feature selection datasets at asu. https://jundongl.github.io/scikit-feature/OLD/datasets_old.html, 2020. Accessed: 2024-09-27.
- Exact and approximation algorithms for sparse pca. arXiv preprint arXiv:2008.12438, 2020.
- Principal component analysis based methods in bioinformatics studies. Briefings in bioinformatics, 12(6):714–722, 2011.
- Malik Magdon-Ismail. Np-hardness and inapproximability of sparse PCA. Information Processing Letters, 126:35–38, 2017.
- Robust and tuning-free sparse linear regression via square-root slope. arXiv preprint arXiv:2210.16808, 2022.
- Sparse pca through low-rank approximations. In International Conference on Machine Learning, pages 747–755. PMLR, 2013.
- Roman Vershynin. High-dimensional probability: An introduction with applications in data science, volume 47. Cambridge university press, 2018.
- Truncated power method for sparse eigenvalue problems. Journal of Machine Learning Research, 14(4), 2013.
- Large-scale sparse principal component analysis with application to text data. Advances in Neural Information Processing Systems, 24, 2011.
- A technical review of canonical correlation analysis for neuroscience applications. Human brain mapping, 41(13):3807–3833, 2020.
- A selective overview of sparse principal component analysis. Proceedings of the IEEE, 106(8):1311–1320, 2018.
- Sparse principal component analysis. Journal of computational and graphical statistics, 15(2):265–286, 2006.