A Distributed Block Chebyshev-Davidson Algorithm for Parallel Spectral Clustering (2212.04443v2)
Abstract: We develop a distributed Block Chebyshev-Davidson algorithm to solve large-scale leading eigenvalue problems for spectral analysis in spectral clustering. First, the efficiency of the Chebyshev-Davidson algorithm relies on the prior knowledge of the eigenvalue spectrum, which could be expensive to estimate. This issue can be lessened by the analytic spectrum estimation of the Laplacian or normalized Laplacian matrices in spectral clustering, making the proposed algorithm very efficient for spectral clustering. Second, to make the proposed algorithm capable of analyzing big data, a distributed and parallel version has been developed with attractive scalability. The speedup by parallel computing is approximately equivalent to $\sqrt{p}$, where $p$ denotes the number of processes. {Numerical results will be provided to demonstrate its efficiency in spectral clustering and scalability advantage over existing eigensolvers used for spectral clustering in parallel computing environments.}
- Normalized cuts and image segmentation. IEEE Transactions on pattern analysis and machine intelligence, 22(8):888–905, 2000.
- Jeff Cheeger. A lower bound for the smallest eigenvalue of the laplacian. In Problems in analysis, pages 195–200. Princeton University Press, 2015.
- Algorithms for partitioning of graphs and computer logic based on eigenvectors of connection matrices. IBM Technical Disclosure Bulletin, 15(3):938–944, 1972.
- Miroslav Fiedler. Algebraic connectivity of graphs. Czechoslovak mathematical journal, 23(2):298–305, 1973.
- On the performance of spectral graph partitioning methods. Technical report, CARNEGIE-MELLON UNIV PITTSBURGH PA DEPT OF COMPUTER SCIENCE, 1994.
- Spectral partitioning works: Planar graphs and finite element meshes. In Proceedings of 37th conference on foundations of computer science, pages 96–105. IEEE, 1996.
- On spectral clustering: Analysis and an algorithm. Advances in neural information processing systems, 14, 2001.
- A chebyshev–davidson algorithm for large symmetric eigenproblems. SIAM Journal on Matrix Analysis and Applications, 29(3):954–971, 2007.
- Modern quantum chemistry: introduction to advanced electronic structure theory. Courier Corporation, 2012.
- Preconditioning orbital minimization method for planewave discretization. Multiscale Modeling & Simulation, 15(1):254–273, 2017.
- Interior eigensolver for sparse hermitian definite matrices based on zolotarev’s functions. Communications in Mathematical Sciences, 19(4):1113 – 1135, 2021.
- Chebyshev-filtered subspace iteration method free of sparse diagonalization for solving the kohn–sham equation. Journal of Computational Physics, 274:770–782, 2014.
- Yousef Saad. Numerical methods for large eigenvalue problems: revised edition. SIAM, 2011.
- Using chebyshev-filtered subspace iteration and windowing methods to solve the kohn-sham problem. Practical Aspects of Computational Chemistry I: An Overview of the Last Two Decades and Current Trends, pages 167–189, 2012.
- Accelerating large partial evd. SVD calculations by filtered block Davidson.
- Cun-Qiang Miao. A filtered-davidson method for large symmetric eigenvalue problems. East Asian Journal on Applied Mathematics, 7(1):21–37, 2017.
- The davidson method. SIAM Journal on Scientific Computing, 15(1):62–76, 1994.
- Gerard LG Sleijpen and Henk A Van der Vorst. A jacobi–davidson iteration method for linear eigenvalue problems. SIAM review, 42(2):267–293, 2000.
- Yunkai Zhou. A block chebyshev–davidson method with inner–outer restart for large eigenvalue problems. Journal of Computational Physics, 229(24):9188–9200, 2010.
- A block chebyshev-davidson method for linear response eigenvalue problems. Advances in Computational Mathematics, 42:1103–1128, 2016.
- Accelerating large partial evd/svd calculations by filtered block davidson methods. Science China Mathematics, 59:1635–1662, 2016.
- A block chebyshev-davidson method for solving symmetric eigenproblems. Journal on Numerica Methods and Computer Applications, 32(3):209, 2011.
- On flexible block chebyshev-davidson method for solving symmetric generalized eigenvalue problems. Advances in Computational Mathematics, 49(6):78, 2023.
- Cun-Qiang Miao. On chebyshev–davidson method for symmetric generalized eigenvalue problems. Journal of Scientific Computing, 85(3):53, 2020.
- A new subspace iteration algorithm for solving generalized eigenvalue problems. arXiv preprint arXiv:2212.14520, 2022.
- Patrice Koehl. Large eigenvalue problems in coarse-grained dynamic analyses of supramolecular systems. Journal of chemical theory and computation, 14(7):3903–3919, 2018.
- Block iterative eigensolvers for sequences of correlated eigenvalue problems. Computer physics communications, 184(11):2478–2488, 2013.
- Parallel self-consistent-field calculations via chebyshev-filtered subspace acceleration. Physical Review E, 74(6):066704, 2006.
- Self-consistent equations including exchange and correlation effects. Physical review, 140(4A):A1133, 1965.
- Elsi: A unified software interface for kohn–sham electronic structure solvers. Computer Physics Communications, 222:267–285, 2018.
- A cubic scaling algorithm for excited states calculations in particle–particle random phase approximation. Journal of Computational Physics, 340:297–308, 2017.
- Reorthogonalization and stable algorithms for updating the gram-schmidt qr factorization. Mathematics of Computation, 30(136):772–795, 1976.
- Lynn Elliot Cannon. A cellular computer to implement the Kalman filter algorithm. Montana State University, 1969.
- Robert A Van De Geijn and Jerrell Watts. Summa: Scalable universal matrix multiplication algorithm. Concurrency: Practice and Experience, 9(4):255–274, 1997.
- A three-dimensional approach to parallel matrix multiplication. IBM Journal of Research and Development, 39(5):575–582, 1995.
- Communication-optimal parallel 2.5 d matrix multiplication and lu factorization algorithms. In European Conference on Parallel Processing, pages 90–109. Springer, 2011.
- Exploiting multiple levels of parallelism in sparse matrix-matrix multiplication. SIAM Journal on Scientific Computing, 38(6):C624–C651, 2016.
- Parallel sparse matrix-matrix multiplication and indexing: Implementation and experiments. SIAM Journal on Scientific Computing, 34(4):C170–C191, 2012.
- Parallel matrix multiplication: A systematic journey. SIAM Journal on Scientific Computing, 38(6):C748–C781, 2016.
- Distributed-memory parallel algorithms for sparse times tall-skinny-dense matrix multiplication. In Proceedings of the ACM International Conference on Supercomputing, pages 431–442, 2021.
- Mpi-faun: An mpi-based framework for alternating-updating nonnegative matrix factorization. IEEE Transactions on Knowledge and Data Engineering, 30(3):544–558, 2017.
- Communication-optimal parallel and sequential qr and lu factorizations. SIAM Journal on Scientific Computing, 34(1):A206–A239, 2012.
- LAPACK Users’ Guide. Society for Industrial and Applied Mathematics, Philadelphia, PA, third edition, 1999.
- ScaLAPACK Users’ Guide. Society for Industrial and Applied Mathematics, Philadelphia, PA, 1997.
- Andrew V Knyazev. Toward the optimal preconditioned eigensolver: Locally optimal block preconditioned conjugate gradient method. SIAM journal on scientific computing, 23(2):517–541, 2001.
- ARPACK users’ guide: solution of large-scale eigenvalue problems with implicitly restarted Arnoldi methods. SIAM, 1998.
- Power iteration clustering. In ICML, 2010.
- Parallel spectral graph partitioning. NVIDIA, Santa Clara, CA, USA, Tech. Rep., NVR-2016-001, 2016.
- Parallel spectral clustering in distributed systems. IEEE transactions on pattern analysis and machine intelligence, 33(3):568–586, 2010.
- p-pic: parallel power iteration clustering for big data. Journal of Parallel and Distributed computing, 73(3):352–359, 2013.
- Designing an efficient parallel spectral clustering algorithm on multi-core processors in julia. Journal of Parallel and Distributed Computing, 138:211–221, 2020.
- Collective communication: theory, practice, and experience. Concurrency and Computation: Practice and Experience, 19(13):1749–1783, 2007.
- Mpi. jl: Julia bindings for the message passing interface. In Proceedings of the JuliaCon Conferences, volume 1, page 68, 2021.
- Open mpi: Goals, concept, and design of a next generation mpi implementation. In European Parallel Virtual Machine/Message Passing Interface Users’ Group Meeting, pages 97–104. Springer, 2004.
- Comparing partitions. Journal of classification, 2:193–218, 1985.
- Comparing community structure identification. Journal of statistical mechanics: Theory and experiment, 2005(09):P09008, 2005.
- Andrew Knyazev. Recent implementations, applications, and extensions of the locally optimal block preconditioned conjugate gradient method (lobpcg). arXiv preprint arXiv:1708.08354, 2017.
- PETSc Web page. https://petsc.org/, 2023.
- PETSc/TAO users manual. Technical Report ANL-21/39 - Revision 3.20, Argonne National Laboratory, 2023.
- Efficient management of parallelism in object oriented numerical software libraries. In E. Arge, A. M. Bruaset, and H. P. Langtangen, editors, Modern Software Tools in Scientific Computing, pages 163–202. Birkhäuser Press, 1997.
- Traffic data repository at the {{\{{WIDE}}\}} project. In 2000 USENIX Annual Technical Conference (USENIX ATC 00), 2000.
- Design, generation, and validation of extreme scale power-law graphs. arXiv preprint arXiv:1803.01281, 2018.