Impact of MF’s stack data movement on performance relative to LL and RL
Determine whether the data movement costs associated with managing the multifrontal (MF) method’s stack of update matrices—specifically the packing and related operations described in line 18 of Algorithm 1—are a primary cause of MF’s slower performance relative to the left-looking (LL) and right-looking (RL) serial supernodal sparse Cholesky factorization algorithms when using multithreaded BLAS.
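To make the kind of data movement at issue concrete, here is a minimal sketch of an update-matrix stack. It assumes, purely for illustration, that each update matrix is stored as the column-major packed lower triangle of the trailing block of a dense frontal matrix. The names `push_update`, `pop_update`, `stack`, and `sizes` are hypothetical, and the layout and the exact operations in line 18 of Algorithm 1 may differ from what is shown here.

```python
# Illustrative sketch only: a multifrontal-style stack of update matrices.
# Assumption: each update matrix is the lower triangle of the trailing block
# front[k:, k:], packed column by column into one flat list.

def push_update(stack, sizes, front, k):
    """Copy the lower triangle of front[k:, k:] onto the stack.

    Returns the number of entries moved, which is the copy cost of this push.
    """
    m = len(front) - k                # order of the update matrix
    for j in range(m):                # walk the trailing block column by column
        for i in range(j, m):         # keep entries on or below the diagonal
            stack.append(front[k + i][k + j])
    sizes.append(m)                   # remember the order for the matching pop
    return m * (m + 1) // 2


def pop_update(stack, sizes):
    """Remove the top update matrix and unpack it into a dense lower triangle."""
    m = sizes.pop()
    count = m * (m + 1) // 2
    start = len(stack) - count
    packed = stack[start:]
    del stack[start:]
    update = [[0.0] * m for _ in range(m)]
    pos = 0
    for j in range(m):
        for i in range(j, m):
            update[i][j] = packed[pos]
            pos += 1
    return update
```

Every push and pop in this sketch touches m(m+1)/2 entries without performing any floating-point arithmetic, which is the kind of overhead the question asks about; LL and RL have no such stack to maintain.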
References
We conjecture that the costs of the data movement associated with MF's stack (see line 18 in Algorithm 1) are hurting the performance of MF relative to LL and RL.
— Some new techniques to use in serial sparse Cholesky factorization algorithms
(2409.13090 - Karsavuran et al., 19 Sep 2024) in Section 3.2 (Results)