Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

MSREP: A Fast yet Light Sparse Matrix Framework for Multi-GPU Systems (2209.07552v1)

Published 15 Sep 2022 in cs.DC

Abstract: Sparse linear algebra kernels play a critical role in numerous applications, covering from exascale scientific simulation to large-scale data analytics. Offloading linear algebra kernels on one GPU will no longer be viable in these applications, simply because the rapidly growing data volume may exceed the memory capacity and computing power of a single GPU. Multi-GPU systems nowadays being ubiquitous in supercomputers and data-centers present great potentials in scaling up large sparse linear algebra kernels. In this work, we design a novel sparse matrix representation framework for multi-GPU systems called MSREP, to scale sparse linear algebra operations based on our augmented sparse matrix formats in a balanced pattern. Different from dense operations, sparsity significantly intensifies the difficulty of distributing the computation workload among multiple GPUs in a balanced manner. We enhance three mainstream sparse data formats -- CSR, CSC, and COO, to enable fine-grained data distribution. We take sparse matrix-vector multiplication (SpMV) as an example to demonstrate the efficiency of our MSREP framework. In addition, MSREP can be easily extended to support other sparse linear algebra kernels based on the three fundamental formats (i.e., CSR, CSC and COO).

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (8)
  1. Jieyang Chen (25 papers)
  2. Chenhao Xie (10 papers)
  3. Jesun S Firoz (2 papers)
  4. Jiajia Li (43 papers)
  5. Shuaiwen Leon Song (35 papers)
  6. Kevin Barker (16 papers)
  7. Mark Raugas (7 papers)
  8. Ang Li (472 papers)
Citations (3)

Summary

We haven't generated a summary for this paper yet.