Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Compressed Multi-Row Storage Format for Sparse Matrices on Graphics Processing Units (1203.2946v2)

Published 13 Mar 2012 in physics.comp-ph and cs.DC

Abstract: A new format for storing sparse matrices is proposed for efficient sparse matrix-vector (SpMV) product calculation on modern graphics processing units (GPUs). This format extends the standard compressed row storage (CRS) format and can be quickly converted to and from it. Computational performance of two SpMV kernels for the new format is determined for over 130 sparse matrices on Fermi-class and Kepler-class GPUs and compared with that of five existing generic algorithms and industrial implementations, including Nvidia cuSparse CSR and HYB kernels. We found the speedup of up to $\approx 60%$ over the best of the five alternative kernels.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Zbigniew Koza (13 papers)
  2. Maciej Matyka (18 papers)
  3. Sebastian Szkoda (3 papers)
  4. Łukasz Mirosław (2 papers)
Citations (30)

Summary

We haven't generated a summary for this paper yet.