
LRAMM -- Low precision approximates GEMM via RSVD (2405.16917v1)

Published 27 May 2024 in math.NA, cs.NA, and cs.PF

Abstract: Accelerating matrix multiplication has been a research hotspot across many domains. Because many applications tolerate small errors, approximate matrix multiplication can deliver significant performance gains with little loss of precision. In this paper, we propose LRAMM, a high-performance approximate matrix multiplication algorithm that combines mixed-precision quantized matrix multiplication with randomized SVD (RSVD); by exploiting low-rank matrix decomposition, it further improves efficiency while staying within the error range of low-precision matrix multiplication.
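The abstract outlines the core idea: compress each input matrix with a randomized SVD, then perform the remaining products in low precision. Below is a minimal NumPy sketch of that combination, not the paper's exact algorithm; the rank, the oversampling amount, the symmetric per-matrix int8 quantization, and the choice to quantize only the small rank-by-rank core are illustrative assumptions.

```python
import numpy as np

def rsvd(A, rank, oversample=8, seed=0):
    """Randomized SVD: project A onto a random subspace, orthonormalize,
    then take an exact SVD of the small projected matrix."""
    rng = np.random.default_rng(seed)
    k = rank + oversample
    Omega = rng.standard_normal((A.shape[1], k))
    Q, _ = np.linalg.qr(A @ Omega)                 # orthonormal basis for range(A)
    U_small, s, Vt = np.linalg.svd(Q.T @ A, full_matrices=False)
    return (Q @ U_small)[:, :rank], s[:rank], Vt[:rank, :]

def quantize_int8(X):
    """Symmetric per-matrix int8 quantization; returns integer values and a scale."""
    scale = max(np.abs(X).max() / 127.0, 1e-12)
    return np.round(X / scale).astype(np.int8), scale

def lramm_like_gemm(A, B, rank):
    """Illustrative approximate GEMM: low-rank factors via RSVD, then a
    low-precision (int8) product of the small rank-by-rank core."""
    Ua, sa, Vta = rsvd(A, rank)
    Ub, sb, Vtb = rsvd(B, rank)
    # A @ B is approximated as Ua diag(sa) (Vta @ Ub) diag(sb) Vtb;
    # only the small core Vta @ Ub is computed with quantized integers here.
    qV, sV = quantize_int8(Vta)
    qU, sU = quantize_int8(Ub)
    core = (qV.astype(np.int32) @ qU.astype(np.int32)) * (sV * sU)
    return (Ua * sa) @ core @ (sb[:, None] * Vtb)

# Usage: relative error of the approximate product on a random instance.
A = np.random.randn(256, 256)
B = np.random.randn(256, 256)
err = np.linalg.norm(lramm_like_gemm(A, B, rank=64) - A @ B) / np.linalg.norm(A @ B)
```

When the rank is well below the matrix dimensions, the dominant cost shifts from the full product to the sketching step and the small factor products, which is where the low-precision arithmetic is applied in this sketch.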

