A Survey on Hardware Accelerators for Large Language Models (2401.09890v1)

Published 18 Jan 2024 in cs.AR, cs.CL, and cs.LG

Abstract: LLMs have emerged as powerful tools for natural language processing tasks, revolutionizing the field with their ability to understand and generate human-like text. As the demand for more sophisticated LLMs continues to grow, there is a pressing need to address the computational challenges associated with their scale and complexity. This paper presents a comprehensive survey on hardware accelerators designed to enhance the performance and energy efficiency of LLMs. By examining a diverse range of accelerators, including GPUs, FPGAs, and custom-designed architectures, we explore the landscape of hardware solutions tailored to meet the unique computational demands of LLMs. The survey encompasses an in-depth analysis of architecture, performance metrics, and energy efficiency considerations, providing valuable insights for researchers, engineers, and decision-makers aiming to optimize the deployment of LLMs in real-world applications.
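
The survey compares accelerators chiefly along two axes: throughput and energy efficiency. As a minimal illustrative sketch (not taken from the paper; all names and numbers below are hypothetical placeholders), the snippet shows how such metrics, tokens per second and tokens per joule, can be derived from measured runtime and average power for different hardware platforms.

```python
# Illustrative sketch only: comparing hypothetical accelerator results on the
# two axes the survey analyzes, throughput (tokens/s) and energy efficiency
# (tokens/J). The figures below are made-up placeholders, not paper data.

from dataclasses import dataclass


@dataclass
class AcceleratorResult:
    name: str            # e.g. "GPU baseline", "FPGA overlay", "custom ASIC"
    tokens_generated: int
    runtime_s: float     # wall-clock inference time in seconds
    avg_power_w: float   # average board power draw in watts

    @property
    def throughput(self) -> float:
        """Tokens generated per second."""
        return self.tokens_generated / self.runtime_s

    @property
    def energy_efficiency(self) -> float:
        """Tokens generated per joule of energy consumed."""
        return self.tokens_generated / (self.avg_power_w * self.runtime_s)


# Hypothetical measurements purely for illustration.
results = [
    AcceleratorResult("GPU baseline", tokens_generated=10_000,
                      runtime_s=20.0, avg_power_w=300.0),
    AcceleratorResult("FPGA overlay", tokens_generated=10_000,
                      runtime_s=35.0, avg_power_w=60.0),
]

for r in results:
    print(f"{r.name}: {r.throughput:.1f} tokens/s, "
          f"{r.energy_efficiency:.2f} tokens/J")
```

Note that a platform with lower raw throughput (here the hypothetical FPGA overlay) can still come out ahead on tokens per joule, which is the trade-off the survey's energy-efficiency analysis highlights.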
