Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Accelerating Neural Networks for Large Language Models and Graph Processing with Silicon Photonics (2401.06885v1)

Published 12 Jan 2024 in cs.AR and cs.LG

Abstract: In the rapidly evolving landscape of artificial intelligence, LLMs and graph processing have emerged as transformative technologies for NLP, computer vision, and graph-structured data applications. However, the complex structures of these models pose challenges for acceleration on conventional electronic platforms. In this paper, we describe novel hardware accelerators based on silicon photonics to accelerate transformer neural networks that are used in LLMs and graph neural networks for graph data processing. Our analysis demonstrates that both hardware accelerators achieve at least 10.2x throughput improvement and 3.8x better energy efficiency over multiple state-of-the-art electronic hardware accelerators designed for LLMs and graph processing.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Salma Afifi (6 papers)
  2. Febin Sunny (16 papers)
  3. Mahdi Nikdast (38 papers)
  4. Sudeep Pasricha (75 papers)
Citations (1)

Summary

We haven't generated a summary for this paper yet.