A$^3$: Accelerating Attention Mechanisms in Neural Networks with Approximation (2002.10941v1)

Published 22 Feb 2020 in cs.DC and cs.LG

Abstract: With the increasing computational demands of neural networks, many hardware accelerators for neural networks have been proposed. Existing accelerators often focus on popular network types such as convolutional neural networks (CNNs) and recurrent neural networks (RNNs); however, little attention has been paid to attention mechanisms, an emerging neural network primitive that enables neural networks to retrieve the most relevant information from a knowledge base, external memory, or past states. The attention mechanism is widely adopted by many state-of-the-art neural networks for computer vision, natural language processing, and machine translation, and accounts for a large portion of total execution time. We observe that today's practice of implementing this mechanism using matrix-vector multiplication is suboptimal, as the attention mechanism is semantically a content-based search in which a large portion of the computation ends up not being used. Based on this observation, we design and architect A$^3$, which accelerates attention mechanisms in neural networks with algorithmic approximation and hardware specialization. Our proposed accelerator achieves multiple orders of magnitude improvement in energy efficiency (performance/watt) as well as substantial speedup over state-of-the-art conventional hardware.
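
The observation in the abstract, that attention behaves like a content-based search in which most of the computation barely affects the result, can be illustrated with a small sketch. This is not the paper's algorithm or hardware design: the function names `attention` and `topk_attention`, the cutoff `k`, and the toy data are all assumptions made for illustration. The sketch also still scores every key, so it shows only that a few rows dominate the output, not how an accelerator would avoid the discarded work.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def attention(query, keys, values):
    """Exact soft attention: score every key, normalize, mix all values."""
    weights = softmax([dot(query, key) for key in keys])
    dim = len(values[0])
    return [sum(w * v[d] for w, v in zip(weights, values)) for d in range(dim)]


def topk_attention(query, keys, values, k):
    """Hypothetical approximation: keep only the k highest-scoring rows.

    Illustrative only; the selection rule here is an assumption and is
    not taken from the paper.
    """
    scores = [dot(query, key) for key in keys]
    top = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:k]
    weights = softmax([scores[i] for i in top])
    dim = len(values[0])
    return [sum(w * values[i][d] for w, i in zip(weights, top)) for d in range(dim)]


# Toy data (assumed): two keys align with the query, two do not.
query = [4.0, 0.0]
keys = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0], [0.9, 0.1]]
values = [[1.0, 0.0], [0.0, 1.0], [5.0, 5.0], [1.0, 1.0]]

exact = attention(query, keys, values)
approx = topk_attention(query, keys, values, k=2)
print("exact :", exact)
print("approx:", approx)
```

On this toy input, the two rows with the highest scores carry almost all of the softmax weight, so dropping the other two changes the output only slightly. How close the approximation is in general depends on how peaked the score distribution is.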

Authors (11)
  1. Tae Jun Ham (4 papers)
  2. Sung Jun Jung (1 paper)
  3. Seonghak Kim (22 papers)
  4. Young H. Oh (1 paper)
  5. Yeonhong Park (7 papers)
  6. Yoonho Song (1 paper)
  7. Jung-Hun Park (2 papers)
  8. Sanghee Lee (5 papers)
  9. Kyoung Park (2 papers)
  10. Jae W. Lee (9 papers)
  11. Deog-Kyoon Jeong (1 paper)
Citations (172)

Summary

Overview of IEEEtran.cls Demo Paper for IEEECS Conferences

The provided document serves as an introductory template for researchers intending to submit conference papers using the IEEEtran.cls class in LaTeX. The paper outlines fundamental aspects of document preparation, specifying formatting rules, font types, and structural guidelines necessary for authors adhering to IEEE conference standards.

Document Structure and Formatting Guidelines

The paper provides a systematic approach to organizing a manuscript with the IEEEtran.cls LaTeX class. It briefly describes how to prepare each part of the document, such as the title, author affiliations, and section headings, so that the result complies with IEEE's formatting requirements. In particular, it calls for Times Roman or Times New Roman fonts and recommends embedding the fonts used for mathematical symbols, so that documents render consistently across different software and systems.

Practical and Theoretical Implications

From a practical perspective, these guidelines are instrumental in streamlining the publication process for authors and editors alike by mitigating formatting issues. Adherence to standardized document formats ensures consistency across conference proceedings, thereby enhancing readability and accessibility of technical content for the IEEE research community.

Theoretically, the adoption of uniform formatting paradigms underscores the broader academic practice of creating cohesive and high-quality scholarly communications. This practice facilitates interdisciplinary research by reducing barriers associated with presentation and delivery of complex information.

Prospects for Future Developments

While the current template focuses primarily on standardizing layout and font usage, future versions of IEEEtran.cls might add customization options to accommodate evolving multimedia presentation formats and novel data visualization techniques. Additionally, as digital accessibility standards progress, further revisions could improve document compatibility with assistive technology and alternative dissemination platforms.

In summary, this paper acts as a foundational resource for authors navigating the intricacies of IEEE conference paper submission using LaTeX. The meticulous attention to detail and robust guidelines foster a more efficient preparation process, ultimately contributing to the dissemination and impact of high-caliber academic research.