
Effective Algorithm-Accelerator Co-design for AI Solutions on Edge Devices (2010.07185v2)

Published 14 Oct 2020 in cs.AR and cs.LG

Abstract: High-quality AI solutions require joint optimization of AI algorithms, such as deep neural networks (DNNs), and their hardware accelerators. To improve overall solution quality and boost design productivity, efficient algorithm and accelerator co-design methodologies are indispensable. In this paper, we first discuss the motivations and challenges of the algorithm/accelerator co-design problem and then provide several effective solutions. In particular, we highlight three leading works on effective co-design methodologies: 1) the first simultaneous DNN/FPGA co-design method; 2) a bi-directional lightweight DNN and accelerator co-design method; 3) a differentiable and efficient DNN and accelerator co-search method. We demonstrate the effectiveness of the proposed co-design approaches using extensive experiments on both FPGAs and GPUs, with comparisons to existing works. This paper emphasizes the importance and efficacy of algorithm-accelerator co-design and calls for more research breakthroughs in this interesting and demanding area.

Authors (7)
  1. Cong Hao (51 papers)
  2. Yao Chen (187 papers)
  3. Xiaofan Zhang (79 papers)
  4. Yuhong Li (33 papers)
  5. Jinjun Xiong (118 papers)
  6. Wen-mei Hwu (62 papers)
  7. Deming Chen (62 papers)
Citations (6)
