Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

A Bi-Directional Co-Design Approach to Enable Deep Learning on IoT Devices (1905.08369v1)

Published 20 May 2019 in cs.CV

Abstract: Developing deep learning models for resource-constrained Internet-of-Things (IoT) devices is challenging, as it is difficult to achieve both good quality of results (QoR), such as DNN model inference accuracy, and quality of service (QoS), such as inference latency, throughput, and power consumption. Existing approaches typically separate the DNN model development step from its deployment on IoT devices, resulting in suboptimal solutions. In this paper, we first introduce a few interesting but counterintuitive observations about such a separate design approach, and empirically show why it may lead to suboptimal designs. Motivated by these observations, we then propose a novel and practical bi-directional co-design approach: a bottom-up DNN model design strategy together with a top-down flow for DNN accelerator design. It enables a joint optimization of both DNN models and their deployment configurations on IoT devices as represented as FPGAs. We demonstrate the effectiveness of the proposed co-design approach on a real-life object detection application using Pynq-Z1 embedded FPGA. Our method obtains the state-of-the-art results on both QoR with high accuracy (IoU) and QoS with high throughput (FPS) and high energy efficiency.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Xiaofan Zhang (79 papers)
  2. Cong Hao (51 papers)
  3. Yuhong Li (33 papers)
  4. Yao Chen (187 papers)
  5. Jinjun Xiong (118 papers)
  6. Wen-mei Hwu (62 papers)
  7. Deming Chen (62 papers)
Citations (13)

Summary

We haven't generated a summary for this paper yet.