Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

FFT-Based Deep Learning Deployment in Embedded Systems (1712.04910v1)

Published 13 Dec 2017 in cs.LG and stat.ML

Abstract: Deep learning has delivered its powerfulness in many application domains, especially in image and speech recognition. As the backbone of deep learning, deep neural networks (DNNs) consist of multiple layers of various types with hundreds to thousands of neurons. Embedded platforms are now becoming essential for deep learning deployment due to their portability, versatility, and energy efficiency. The large model size of DNNs, while providing excellent accuracy, also burdens the embedded platforms with intensive computation and storage. Researchers have investigated on reducing DNN model size with negligible accuracy loss. This work proposes a Fast Fourier Transform (FFT)-based DNN training and inference model suitable for embedded platforms with reduced asymptotic complexity of both computation and storage, making our approach distinguished from existing approaches. We develop the training and inference algorithms based on FFT as the computing kernel and deploy the FFT-based inference model on embedded platforms achieving extraordinary processing speed.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Sheng Lin (29 papers)
  2. Ning Liu (199 papers)
  3. Mahdi Nazemi (18 papers)
  4. Hongjia Li (11 papers)
  5. Caiwen Ding (98 papers)
  6. Yanzhi Wang (197 papers)
  7. Massoud Pedram (93 papers)
Citations (51)

Summary

We haven't generated a summary for this paper yet.