
Benchmarking of DL Libraries and Models on Mobile Devices (2202.06512v2)

Published 14 Feb 2022 in cs.LG and cs.NI

Abstract: Deploying deep learning (DL) on mobile devices has been a notable trend in recent years. To support fast on-device inference, DL libraries play as critical a role as algorithms and hardware do. Unfortunately, no prior work has examined the ecosystem of modern DL libraries in depth or provided quantitative results on their performance. In this paper, we first build a comprehensive benchmark that includes 6 representative DL libraries and 15 diverse DL models. We then perform extensive experiments on 10 mobile devices, which reveal a complete landscape of the current mobile DL library ecosystem. For example, we find that the best-performing DL library varies widely across models and hardware, and the performance gap between DL libraries can be large. In fact, the impact of the DL library can outweigh optimizations from algorithms or hardware, e.g., model quantization and GPU/DSP-based heterogeneous computing. Finally, based on these observations, we summarize practical implications for the different roles in the DL library ecosystem.

Authors (9)
  1. Qiyang Zhang (17 papers)
  2. Xiang Li (1003 papers)
  3. Xiangying Che (1 paper)
  4. Xiao Ma (169 papers)
  5. Ao Zhou (31 papers)
  6. Mengwei Xu (62 papers)
  7. Shangguang Wang (58 papers)
  8. Yun Ma (38 papers)
  9. Xuanzhe Liu (59 papers)
Citations (42)
