
A structure-aware framework for learning device placements on computation graphs (2405.14185v1)

Published 23 May 2024 in cs.LG and cs.PF

Abstract: Existing approaches for device placement ignore the topological features of computation graphs and rely mostly on heuristic methods for graph partitioning. At the same time, they either follow a grouper-placer or an encoder-placer architecture, which requires understanding the interaction structure between code operations. To bridge the gap between encoder-placer and grouper-placer techniques, we propose a novel framework for the task of device placement, relying on smaller computation graphs extracted from the OpenVINO toolkit using reinforcement learning. The framework consists of five steps, including graph coarsening, node representation learning and policy optimization. It facilitates end-to-end training and takes into consideration the directed and acyclic nature of the computation graphs. We also propose a model variant, inspired by graph parsing networks and complex network analysis, enabling graph representation learning and personalized graph partitioning jointly, using an unspecified number of groups. To train the entire framework, we utilize reinforcement learning techniques by employing the execution time of the suggested device placements to formulate the reward. We demonstrate the flexibility and effectiveness of our approach through multiple experiments with three benchmark models, namely Inception-V3, ResNet, and BERT. The robustness of the proposed framework is also highlighted through an ablation study. The suggested placements improve the inference speed for the benchmark models by up to $58.2\%$ over CPU execution and by up to $60.24\%$ compared to other commonly used baselines.
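The reward formulation mentioned in the abstract (using the execution time of a suggested placement as the learning signal) can be illustrated with a minimal sketch. This is not the authors' framework: it omits graph coarsening and node representation learning, replaces measured execution time with a simulated latency, and uses a plain REINFORCE update with a moving-average baseline. The toy graph, the node names, the device speeds, the transfer cost, and the functions `simulated_latency` and `train` are all hypothetical choices made for illustration.

```python
import math
import random

# Hypothetical toy computation graph: node -> (compute cost, predecessors).
# Nodes are listed in topological order (dicts preserve insertion order).
GRAPH = {
    "conv1": (4.0, []),
    "conv2": (4.0, ["conv1"]),
    "branch_a": (8.0, ["conv2"]),
    "branch_b": (2.0, ["conv2"]),
    "concat": (1.0, ["branch_a", "branch_b"]),
}
DEVICE_SPEED = [1.0, 2.0]  # assumed relative speeds of device 0 and device 1
TRANSFER_COST = 0.5        # assumed delay when an edge crosses devices


def simulated_latency(placement):
    """Simulated end-to-end latency of a placement {node: device index}."""
    finish = {}
    device_free = [0.0] * len(DEVICE_SPEED)
    for node, (cost, preds) in GRAPH.items():
        dev = placement[node]
        ready = device_free[dev]  # each device runs one node at a time
        for p in preds:
            transfer = TRANSFER_COST if placement[p] != dev else 0.0
            ready = max(ready, finish[p] + transfer)
        finish[node] = ready + cost / DEVICE_SPEED[dev]
        device_free[dev] = finish[node]
    return max(finish.values())


def softmax(logits):
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def train(steps=2000, lr=0.1, seed=0):
    """REINFORCE over an independent softmax policy per node."""
    rng = random.Random(seed)
    devices = range(len(DEVICE_SPEED))
    logits = {n: [0.0] * len(DEVICE_SPEED) for n in GRAPH}
    baseline = None
    for _ in range(steps):
        probs = {n: softmax(logits[n]) for n in GRAPH}
        placement = {n: rng.choices(devices, weights=probs[n])[0] for n in GRAPH}
        reward = -simulated_latency(placement)  # faster placement, higher reward
        if baseline is None:
            baseline = reward
        advantage = reward - baseline
        baseline = 0.9 * baseline + 0.1 * reward
        for n in GRAPH:
            for d in devices:
                # Gradient of log softmax w.r.t. logit d for the sampled action.
                grad = (1.0 if d == placement[n] else 0.0) - probs[n][d]
                logits[n][d] += lr * advantage * grad
    # Return the greedy placement under the learned policy.
    return {n: max(devices, key=lambda d: logits[n][d]) for n in GRAPH}
```

In this toy setting, placing everything on the faster device gives a simulated latency of 9.5, while moving only `branch_b` to the slower device lets the two branches run in parallel and gives 8.5, so the best placement is not simply "use the fastest device". A learned policy such as the one in the paper would additionally condition on graph structure rather than keeping one independent logit vector per node.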

Authors (12)
  1. Shukai Duan (11 papers)
  2. Heng Ping (9 papers)
  3. Nikos Kanakaris (9 papers)
  4. Xiongye Xiao (16 papers)
  5. Peiyu Zhang (23 papers)
  6. Panagiotis Kyriakis (2 papers)
  7. Nesreen K. Ahmed (76 papers)
  8. Guixiang Ma (20 papers)
  9. Mihai Capota (9 papers)
  10. Shahin Nazarian (31 papers)
  11. Theodore L. Willke (21 papers)
  12. Paul Bogdan (51 papers)
Citations (1)
