Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

AdaptiveNet: Post-deployment Neural Architecture Adaptation for Diverse Edge Environments (2303.07129v1)

Published 13 Mar 2023 in cs.LG and cs.DC

Abstract: Deep learning models are increasingly deployed to edge devices for real-time applications. To ensure stable service quality across diverse edge environments, it is highly desirable to generate tailored model architectures for different conditions. However, conventional pre-deployment model generation approaches are not satisfactory due to the difficulty of handling the diversity of edge environments and the demand for edge information. In this paper, we propose to adapt the model architecture after deployment in the target environment, where the model quality can be precisely measured and private edge data can be retained. To achieve efficient and effective edge model generation, we introduce a pretraining-assisted on-cloud model elastification method and an edge-friendly on-device architecture search method. Model elastification generates a high-quality search space of model architectures with the guidance of a developer-specified oracle model. Each subnet in the space is a valid model with different environment affinity, and each device efficiently finds and maintains the most suitable subnet based on a series of edge-tailored optimizations. Extensive experiments on various edge devices demonstrate that our approach is able to achieve significantly better accuracy-latency tradeoffs (e.g. 46.74\% higher on average accuracy with a 60\% latency budget) than strong baselines with minimal overhead (13 GPU hours in the cloud and 2 minutes on the edge server).

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (8)
  1. Hao Wen (52 papers)
  2. Yuanchun Li (37 papers)
  3. Zunshuai Zhang (3 papers)
  4. Shiqi Jiang (27 papers)
  5. Xiaozhou Ye (18 papers)
  6. Ye Ouyang (16 papers)
  7. Ya-Qin Zhang (45 papers)
  8. Yunxin Liu (58 papers)
Citations (20)

Summary

We haven't generated a summary for this paper yet.