2000 character limit reached
A3C-S: Automated Agent Accelerator Co-Search towards Efficient Deep Reinforcement Learning (2106.06577v1)
Published 11 Jun 2021 in cs.LG
Abstract: Driven by the explosive interest in applying deep reinforcement learning (DRL) agents to numerous real-time control and decision-making applications, there has been a growing demand to deploy DRL agents to empower daily-life intelligent devices, while the prohibitive complexity of DRL stands at odds with limited on-device resources. In this work, we propose an Automated Agent Accelerator Co-Search (A3C-S) framework, which to our best knowledge is the first to automatically co-search the optimally matched DRL agents and accelerators that maximize both test scores and hardware efficiency. Extensive experiments consistently validate the superiority of our A3C-S over state-of-the-art techniques.
- Yonggan Fu (49 papers)
- Yongan Zhang (24 papers)
- Chaojian Li (34 papers)
- Zhongzhi Yu (25 papers)
- Yingyan Lin (67 papers)