Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

R^3: On-device Real-Time Deep Reinforcement Learning for Autonomous Robotics (2308.15039v2)

Published 29 Aug 2023 in cs.RO, cs.AI, cs.SY, and eess.SY

Abstract: Autonomous robotic systems, like autonomous vehicles and robotic search and rescue, require efficient on-device training for continuous adaptation of Deep Reinforcement Learning (DRL) models in dynamic environments. This research is fundamentally motivated by the need to understand and address the challenges of on-device real-time DRL, which involves balancing timing and algorithm performance under memory constraints, as exposed through our extensive empirical studies. This intricate balance requires co-optimizing two pivotal parameters of DRL training -- batch size and replay buffer size. Configuring these parameters significantly affects timing and algorithm performance, while both (unfortunately) require substantial memory allocation to achieve near-optimal performance. This paper presents R3, a holistic solution for managing timing, memory, and algorithm performance in on-device real-time DRL training. R3 employs (i) a deadline-driven feedback loop with dynamic batch sizing for optimizing timing, (ii) efficient memory management to reduce memory footprint and allow larger replay buffer sizes, and (iii) a runtime coordinator guided by heuristic analysis and a runtime profiler for dynamically adjusting memory resource reservations. These components collaboratively tackle the trade-offs in on-device DRL training, improving timing and algorithm performance while minimizing the risk of out-of-memory (OOM) errors. We implemented and evaluated R3 extensively across various DRL frameworks and benchmarks on three hardware platforms commonly adopted by autonomous robotic systems. Additionally, we integrate R3 with a popular realistic autonomous car simulator to demonstrate its real-world applicability. Evaluation results show that R3 achieves efficacy across diverse platforms, ensuring consistent latency performance and timing predictability with minimal overhead.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Zexin Li (16 papers)
  2. Aritra Samanta (4 papers)
  3. Yufei Li (29 papers)
  4. Andrea Soltoggio (20 papers)
  5. Hyoseung Kim (14 papers)
  6. Cong Liu (169 papers)
Citations (5)

Summary

We haven't generated a summary for this paper yet.