Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
41 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

System Design for an Integrated Lifelong Reinforcement Learning Agent for Real-Time Strategy Games (2212.04603v1)

Published 8 Dec 2022 in cs.LG and cs.AI

Abstract: As Artificial and Robotic Systems are increasingly deployed and relied upon for real-world applications, it is important that they exhibit the ability to continually learn and adapt in dynamically-changing environments, becoming Lifelong Learning Machines. Continual/lifelong learning (LL) involves minimizing catastrophic forgetting of old tasks while maximizing a model's capability to learn new tasks. This paper addresses the challenging lifelong reinforcement learning (L2RL) setting. Pushing the state-of-the-art forward in L2RL and making L2RL useful for practical applications requires more than developing individual L2RL algorithms; it requires making progress at the systems-level, especially research into the non-trivial problem of how to integrate multiple L2RL algorithms into a common framework. In this paper, we introduce the Lifelong Reinforcement Learning Components Framework (L2RLCF), which standardizes L2RL systems and assimilates different continual learning components (each addressing different aspects of the lifelong learning problem) into a unified system. As an instantiation of L2RLCF, we develop a standard API allowing easy integration of novel lifelong learning components. We describe a case study that demonstrates how multiple independently-developed LL components can be integrated into a single realized system. We also introduce an evaluation environment in order to measure the effect of combining various system components. Our evaluation environment employs different LL scenarios (sequences of tasks) consisting of Starcraft-2 minigames and allows for the fair, comprehensive, and quantitative comparison of different combinations of components within a challenging common evaluation environment.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (19)
  1. Indranil Sur (11 papers)
  2. Zachary Daniels (7 papers)
  3. Abrar Rahman (7 papers)
  4. Kamil Faber (11 papers)
  5. Gianmarco J. Gallardo (1 paper)
  6. Tyler L. Hayes (24 papers)
  7. Cameron E. Taylor (2 papers)
  8. Mustafa Burak Gurbuz (7 papers)
  9. James Smith (20 papers)
  10. Sahana Joshi (1 paper)
  11. Nathalie Japkowicz (19 papers)
  12. Michael Baron (3 papers)
  13. Zsolt Kira (110 papers)
  14. Christopher Kanan (72 papers)
  15. Roberto Corizzo (12 papers)
  16. Ajay Divakaran (43 papers)
  17. Michael Piacentino (8 papers)
  18. Jesse Hostetler (15 papers)
  19. Aswin Raghavan (18 papers)
Citations (2)