Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Virtual Augmented Reality for Atari Reinforcement Learning (2310.08683v1)

Published 12 Oct 2023 in cs.LG and cs.AI

Abstract: Reinforcement Learning (RL) has achieved significant milestones in the gaming domain, most notably Google DeepMind's AlphaGo defeating human Go champion Ken Jie. This victory was also made possible through the Atari Learning Environment (ALE): The ALE has been foundational in RL research, facilitating significant RL algorithm developments such as AlphaGo and others. In current Atari video game RL research, RL agents' perceptions of its environment is based on raw pixel data from the Atari video game screen with minimal image preprocessing. Contrarily, cutting-edge ML research, external to the Atari video game RL research domain, is focusing on enhancing image perception. A notable example is Meta Research's "Segment Anything Model" (SAM), a foundation model capable of segmenting images without prior training (zero-shot). This paper addresses a novel methodical question: Can state-of-the-art image segmentation models such as SAM improve the performance of RL agents playing Atari video games? The results suggest that SAM can serve as a "virtual augmented reality" for the RL agent, boosting its Atari video game playing performance under certain conditions. Comparing RL agent performance results from raw and augmented pixel inputs provides insight into these conditions. Although this paper was limited by computational constraints, the findings show improved RL agent performance for augmented pixel inputs and can inform broader research agendas in the domain of "virtual augmented reality for video game playing RL agents".

Definition Search Book Streamline Icon: https://streamlinehq.com
References (9)
  1. The arcade learning environment: An evaluation platform for general agents. 2013. doi: https://doi.org/10.1613/jair.3912.
  2. Unifying count-based exploration and intrinsic motivation. 2016. doi: https://doi.org/10.48550/arXiv.1606.01868.
  3. First return, then explore. 2021. doi: https://doi.org/10.1038/s41586-020-03157-9.
  4. Cleanrl: High-quality single-file implementations of deep reinforcement learning algorithms. 2021. doi: https://doi.org/10.48550/arXiv.2111.08819.
  5. Deep learning for video game playing. 2020. doi: https://doi.org/10.1109/TG.2019.2896986.
  6. Segment anything. 2023. doi: https://doi.org/10.48550/arXiv.2304.02643.
  7. Revisiting the arcade learning environment: Evaluation protocols and open problems for general agents. 2018. doi: https://doi.org/10.48550/arXiv.1709.06009.
  8. Playing atari with deep reinforcement learning. 2013. doi: https://doi.org/10.48550/arXiv.1312.5602.
  9. Mastering the game of go with deep neural networks and tree search. 2016. doi: https://doi.org/10.1038/nature16961.
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (1)
Citations (1)