Minigrid & Miniworld: Modular & Customizable Reinforcement Learning Environments for Goal-Oriented Tasks (2306.13831v1)

Published 24 Jun 2023 in cs.LG

Abstract: We present the Minigrid and Miniworld libraries which provide a suite of goal-oriented 2D and 3D environments. The libraries were explicitly created with a minimalistic design paradigm to allow users to rapidly develop new environments for a wide range of research-specific needs. As a result, both have received widescale adoption by the RL community, facilitating research in a wide range of areas. In this paper, we outline the design philosophy, environment details, and their world generation API. We also showcase the additional capabilities brought by the unified API between Minigrid and Miniworld through case studies on transfer learning (for both RL agents and humans) between the different observation spaces. The source code of Minigrid and Miniworld can be found at https://github.com/Farama-Foundation/{Minigrid, Miniworld} along with their documentation at https://{minigrid, miniworld}.farama.org/.

Summary

  • The paper presents modular RL environments that enable rapid creation of customizable 2D and 3D goal-oriented tasks.
  • It details a minimalistic design and seamless API integration with Python and Gym, reducing dependencies for easier adoption.
  • The work demonstrates practical utility through case studies in curriculum, exploration, and transfer learning across environments.

Overview of "Minigrid and Miniworld: Modular and Customizable Reinforcement Learning Environments for Goal-Oriented Tasks"

The paper presents the Minigrid and Miniworld libraries, which offer a set of customizable 2D and 3D environments for reinforcement learning (RL) focused on goal-oriented tasks. Developed with a minimalistic design, these libraries aim to facilitate rapid environment creation and have achieved broad adoption within the RL community. This work outlines the design philosophy, environment details, and the APIs of these libraries, emphasizing their utility across a range of RL research areas.

Design Philosophy

Minigrid and Miniworld were created with simplicity and customizability as primary goals to accommodate diverse research-specific needs. Their implementation using Python and the Gym RL environment API ensures seamless integration with existing machine learning tools. The libraries have minimal dependencies (e.g., NumPy for Minigrid and Pyglet for Miniworld), which simplifies installation and reduces potential technical issues.
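As a concrete illustration of this integration, the sketch below (assuming `minigrid` is installed, e.g. via pip) instantiates one of the library's stock environments through the standard Gym/Gymnasium API and steps it with a random policy; the environment ID shown is taken from the library's catalogue rather than from the paper.

```python
# Minimal sketch: create a stock Minigrid environment through the Gymnasium API
# and run one episode with a random policy.
import gymnasium as gym
import minigrid  # importing the package registers the MiniGrid-* environment IDs

env = gym.make("MiniGrid-Empty-8x8-v0")
obs, info = env.reset(seed=0)
terminated = truncated = False
while not (terminated or truncated):
    action = env.action_space.sample()  # random policy, for illustration only
    obs, reward, terminated, truncated, info = env.step(action)
env.close()
```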

Environment Features

Minigrid:

  • Composed of 2D GridWorld environments with deterministic dynamics.
  • Observations include a rendered agent view, direction, and text-based mission.
  • Actions are discrete, encompassing movements and interactions with grid objects.
  • The reward function is sparse by default and customizable through user-defined modifications (a short sketch of the observation and action interfaces follows this list).
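A short sketch of those interfaces, assuming `minigrid` is installed; the environment ID is one of the stock tasks and the printed values are illustrative.

```python
# Inspect the dict observation (image, direction, mission) and the discrete action set.
import gymnasium as gym
import minigrid  # registers the MiniGrid-* environment IDs

env = gym.make("MiniGrid-DoorKey-5x5-v0")
obs, _ = env.reset(seed=0)
print(obs["image"].shape)   # encoded partial agent view, (7, 7, 3) by default
print(obs["direction"])     # agent heading as an integer in {0, 1, 2, 3}
print(obs["mission"])       # text-based mission string for the task
print(env.action_space)     # discrete actions: turn, move forward, pick up, drop, toggle, done
env.close()
```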

Miniworld:

  • Consists of 3D environments with room-object configurations.
  • Observations are RGB images from the agent’s perspective.
  • The discrete action set includes an additional "move back" action not present in Minigrid.
  • Like Minigrid, the reward structure is sparse and user-customizable (see the sketch after this list).
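A parallel sketch for Miniworld, assuming `miniworld` is installed; the environment ID and default observation size follow the library's documentation and may vary by version.

```python
# Create a Miniworld task and inspect its first-person RGB observation.
import gymnasium as gym
import miniworld  # registers the MiniWorld-* environment IDs

env = gym.make("MiniWorld-OneRoom-v0")
obs, _ = env.reset(seed=0)
print(obs.shape)            # first-person RGB view, e.g. (60, 80, 3) by default
print(env.action_space)     # discrete actions; many tasks expose only a subset of the full set
env.close()
```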

Both libraries offer straightforward mechanisms for creating and extending environments, aided by comprehensive documentation and tutorials. They are compatible with reinforcement learning libraries such as Stable-Baselines3.
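The world-generation API at the core of both libraries amounts to subclassing the base environment and overriding a single generation hook. The sketch below follows the pattern in the Minigrid documentation's custom-environment tutorial; the class name is hypothetical, and constructor arguments such as mission_space and max_steps may differ slightly across versions.

```python
# Sketch of a custom Minigrid environment using the world-generation API:
# an empty walled room with a goal square in the far corner.
from minigrid.core.grid import Grid
from minigrid.core.mission import MissionSpace
from minigrid.core.world_object import Goal
from minigrid.minigrid_env import MiniGridEnv

class EmptyRoomEnv(MiniGridEnv):
    def __init__(self, size: int = 8, **kwargs):
        super().__init__(
            mission_space=MissionSpace(mission_func=lambda: "get to the green goal square"),
            grid_size=size,
            max_steps=4 * size * size,
            **kwargs,
        )

    def _gen_grid(self, width, height):
        # Called on every reset: lay out walls, place objects, and position the agent.
        self.grid = Grid(width, height)
        self.grid.wall_rect(0, 0, width, height)
        self.put_obj(Goal(), width - 2, height - 2)
        self.place_agent()
        self.mission = "get to the green goal square"
```

Because the subclass follows the standard Gym interface, an instance such as EmptyRoomEnv(size=6) can be handed to training libraries directly; for image-based policies in Stable-Baselines3, the dict observation is typically reduced with a wrapper such as minigrid.wrappers.ImgObsWrapper first.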

Research Adoption and Utility

Minigrid and Miniworld have been adopted widely across RL research domains:

  • Curriculum Learning: Used for generating scalable learning environments.
  • Exploration: Their sparse rewards make them ideal for developing robust exploration strategies.
  • Meta and Transfer Learning: The libraries support experimentation with new learning algorithms and transfer across diverse environments.

Case Studies

Two case studies demonstrate the libraries' capabilities in transfer learning:

  1. RL Agent Transfer Learning: Policies trained on Minigrid were transferred to matching Miniworld environments, and different strategies for transferring network components were compared, showing that certain configurations enhanced transfer (a schematic sketch of the general idea follows this list).
  2. Human Transfer Learning: Human subjects transitioned from Minigrid to Miniworld environments, showcasing adaptability in navigation tasks. This highlighted the potential for using these libraries in human-in-the-loop systems.
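The following schematic, written in plain PyTorch with hypothetical module names and sizes, illustrates the general idea behind the first case study rather than the authors' actual procedure: reuse the perception component of a policy trained in one observation space when initializing a policy for another, while re-initializing the task-specific head.

```python
# Schematic of transferring part of a policy network between environments.
import torch.nn as nn

class Policy(nn.Module):
    def __init__(self, in_channels: int, n_actions: int):
        super().__init__()
        self.encoder = nn.Sequential(                          # perception component
            nn.Conv2d(in_channels, 16, 3, stride=2), nn.ReLU(),
            nn.Conv2d(16, 32, 3, stride=2), nn.ReLU(),
            nn.AdaptiveAvgPool2d(1), nn.Flatten(),
        )
        self.head = nn.Linear(32, n_actions)                   # control component

    def forward(self, x):
        return self.head(self.encoder(x))

source = Policy(in_channels=3, n_actions=7)   # e.g. trained on a Minigrid task
target = Policy(in_channels=3, n_actions=8)   # e.g. to be trained on a Miniworld task

# Transfer only the encoder weights; the action head is re-initialized because
# the target task uses a different action space.
target.encoder.load_state_dict(source.encoder.state_dict())
for p in target.encoder.parameters():
    p.requires_grad = False  # optionally freeze the transferred component
```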

Implications and Future Directions

The Minigrid and Miniworld libraries provide vital infrastructure for experimenting with RL algorithms in customizable settings. Their design fosters experimentation in both theoretical and practical aspects of RL, such as safe RL, curiosity-driven exploration, and real-world applicability. Future developments include enhancing human-in-the-loop capabilities, although the libraries remain limited by their simplified environment structures and Python-based implementation.

The comprehensive documentation and open-source availability on GitHub underline the libraries' accessibility and potential for further contribution by the broader research community.