
XuanCe: A Comprehensive and Unified Deep Reinforcement Learning Library (2312.16248v1)

Published 25 Dec 2023 in cs.LG, cs.AI, and cs.DL

Abstract: In this paper, we present XuanCe, a comprehensive and unified deep reinforcement learning (DRL) library designed to be compatible with PyTorch, TensorFlow, and MindSpore. XuanCe offers a wide range of functionalities, including over 40 classical DRL and multi-agent DRL algorithms, with the flexibility to easily incorporate new algorithms and environments. It is a versatile DRL library that supports CPU, GPU, and Ascend, and can be executed on various operating systems such as Ubuntu, Windows, MacOS, and EulerOS. Extensive benchmarks conducted on popular environments including MuJoCo, Atari, and StarCraftII multi-agent challenge demonstrate the library's impressive performance. XuanCe is open-source and can be accessed at https://github.com/agi-brain/xuance.git.

References (32)
  1. TensorFlow: A system for large-scale machine learning. In OSDI, volume 16, pages 265–283, Savannah, GA, USA, 2016.
  2. Joshua Achiam. Spinning Up in Deep Reinforcement Learning. 2018.
  3. OpenAI Gym. arXiv preprint arXiv:1606.01540, 2016.
  4. Dopamine: A research framework for deep reinforcement learning. 2018. URL http://arxiv.org/abs/1812.06110.
  5. MushroomRL: Simplifying reinforcement learning research. Journal of Machine Learning Research, 22(131):1–5, 2021. URL http://jmlr.org/papers/v22/18-056.html.
  6. RLzoo: A comprehensive and adaptive reinforcement learning library. arXiv preprint arXiv:2009.08644, 2020.
  7. Addressing function approximation error in actor-critic methods. In International Conference on Machine Learning, pages 1587–1596. PMLR, 2018.
  8. ChainerRL: A deep reinforcement learning library. The Journal of Machine Learning Research, 22(1):3557–3570, 2021.
  9. Soft actor-critic: Off-policy maximum entropy deep reinforcement learning with a stochastic actor. In International Conference on Machine Learning, pages 1861–1870. PMLR, 2018.
  10. MARLlib: A scalable and efficient library for multi-agent reinforcement learning. Journal of Machine Learning Research, 24:1–23, 2023.
  11. Huawei Technologies Co., Ltd. Huawei MindSpore AI development framework. In Artificial Intelligence Technology, pages 137–162. Springer, 2022.
  12. OR-Gym: A reinforcement learning library for operations research problems. arXiv preprint arXiv:2008.06319, 2020.
  13. Google Research Football: A novel reinforcement learning environment. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 34, pages 4501–4510, 2020.
  14. Deep learning. Nature, 521(7553):436–444, 2015.
  15. RLlib: Abstractions for distributed reinforcement learning. In International Conference on Machine Learning, pages 3053–3062. PMLR, 2018.
  16. FinRL: Deep reinforcement learning framework to automate trading in quantitative finance. In Proceedings of the Second ACM International Conference on AI in Finance, pages 1–9, 2021.
  17. Human-level control through deep reinforcement learning. Nature, 518(7540):529–533, 2015.
  18. Fabio Pardo. Tonic: A deep reinforcement learning library for fast prototyping and benchmarking. arXiv preprint arXiv:2011.07537, 2020.
  19. PyTorch: An imperative style, high-performance deep learning library. Advances in Neural Information Processing Systems, 32, 2019.
  20. Language models are unsupervised multitask learners. OpenAI Blog, 1(8):9, 2019.
  21. Weighted QMIX: Expanding monotonic value function factorisation for deep multi-agent reinforcement learning. Advances in Neural Information Processing Systems, 33:10199–10210, 2020.
  22. Monotonic value function factorisation for deep multi-agent reinforcement learning. The Journal of Machine Learning Research, 21(1):7234–7284, 2020.
  23. Proximal policy optimization algorithms. arXiv preprint arXiv:1707.06347, 2017.
  24. d3rlpy: An offline deep reinforcement learning library. The Journal of Machine Learning Research, 23(1):14205–14224, 2022.
  25. skrl: Modular and flexible library for reinforcement learning. Journal of Machine Learning Research, 24(254):1–9, 2023.
  26. A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play. Science, 362(6419):1140–1144, 2018.
  27. Value-decomposition multi-agent actor-critics. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 35, pages 11352–11360, 2021.
  28. Value-decomposition networks for cooperative multi-agent learning. arXiv preprint arXiv:1706.05296, 2017.
  29. StarCraft II: A new challenge for reinforcement learning. arXiv preprint arXiv:1708.04782, 2017.
  30. Tianshou: A highly modularized deep reinforcement learning library. Journal of Machine Learning Research, 23(267):1–6, 2022.
  31. The surprising effectiveness of PPO in cooperative multi-agent games. Advances in Neural Information Processing Systems, 35:24611–24624, 2022.
  32. MAgent: A many-agent reinforcement learning platform for artificial collective intelligence. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 32, 2018.

Summary

  • The paper presents XuanCe, a unified library that streamlines both DRL and MARL research with support for over 40 algorithms.
  • The paper demonstrates versatile integration with PyTorch, TensorFlow, and MindSpore across diverse hardware and operating systems.
  • The paper validates XuanCe’s efficacy with benchmarks on environments like MuJoCo, Atari, and the StarCraft II multi-agent challenge, showcasing its competitive performance.

Overview of XuanCe: A Deep Reinforcement Learning Library

The paper presents XuanCe, a sophisticated deep reinforcement learning (DRL) library that offers integration with major deep learning frameworks such as PyTorch, TensorFlow, and MindSpore. The primary aim of XuanCe is to address the heterogeneous nature of DRL algorithms and environments, providing a unified platform that simplifies the development and evaluation of both traditional single-agent DRL and multi-agent reinforcement learning (MARL) techniques. The library is open-source and supports a range of hardware and operating system platforms, thus providing flexibility and adaptability for users in diverse computing environments.

Key Features

XuanCe includes more than 40 algorithms spanning DRL and MARL, with compatibility across multiple deep learning frameworks. This algorithmic breadth is a key strength, giving researchers a comprehensive toolkit for exploring a wide variety of DRL applications. The library's architecture is modular, which simplifies the integration and testing of new algorithms and environments.

  1. Versatility: Compatible with PyTorch, TensorFlow, and MindSpore, XuanCe facilitates the deployment of DRL models on CPUs, GPUs, and Ascend hardware across operating systems such as Ubuntu, Windows, MacOS, and EulerOS.
  2. Algorithmic Diversity: Supporting over 40 algorithms, XuanCe offers a toolbox spanning value-based, policy-based, and MARL methods, covering applications from simple control tasks to complex multi-agent scenarios (a minimal usage sketch follows this list).
  3. Comprehensive Benchmarks: The library has been benchmarked on common environments such as MuJoCo, Atari, and the StarCraft II multi-agent challenge, and the reported results are competitive with those published in other DRL research.
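
To make the intended workflow concrete, here is a minimal sketch of launching a training run. It assumes the `get_runner` entry point advertised in the project repository; the exact argument names and available `method`/`env` identifiers are assumptions and should be checked against https://github.com/agi-brain/xuance.

```python
# Minimal sketch of launching a DRL run with XuanCe (argument names assumed
# from the project README; verify against https://github.com/agi-brain/xuance).
import xuance

# Build a runner for a chosen algorithm ("method"), environment family, and task.
runner = xuance.get_runner(
    method="dqn",           # one of the 40+ supported algorithms
    env="classic_control",  # environment family
    env_id="CartPole-v1",   # concrete task within that family
    is_test=False,          # train rather than evaluate
)

runner.run()  # start training with hyper-parameters taken from the YAML configs
```

Switching to another algorithm, environment, or backend framework is intended to be a matter of changing these identifiers and the corresponding YAML configuration rather than rewriting training code.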

Design and Implementation

XuanCe's design comprises four primary components:

  • Configs: YAML files that hold hyper-parameters and flexible environment and model configurations (an illustrative configuration sketch follows this list).
  • Common Tools: Utilities for preparing and initializing models before training, alongside memory utilities for experience replay, which off-policy learning strategies rely on.
  • Environments: Beyond supporting popular tasks, the library improves sample efficiency by running environments in parallel.
  • Algorithms: XuanCe organizes its DRL capabilities into a set of unified modules: utils, representations, policies, learners, agents, and runners.
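
To illustrate how such a YAML configuration might look and how its hyper-parameters could be consumed, the following is a hedged sketch; the field names are hypothetical stand-ins chosen for illustration, not XuanCe's actual configuration schema.

```python
# Illustrative sketch of a YAML hyper-parameter file and how a library could
# consume it; the field names are hypothetical, not XuanCe's actual schema.
import yaml

EXAMPLE_CONFIG = """
agent: DQN                 # algorithm to train
env_name: classic_control  # environment family
env_id: CartPole-v1        # concrete task
learning_rate: 0.001
gamma: 0.99                # discount factor
buffer_size: 10000         # replay-memory capacity used by off-policy learners
batch_size: 64
parallels: 8               # number of parallel environments for sample efficiency
"""

config = yaml.safe_load(EXAMPLE_CONFIG)

# Downstream components (representation -> policy -> learner -> agent -> runner)
# would each read the hyper-parameters they need from this single dictionary.
print(config["agent"], config["learning_rate"], config["parallels"])
```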

Related Work

The paper positions XuanCe relative to existing DRL libraries, highlighting its combination of broad algorithm support, modular structure, and multi-framework compatibility as distinguishing features. Libraries such as RLlib often focus on a subset of DRL strategies or on a single framework, whereas XuanCe aims to provide a unified and holistic approach.

Conclusion and Implications

XuanCe is a robust, feature-rich DRL library well suited to advanced DRL research and application development. Its portability across frameworks, hardware, and environments makes it a valuable tool for researchers who want to experiment rapidly across a spectrum of scenarios, and its extensive benchmarking gives users a basis for judging its performance against existing solutions. Future work may extend the algorithm base or further optimize its adaptability to diverse operational settings, broadening the range of problems the community can explore with it.
