Explainable Hierarchical Reinforcement Learning for Robotic Manipulation
This paper introduces a novel approach to robotic manipulation via explainable hierarchical reinforcement learning, termed Dot-to-Dot (DtD). Dot-to-Dot addresses the challenge of interpretability in robotic systems that rely on deep reinforcement learning (DRL), particularly in environments with high-dimensional action and state spaces and sparse rewards. The research is built on the premise that a fundamental step toward successfully deploying intelligent robotic systems is ensuring that their decision-making processes are comprehensible to human operators.
Methodology
Dot-to-Dot employs a hierarchical system composed of a high-level agent and a low-level agent. The high-level agent generates intermediate sub-goals that make the decision-making process interpretable, while the low-level agent handles the primitive actions needed to reach each sub-goal in the robot's complex state space. This hierarchical decomposition breaks tasks into smaller, manageable segments, simplifying learning in the spirit of curriculum learning.
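The decomposition above can be illustrated with a minimal sketch. This is not the paper's implementation; it is a toy 1-D task (function names and the step size are assumptions for illustration) showing how a high-level policy can turn a distant goal into a chain of nearby sub-goals that a simple low-level policy reaches one at a time:

```python
# Toy sketch of hierarchical sub-goal decomposition (illustrative only):
# the high-level agent proposes sub-goals no farther than `step_size` away,
# and the low-level agent moves one unit at a time toward each sub-goal.

def high_level_subgoal(state, goal, step_size=3):
    """Propose a sub-goal at most `step_size` units from the current state."""
    delta = goal - state
    if abs(delta) <= step_size:
        return goal
    return state + step_size * (1 if delta > 0 else -1)

def low_level_policy(state, subgoal):
    """Primitive action: move one unit toward the current sub-goal."""
    if state < subgoal:
        return +1
    if state > subgoal:
        return -1
    return 0

def run_episode(start, goal, max_steps=50):
    """Alternate between proposing a sub-goal and reaching it."""
    state, subgoals = start, []
    for _ in range(max_steps):
        if state == goal:
            break
        sg = high_level_subgoal(state, goal)
        subgoals.append(sg)
        while state != sg:
            state += low_level_policy(state, sg)
    return state, subgoals

# run_episode(0, 10) reaches the goal via sub-goals [3, 6, 9, 10]
```

The recorded sub-goal sequence is exactly the kind of human-readable trace the hierarchical structure provides: each entry is a "dot" the robot intends to reach next.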
Key components of the Dot-to-Dot algorithm include Deep Deterministic Policy Gradients (DDPG) for policy optimization and Hindsight Experience Replay (HER) to extract learning signal from unsuccessful episodes. HER lets the low-level agent learn from failures by relabeling stored transitions with goals that were actually achieved rather than the goals originally intended.
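The relabeling idea behind HER can be sketched in a few lines. This is a simplified version of HER's "final" strategy (the dictionary field names and the scalar achieved-goal representation are assumptions for illustration; in practice goals are vectors and rewards are recomputed per transition):

```python
def sparse_reward(achieved, goal, tol=0.05):
    """Sparse reward: 0 when the goal is (approximately) reached, else -1."""
    return 0.0 if abs(achieved - goal) <= tol else -1.0

def her_relabel(episode):
    """Sketch of HER's 'final' strategy: alongside each original transition,
    store a copy whose goal is replaced by the goal actually achieved at the
    end of the episode, with the reward recomputed for that new goal."""
    final_achieved = episode[-1]["achieved"]
    relabeled = []
    for t in episode:
        relabeled.append(t)  # original transition, original (intended) goal
        new_t = dict(t,
                     goal=final_achieved,
                     reward=sparse_reward(t["achieved"], final_achieved))
        relabeled.append(new_t)
    return relabeled
```

Even though the original episode earned only -1 rewards, the relabeled copies contain a 0 reward at the step where the achieved goal coincides with the new (hindsight) goal, which is what makes learning from sparse rewards tractable.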
Experimental Validation
The paper validates the Dot-to-Dot approach using the Fetch Robotics Manipulator and Shadow Hand simulation environments in MuJoCo, a well-regarded physics engine for robotic simulations. These setups provide a rigorous testing ground, with varied tasks such as FetchPush, FetchPickAndPlace, and HandManipulateBlock. Results demonstrate Dot-to-Dot's capability to efficiently learn intricate manipulation sequences, achieving performance levels comparable to current baselines like HER.
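Goal-based manipulation benchmarks of this kind typically score an episode as a binary success: the achieved position must land within a fixed distance of the desired goal. The sketch below illustrates that convention and a success-rate metric over episodes; the threshold value and function names are assumptions for illustration, not taken from the paper:

```python
import math

def goal_distance(a, b):
    """Euclidean distance between two positions given as coordinate tuples."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def is_success(achieved_pos, desired_pos, threshold=0.05):
    """An attempt succeeds when the achieved position is within `threshold`
    of the desired goal (threshold chosen here for illustration)."""
    return goal_distance(achieved_pos, desired_pos) < threshold

def success_rate(episodes, threshold=0.05):
    """Fraction of (achieved, desired) final-position pairs that succeed."""
    hits = sum(is_success(a, d, threshold) for a, d in episodes)
    return hits / len(episodes)
```

Reporting performance as a success rate under such a threshold is what makes results comparable across methods like Dot-to-Dot and the HER baseline.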
One significant outcome is Dot-to-Dot's ability to foster an intuitive representation of the task environment. By creating sub-goals that serve as waypoints in the task space, the system allows for human-readable insights into how the robot perceives and navigates its task environment. This interpretable framework is valuable in applications where human-robot interaction is pivotal, suggesting a pathway to improve trust and collaboration.
Implications and Future Work
The implications of Dot-to-Dot are substantial both theoretically and practically. The hierarchical reinforcement learning model offers a structured way to tackle advanced robotic manipulation tasks while embedding a degree of interpretability that is often lacking in deep learning systems. This interpretability is vital in scenarios where human operators must oversee and audit the robot's decisions.
Future research could focus on enhancing the exploration strategies for sub-goal generation and further refining the interpretability aspect. Techniques such as intrinsic motivation or curiosity-driven exploration could be integrated to improve sub-goal selection efficiency. Additionally, transitioning Dot-to-Dot from simulated environments to physical robots could offer profound insights and further validate the robustness of the approach in real-world applications.
Conclusion
Dot-to-Dot represents a meaningful advance in explainable reinforcement learning for robotic systems. By employing a hierarchical framework, it presents a viable solution to interpretability challenges, laying the groundwork for more sophisticated human-robot collaboration. The combination of DRL with clear, actionable insights positions this approach as a promising direction for future intelligent robotic systems.