- The paper introduces a three-phased framework that integrates supervised and reinforcement learning to develop adaptive tactile feedback models for robotic motion planning.
- The methodology uses semi-automated segmentation of human demonstrations and Phase-Modulated Neural Networks to capture phase-dependent reactive adaptations.
- Experiments on a scraping task highlight significant performance improvements, demonstrating effective generalization in both known and unseen settings.
Insights into Learning Feedback Models for Reactive Behaviors
The paper by Sutanto et al. presents a framework for learning the feedback models needed for reactive motion planning in robotics, with a focus on tactile feedback. The work addresses the inherent challenge robots face in adapting to environmental changes during task execution. The approach combines supervised learning from demonstrations with reinforcement learning (RL) to refine feedback models, so that the robot's motion plan can adapt online.
The cornerstone of this research is a three-phased methodology integrating supervised learning and RL for reactive behaviors. First, movement primitives are extracted from human demonstrations via segmentation, using a semi-automated method that combines dynamic time warping and weighted least squares. These primitives capture the nominal movements that serve as the baseline for adaptive behavior.
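As a hedged illustration of the primitive-fitting step, the sketch below fits the forcing term of a one-dimensional dynamic movement primitive (DMP) to one segmented demonstration with the standard weighted least-squares (locally weighted regression) fit. The choice of DMPs, the Gaussian bases, and the gain values are assumptions for illustration, not the authors' exact implementation.

```python
import numpy as np

def fit_dmp_forcing_term(y, dt, n_basis=25, alpha=25.0, beta=6.25, alpha_x=8.0):
    """Fit DMP forcing-term weights to one demonstrated segment via
    weighted least squares. Illustrative sketch; gains are assumed."""
    T = len(y)
    tau = T * dt                              # movement duration
    yd = np.gradient(y, dt)                   # demonstrated velocity
    ydd = np.gradient(yd, dt)                 # demonstrated acceleration
    y0, g = y[0], y[-1]

    # Canonical system: phase x decays from 1 toward 0 over the movement.
    t = np.arange(T) * dt
    x = np.exp(-alpha_x * t / tau)

    # Target forcing term implied by the demonstration.
    f_target = tau**2 * ydd - alpha * (beta * (g - y) - tau * yd)

    # Gaussian basis functions spaced in phase.
    c = np.exp(-alpha_x * np.linspace(0.0, 1.0, n_basis))  # centers
    h = n_basis / c                                        # widths (heuristic)
    psi = np.exp(-h * (x[:, None] - c) ** 2)               # (T, n_basis)

    # One weight per basis, solved in closed form (standard DMP fit).
    s = x * (g - y0)
    w = (psi * (s * f_target)[:, None]).sum(0) / ((psi * (s**2)[:, None]).sum(0) + 1e-10)
    return w, c, h
```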
Once the nominal behavior is established, feedback models are trained on demonstrations collected under varied environmental settings. The novelty lies in using deviations of the sensed traces from their expected values, particularly in the tactile channels, as the input from which a learned feedback model generates corrective adaptations to the plan.
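The supervised setup can be pictured as follows: the deviation between a perturbed sensor trace and the expected (nominal) trace at each phase step becomes the input, and the adaptation needed to correct the plan becomes the regression target. The function name and the way targets are supplied below are hypothetical; the paper's exact target extraction may differ.

```python
import numpy as np

def build_feedback_dataset(expected_trace, perturbed_traces, coupling_targets):
    """Pair sensor-trace deviations with the plan adaptations that correct
    them. All arrays are aligned by phase step. Illustrative sketch.
    expected_trace:   (T, D_s) nominal sensor trace (e.g., mean over demos)
    perturbed_traces: list of (T, D_s) traces from perturbed settings
    coupling_targets: list of (T, D_c) required plan adaptations
    """
    X, Y = [], []
    for s_actual, c_target in zip(perturbed_traces, coupling_targets):
        X.append(s_actual - expected_trace)  # sensor-deviation features
        Y.append(c_target)                   # corrective-adaptation labels
    return np.concatenate(X), np.concatenate(Y)
```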
A key innovation of the paper is the introduction of Phase-Modulated Neural Networks (PMNNs). Unlike conventional feed-forward neural networks (FFNNs), PMNNs modulate their activations with the movement phase, which lets them capture the phase-dependent reactive adaptations that tactile-driven tasks require. Because the modulation drives the feedback term toward zero as the movement completes, PMNNs also help ensure convergence to the goal state, and they outperform FFNNs in both training and generalization across settings.
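A minimal PMNN sketch in PyTorch follows: a feed-forward trunk whose final hidden layer is gated, node by node, by radial basis functions of the movement phase, with the output scaled by the phase velocity. The layer sizes, kernel placement, and gating details are assumptions that follow the general idea rather than the paper's exact architecture.

```python
import torch
import torch.nn as nn

class PMNN(nn.Module):
    """Phase-modulated neural network sketch: sensor deviations pass
    through a feed-forward trunk, and the final hidden layer is gated by
    normalized phase kernels before a linear readout."""
    def __init__(self, in_dim, out_dim, hidden=(100, 50), n_phase_rbf=25):
        super().__init__()
        layers, d = [], in_dim
        for h in hidden:
            layers += [nn.Linear(d, h), nn.Tanh()]
            d = h
        self.trunk = nn.Sequential(*layers)
        self.to_modulated = nn.Linear(d, n_phase_rbf)        # final hidden layer
        self.out = nn.Linear(n_phase_rbf, out_dim, bias=False)
        # Fixed phase-kernel centers and widths on x in (0, 1] (assumed values).
        self.register_buffer("centers", torch.linspace(0.05, 1.0, n_phase_rbf))
        self.register_buffer("widths", torch.full((n_phase_rbf,), n_phase_rbf / 2.0))

    def forward(self, delta_s, x, xdot):
        """delta_s: (B, in_dim) sensor deviations; x, xdot: (B, 1) phase
        and phase velocity from the canonical system."""
        psi = torch.exp(-self.widths * (x - self.centers) ** 2)
        psi = psi / (psi.sum(dim=1, keepdim=True) + 1e-10)   # normalized kernels
        h = self.to_modulated(self.trunk(delta_s)) * psi     # phase gating
        # Scaling by the phase velocity sends the feedback term to zero
        # at the end of the movement, supporting goal convergence.
        return self.out(h) * xdot
```

Training then reduces to regressing the network output onto the coupling-term targets from the demonstration data with a mean-squared-error loss.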
To address new, unseen environments, the paper advocates a sample-efficient RL approach: the feedback model is fine-tuned on the novel settings while retaining performance on the known ones, effectively expanding the range of conditions the model can adapt to.
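One common sample-efficient choice for this kind of episodic fine-tuning is a PI2-style reward-weighted update over a low-dimensional parameter vector (for example, the PMNN's output-layer weights). The sketch below is a generic version under that assumption; the paper's actual RL algorithm and cost function may differ in detail.

```python
import numpy as np

def pi2_update(theta, rollout_fn, n_rollouts=20, noise_std=0.05, temperature=10.0):
    """One PI2-style policy-improvement step on a flat parameter vector.
    rollout_fn(theta) executes the behavior in one setting and returns a
    scalar cost. Hedged sketch, not the authors' exact procedure."""
    eps = np.random.randn(n_rollouts, theta.size) * noise_std     # exploration
    costs = np.array([rollout_fn(theta + e) for e in eps])
    # Softmax over normalized negative costs: low-cost rollouts get
    # exponentially larger weight in the parameter average.
    c = (costs - costs.min()) / (costs.max() - costs.min() + 1e-10)
    w = np.exp(-temperature * c)
    w /= w.sum()
    return theta + w @ eps                                        # weighted update
```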
Experiments on a real robot performing a scraping task with varying board orientations underscore the effectiveness of the proposed framework. The task demands precise tactile feedback because the robot interacts with the environment indirectly through a held tool, a scenario that is difficult for traditional hand-crafted feedback mechanisms. Empirical results show significant performance improvements after RL fine-tuning: adaptation improves in the unseen settings while execution in the known settings remains consistent. The system can also interpolate between trained settings to handle intermediate, unseen ones, further reinforcing its versatility.
In practical terms, this framework offers a scalable solution for complex robotic tasks requiring feedback-driven adaptation. The theoretical contributions include the formulation of PMNNs and the integration of semi-automated segmentation techniques that make learning from demonstrations efficient and accurate.
Looking ahead, this research opens pathways for exploring multi-modal sensor integration, which could extend adaptability to a broader range of environmental variations. The balance struck between supervised and reinforcement learning can also inspire methodologies for adaptive control systems beyond tactile feedback, in robotic applications that demand nuanced responsiveness to dynamic and complex environments.