Controlling Assistive Robots with Learned Latent Actions
The paper presents an approach to simplifying the control of assistive robotic arms, tailored to users with physical disabilities who rely on such technology for everyday tasks. The challenge is that these arms have many degrees of freedom (DoFs), which makes them difficult to maneuver with simple human input devices like joysticks. The proposed solution uses learned latent actions: high-dimensional robot actions are embedded into low-dimensional latent spaces that humans can manipulate more intuitively.
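To make the core idea concrete, the loop below is a minimal sketch of latent-action teleoperation, not the paper's implementation: the user's low-DoF joystick input is treated as a latent action z, and a learned decoder, conditioned on the robot's current state, expands it into a high-DoF joint command. All names here (read_joystick, decode, the fixed linear map) are illustrative placeholders.

```python
import numpy as np

# Illustrative stand-ins; a real system would read the physical joystick
# and use the trained decoder network.
def read_joystick():
    return np.array([0.5, -0.2])           # 2-DoF user input (latent action z)

def decode(z, state):
    # Placeholder for the learned decoder a = d(z, s): a fixed linear map,
    # just to show the shapes involved (2-DoF input -> 7-DoF output).
    W = 0.1 * np.ones((7, z.size + state.size))
    return W @ np.concatenate([z, state])   # 7-DoF joint velocity command

state = np.zeros(7)                         # current joint configuration
for _ in range(100):                        # teleoperation loop
    z = read_joystick()                     # low-DoF human input
    action = decode(z, state)               # state-conditioned high-DoF action
    state = state + 0.01 * action           # integrate joint velocities
```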
Methodology and Models
The authors introduce a teleoperation algorithm that learns these latent actions from demonstration data. They formulate three properties that latent actions should satisfy to be user-friendly: controllability, consistency, and scalability. To learn the low-dimensional embeddings, the paper compares several models: autoencoders (AE), variational autoencoders (VAE), and their state-conditioned variants (cAE and cVAE). Because the cVAE conditions its decoder on the robot's current state, it best captures these user-friendly properties, mapping latent actions to intuitive high-DoF robot behaviors.
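As a rough sketch of how such a state-conditioned model might look in PyTorch (the dimensions, architecture, and loss weighting are illustrative assumptions, not the paper's exact configuration):

```python
import torch
import torch.nn as nn

class ConditionalVAE(nn.Module):
    """Minimal sketch of a state-conditioned VAE (cVAE) for latent actions.

    Dimensions are illustrative placeholders, not the paper's settings:
    state_dim  - robot state (e.g., joint positions)
    action_dim - high-DoF robot action
    latent_dim - low-DoF latent action (e.g., two joystick axes)
    """

    def __init__(self, state_dim=7, action_dim=7, latent_dim=2, hidden=64):
        super().__init__()
        # Encoder: embed a demonstrated (state, action) pair in latent space.
        self.encoder = nn.Sequential(
            nn.Linear(state_dim + action_dim, hidden), nn.Tanh(),
        )
        self.mu = nn.Linear(hidden, latent_dim)
        self.log_var = nn.Linear(hidden, latent_dim)
        # Decoder: conditioning on the current state is what distinguishes
        # the cVAE from a plain VAE here.
        self.decoder = nn.Sequential(
            nn.Linear(latent_dim + state_dim, hidden), nn.Tanh(),
            nn.Linear(hidden, action_dim),
        )

    def forward(self, state, action):
        h = self.encoder(torch.cat([state, action], dim=-1))
        mu, log_var = self.mu(h), self.log_var(h)
        z = mu + torch.randn_like(mu) * torch.exp(0.5 * log_var)  # reparameterize
        recon = self.decoder(torch.cat([z, state], dim=-1))
        return recon, mu, log_var

def loss_fn(recon, action, mu, log_var, beta=0.01):
    # Reconstruction error plus a beta-weighted KL regularizer
    # (beta is an assumed hyperparameter, not the paper's value).
    recon_loss = ((recon - action) ** 2).sum(dim=-1).mean()
    kl = -0.5 * (1 + log_var - mu.pow(2) - log_var.exp()).sum(dim=-1).mean()
    return recon_loss + beta * kl
```

Training would minimize reconstruction error on demonstrated (state, action) pairs plus the KL term; at teleoperation time only the decoder is kept, with the joystick supplying z.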
Numerical and Experimental Evaluation
The research includes simulations and user studies to evaluate the proposed method. Results on simulated tasks (Sine, Rotate, Circle, and Reach) show that state-conditioned models such as the cVAE outperform non-conditioned models in action reconstruction accuracy, controllability, and alignment with intuitive task dimensions. The cVAE not only minimizes reconstruction error but also keeps latent actions controllable, consistent, and scalable across the state space.
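As an example of the kind of measurement involved, reconstruction error on held-out demonstration pairs could be computed roughly as follows, a sketch assuming the ConditionalVAE above and random stand-in data in place of a real test set:

```python
import torch

@torch.no_grad()
def reconstruction_error(model, states, actions):
    """Mean squared error between demonstrated actions and the actions
    recovered by encoding and then decoding them (lower is better)."""
    recon, _, _ = model(states, actions)
    return ((recon - actions) ** 2).sum(dim=-1).mean().item()

# Hypothetical usage; a real evaluation would use held-out demonstrations.
model = ConditionalVAE()
states = torch.randn(128, 7)
actions = torch.randn(128, 7)
print(reconstruction_error(model, states, actions))
```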
Two human studies further validate the approach. With discrete goal spaces, robots using learned latent actions outperformed shared-autonomy baselines, achieving higher task success rates while requiring less input from users. In a continuous goal space setting, a cooking task, users completed the task faster and with less effort when controlling the robot with latent actions than with standard end-effector control. Subjectively, participants also reported that the cVAE felt natural and easy to use, suggesting its potential to improve user experience in human-robot interaction.
Practical and Theoretical Implications
From a practical perspective, the paper demonstrates a promising pathway toward making assistive robotic technology more accessible to individuals with physical limitations. By reducing the cognitive and physical demands of controlling high-DoF robotic arms through intuitive low-DoF interfaces, the methodology has the potential to enhance user independence and quality of life.
Theoretically, the work advances our understanding of how to embed high-dimensional robot behaviors into manageable latent spaces, and of how such embeddings behave as control interfaces under uncertainty and varying task demands. The shift toward learned, model-based control also raises interesting questions about how to optimize these embeddings for diverse robot architectures and human input devices.
Conclusion and Future Work
In conclusion, the paper presents a comprehensive approach to controlling assistive robots with learned latent actions, delivering substantial improvements in both objective task performance and subjective user experience. Future research directions include learning from fewer demonstrations, adapting to novel tasks without retraining, and handling dynamic environmental changes to better reflect real-world conditions.
The paper marks a significant stride toward more usable assistive robotics, opening the door to broader deployment in domains where user expertise or physical ability is limited.