
Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation (2401.02117v1)

Published 4 Jan 2024 in cs.RO, cs.AI, cs.CV, cs.LG, cs.SY, and eess.SY

Abstract: Imitation learning from human demonstrations has shown impressive performance in robotics. However, most results focus on table-top manipulation, lacking the mobility and dexterity necessary for generally useful tasks. In this work, we develop a system for imitating mobile manipulation tasks that are bimanual and require whole-body control. We first present Mobile ALOHA, a low-cost and whole-body teleoperation system for data collection. It augments the ALOHA system with a mobile base, and a whole-body teleoperation interface. Using data collected with Mobile ALOHA, we then perform supervised behavior cloning and find that co-training with existing static ALOHA datasets boosts performance on mobile manipulation tasks. With 50 demonstrations for each task, co-training can increase success rates by up to 90%, allowing Mobile ALOHA to autonomously complete complex mobile manipulation tasks such as sauteing and serving a piece of shrimp, opening a two-door wall cabinet to store heavy cooking pots, calling and entering an elevator, and lightly rinsing a used pan using a kitchen faucet. Project website: https://mobile-aloha.github.io


Summary

  • The paper introduces a cost-effective teleoperation system with a mobile base that enables sophisticated bimanual manipulation tasks.
  • It employs an imitation learning framework that co-trains on static and mobile ALOHA datasets, increasing task success rates by up to 90% with as few as 50 demonstrations per task.
  • The results demonstrate practical whole-body control for complex tasks like cooking and cleaning, paving the way for advanced domestic and industrial robotics.

Overview of "Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation"

The paper presents Mobile ALOHA, a system aimed at advancing bimanual mobile manipulation through imitation learning and teleoperation. The goal is to enable robots to perform complex bimanual manipulation tasks with whole-body control. The system extends the ALOHA framework by adding a mobile base and a portable, low-cost whole-body teleoperation interface for collecting demonstration data. This allows robots to undertake tasks that require both dexterity and mobility, such as cooking and cleaning, which are out of reach for static table-top systems.

Contributions

  1. Hardware Development:
    • Mobile ALOHA integrates a low-cost teleoperation system with a mobile base. It supports simultaneous control of the base and both arms, enabling intricate maneuvers such as opening cabinet doors or cooking, where mobility and manual dexterity are both essential.
    • The hardware is economical, with a total cost of approximately $32,000, and the design is open-sourced, making it accessible and replicable across research environments.
  2. Imitation Learning Approach:
    • The paper uses an imitation learning framework that trains policies from human demonstrations via supervised behavior cloning. Co-training with existing static ALOHA datasets substantially boosts learning on mobile manipulation tasks (see the data-mixing sketch after this list).
    • Remarkably, co-training with static data yields up to a 90% increase in task success rates with as few as 50 mobile demonstrations per task, showcasing the data efficiency of this approach.
  3. Complex Task Execution:
    • Mobile ALOHA autonomously accomplishes a variety of sophisticated tasks. These include culinary actions like sautéing and serving food, household chores like cleaning and organizing, and navigation tasks like using elevators. This highlights the practical applicability of the system in dynamic and real-world environments.
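To make the co-training recipe of contribution 2 concrete, here is a minimal sketch of how training batches might be mixed from the two data sources. The 50/50 sampling ratio and the zero-padding of static episodes up to the mobile action space are illustrative assumptions, not the authors' exact implementation; `mobile_dataset` and `static_dataset` are hypothetical stand-ins for loaded demonstration data.

```python
import random

# Illustrative co-training sampler (assumed details, not the authors' code).
# Static ALOHA episodes have 14-dim arm actions; mobile episodes add two
# base-velocity dimensions, so static actions are zero-padded to match.

def pad_static_actions(action_chunk):
    """Append zero (linear, angular) base velocity to each action step."""
    return [list(a) + [0.0, 0.0] for a in action_chunk]

def cotraining_sampler(mobile_dataset, static_dataset, mobile_ratio=0.5):
    """Yield (observation, action_chunk) pairs from both datasets.

    mobile_ratio: probability of drawing from the mobile dataset;
    0.5 is an assumed value, treated here as a tunable hyperparameter.
    """
    while True:
        if random.random() < mobile_ratio:
            yield random.choice(mobile_dataset)
        else:
            obs, actions = random.choice(static_dataset)
            yield obs, pad_static_actions(actions)
```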

Strong Numerical Results

The paper reports strong success rates across tasks, achieving over 80% success on most tasks with just 50 demonstrations. This attests to the robustness and efficacy of the co-training strategy, which combines static and mobile manipulation datasets. The approach is evaluated with advanced imitation learning methods such as ACT and Diffusion Policy, both of which benefit from the additional data that co-training provides.
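As context for the ACT results: ACT (Action Chunking with Transformers) predicts a chunk of future actions at every timestep and smooths overlapping chunks at inference time with temporal ensembling. Below is a minimal sketch of that inference loop, assuming a trained `policy` callable and a generic `env`; the chunk size and weighting temperature are typical values from the ACT paper, not necessarily Mobile ALOHA's exact settings.

```python
import numpy as np

K = 100          # action chunk size (a typical ACT setting)
ACTION_DIM = 16  # e.g. 14 arm joints + 2 base velocities for Mobile ALOHA
M = 0.01         # exponential weighting temperature

def rollout_with_temporal_ensembling(policy, env, horizon):
    """Run a policy that outputs a (K, ACTION_DIM) chunk each step.

    At timestep t we average every prediction ever made for t, with
    exponential weights exp(-M * i) where i = 0 is the oldest prediction,
    so earlier chunks dominate and the executed trajectory stays smooth.
    """
    # buffer[t] accumulates all predictions targeting timestep t
    buffer = [[] for _ in range(horizon + K)]
    obs = env.reset()
    for t in range(horizon):
        chunk = policy(obs)                 # shape (K, ACTION_DIM)
        for i in range(K):
            buffer[t + i].append(chunk[i])
        preds = np.stack(buffer[t])         # all predictions for step t
        w = np.exp(-M * np.arange(len(preds)))
        action = (w[:, None] * preds).sum(axis=0) / w.sum()
        obs = env.step(action)
```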

Implications and Future Work

Practical Implications:

  • Affordability and Accessibility: By reducing the cost of teleoperation systems, Mobile ALOHA democratizes access to sophisticated robotics research, enabling more institutions to participate in advancing mobile robot capabilities.
  • Broadening Use Cases: The versatility demonstrated suggests potential applications in domestic service robotics, aiding elderly care, hospitality, and maintenance tasks, thereby increasing the societal impact of robotics.

Theoretical Implications:

  • Imitation Learning Advances: The results contribute to the growing literature on imitation learning in robotics, particularly demonstrating the effectiveness of co-training on diverse datasets.
  • Action Coordination and Control: The research motivates further exploration of coordination mechanisms for mobile manipulators, emphasizing whole-body control, in which base and arm actions are predicted jointly by a single policy, as a vehicle for nuanced task execution (sketched below).
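One concrete reading of "whole-body control" here is that the policy's action space simply concatenates both arms' joint targets with the base's linear and angular velocity, so one network commands the full 16-dimensional action. The sketch below illustrates that formulation; the `WholeBodyAction` container and its field layout are illustrative, not the authors' code.

```python
from dataclasses import dataclass

import numpy as np

@dataclass
class WholeBodyAction:
    """Illustrative container for a 16-dim whole-body action."""
    arm_joints: np.ndarray  # 14 targets: two 6-DoF arms plus two grippers
    base_linear: float      # base linear velocity (m/s)
    base_angular: float     # base angular velocity (rad/s)

    def to_vector(self) -> np.ndarray:
        """Flatten into the 16-dim vector a whole-body policy predicts."""
        return np.concatenate(
            [self.arm_joints, [self.base_linear, self.base_angular]])

    @classmethod
    def from_vector(cls, v: np.ndarray) -> "WholeBodyAction":
        assert v.shape == (16,)
        return cls(arm_joints=v[:14],
                   base_linear=float(v[14]),
                   base_angular=float(v[15]))
```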

Speculation on Future Developments

Future iterations of mobile manipulation systems could gain real-time adaptability through reinforcement learning, reducing reliance on pre-collected human demonstrations. Integrating richer sensor fusion and augmented-reality teleoperation interfaces may further improve teleoperation efficacy, aligning robot actions more closely with human intuition. Extending this research to multi-robot systems could also enable collaborative tasks, expanding the horizon for industrial and commercial robotics applications.

In conclusion, "Mobile ALOHA: Learning Bimanual Mobile Manipulation with Low-Cost Whole-Body Teleoperation" provides a substantial leap forward in both hardware and software dimensions of mobile manipulation. Its blend of cost-effective design and robust learning methodology makes it a notable contribution to the field of robotics, offering a blueprint for developing more adaptable, skillful robotic systems.
