Multi-Task Interactive Robot Fleet Learning with Visual World Models (2410.22689v1)
Abstract: Recent advancements in large-scale multi-task robot learning offer the potential for deploying robot fleets in household and industrial settings, enabling them to perform diverse tasks across various environments. However, AI-enabled robots often face challenges with generalization and robustness when exposed to real-world variability and uncertainty. We introduce Sirius-Fleet, a multi-task interactive robot fleet learning framework to address these challenges. Sirius-Fleet monitors robot performance during deployment and involves humans to correct the robot's actions when necessary. We employ a visual world model to predict the outcomes of future actions and build anomaly predictors to predict whether they will likely result in anomalies. As the robot autonomy improves, the anomaly predictors automatically adapt their prediction criteria, leading to fewer requests for human intervention and gradually reducing human workload over time. Evaluations on large-scale benchmarks demonstrate Sirius-Fleet's effectiveness in improving multi-task policy performance and monitoring accuracy. We demonstrate Sirius-Fleet's performance in both RoboCasa in simulation and Mutex in the real world, two diverse, large-scale multi-task benchmarks. More information is available on the project website: https://ut-austin-rpl.github.io/sirius-fleet
- Rt-1: Robotics transformer for real-world control at scale, 2022.
- A generalist agent, 2022.
- Vint: A foundation model for visual navigation, 2023.
- Fleet-dagger: Interactive robot fleet learning with scalable human supervision. In Conference on Robot Learning, pages 368–380. PMLR, 2023.
- Robot fleet learning via policy merging, 2024.
- Openbot-fleet: A system for collective learning with real robots, 2024.
- Human-in-the-loop imitation learning using remote teleoperation. In arXiv preprint arXiv:2012.06733, 2020.
- Correct me if i am wrong: Interactive learning for robotic manipulation. In RAL, volume 7, pages 3695–3702, 2021.
- Learning from interventions: Human-robot interaction as both explicit and implicit feedback. In 16th Robotics: Science and Systems, RSS 2020. MIT Press Journals, 2020.
- Hg-dagger: Interactive imitation learning with human experts. 2019 International Conference on Robotics and Automation (ICRA), May 2019. doi:10.1109/icra.2019.8793698. URL http://dx.doi.org/10.1109/ICRA.2019.8793698.
- Lazydagger: Reducing context switching in interactive imitation learning. In 2021 IEEE 17th International Conference on Automation Science and Engineering (CASE), pages 502–509. IEEE, 2021a.
- Thriftydagger: Budget-aware novelty and risk gating for interactive imitation learning. arXiv preprint arXiv:2109.08273, 2021b.
- Robot learning on the job: Human-in-the-loop autonomy and learning during deployment. In Robotics: Science and Systems (RSS), 2023.
- Fleet-dagger: Interactive robot fleet learning with scalable human supervision, 2022.
- Efficient learning of safe driving policy via human-ai copilot optimization, 2022.
- E. Yel and N. Bezzo. Fast run-time monitoring, replanning, and recovery for safe autonomous system operations. In 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pages 1661–1667, 2019. doi:10.1109/IROS40897.2019.8968498.
- Model-based runtime monitoring with interactive imitation learning. In IEEE International Conference on Robotics and Automation (ICRA), 2024.
- A unified survey on anomaly, novelty, open-set, and out-of-distribution detection: Solutions and future challenges. CoRR, abs/2110.14051, 2021. URL https://arxiv.org/abs/2110.14051.
- C. Richter and N. Roy. Safe visual navigation via deep learning and novelty detection. Robotics: Science and Systems, 07 2017.
- Error-aware imitation learning from teleoperation data for mobile manipulation. In Proceedings of the 5th Conference on Robot Learning, volume 164 of Proceedings of Machine Learning Research, pages 1367–1378, 2022.
- Pato: Policy assisted teleoperation for scalable robot data collection, 2022.
- Video generation models as world simulators. 2024.
- Genie: Generative interactive environments, 2024.
- Pandora: Towards general world model with natural language actions and video states. 2024.
- Gaia-1: A generative world model for autonomous driving, 2023.
- Vima: General robot manipulation with multimodal prompts. In Fortieth International Conference on Machine Learning, 2023.
- Mutex: Learning unified policies from multimodal task specifications. In 7th Annual Conference on Robot Learning, 2023. URL https://openreview.net/forum?id=PwqiqaaEzJ.
- Rt-2: Vision-language-action models transfer web knowledge to robotic control, 2023.
- Bc-z: Zero-shot task generalization with robotic imitation learning, 2022.
- Robocat: A self-improving generalist agent for robotic manipulation, 2023.
- Octo: An open-source generalist robot policy, 2024.
- What matters in learning from offline human demonstrations for robot manipulation, 2021.
- Denoising diffusion probabilistic models, 2020.
- Diffusion policy: Visuomotor policy learning via action diffusion, 2024.
- Bridge data: Boosting generalization of robotic skills with cross-domain datasets, 2021.
- Open X-Embodiment Collaboration et al. Open X-Embodiment: Robotic learning datasets and RT-X models. In Proceedings of the IEEE International Conference on Robotics and Automation (ICRA), Yokohama, Japan, 2024.
- Droid: A large-scale in-the-wild robot manipulation dataset, 2024.
- Mimicgen: A data generation system for scalable robot learning using human demonstrations. In 7th Annual Conference on Robot Learning, 2023.
- Robocasa: Large-scale simulation of everyday tasks for generalist robots. In Robotics: Science and Systems (RSS), 2024.
- QT-Opt: Scalable deep reinforcement learning for vision-based robotic manipulation. In CoRL, 2018.
- Aloha 2: An enhanced low-cost hardware for bimanual teleoperation, 2024.
- Autort: Embodied foundation models for large scale orchestration of robotic agents, 2024.
- Scaled autonomy: Enabling human operators to control robot fleets, 2020.
- Decentralized data collection for robotic fleet learning: A game-theoretic approach. In K. Liu, D. Kulic, and J. Ichnowski, editors, Proceedings of The 6th Conference on Robot Learning, volume 205 of Proceedings of Machine Learning Research, pages 978–988. PMLR, 14–18 Dec 2023a. URL https://proceedings.mlr.press/v205/akcin23a.html.
- Fleet active learning: A submodular maximization approach. In J. Tan, M. Toussaint, and K. Darvish, editors, Proceedings of The 7th Conference on Robot Learning, volume 229 of Proceedings of Machine Learning Research, pages 1378–1399. PMLR, 06–09 Nov 2023b. URL https://proceedings.mlr.press/v229/akcin23a.html.
- The safety filter: A unified view of safety-critical control in autonomous systems. 2023. URL https://api.semanticscholar.org/CorpusID:261697421.
- Closing the loop on runtime monitors with fallback-safe mpc. In Proc. IEEE Conf. on Decision and Control, 2023.
- Not all errors are made equal: A regret metric for detecting system-level trajectory prediction failures. In 8th Conference on Robot Learning, 2024.
- A system-level view on out-of-distribution data in robotics, 2022.
- EnsembleDagger: A bayesian approach to safe imitation learning. In 2019 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pages 5041–5048. IEEE, 2019.
- Neural networks for prediction of robot failures. Proceedings of the Institution of Mechanical Engineers, Part C: Journal of Mechanical Engineering Science, 228(8):1444–1458, 2014.
- Asking for help: Failure prediction in behavioral cloning through value approximation. ArXiv, abs/2302.04334, 2023.
- When to ask for help: Proactive interventions in autonomous reinforcement learning, 2022.
- Robots that ask for help: Uncertainty alignment for large language model planners, 2023.
- Reflect: Summarizing robot experiences for failure explanation and correction, 2023.
- ”task success” is not enough: Investigating the use of video-language models as behavior critics for catching undesirable agent behaviors, 2024.
- Model-based imitation learning for urban driving, 2022.
- U-net: Convolutional networks for biomedical image segmentation, 2015.
- Learning structured output representation using deep conditional generative models. In C. Cortes, N. Lawrence, D. Lee, M. Sugiyama, and R. Garnett, editors, Advances in Neural Information Processing Systems, volume 28. Curran Associates, Inc., 2015. URL https://proceedings.neurips.cc/paper_files/paper/2015/file/8d55a249e6baa5c06772297520da2051-Paper.pdf.
- Daydreamer: World models for physical robot learning. In Conference on Robot Learning, pages 2226–2240. PMLR, 2023.
- Attention is all you need, 2023.
- Roboturk: A crowdsourcing platform for robotic skill learning through imitation. In CoRL, pages 879–893, 2018.
Sponsor
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.