LIT: Large Language Model Driven Intention Tracking for Proactive Human-Robot Collaboration -- A Robot Sous-Chef Application (2406.13787v1)
Published 19 Jun 2024 in cs.RO and cs.CV
Abstract: Large Language Models (LLMs) and Vision Language Models (VLMs) enable robots to ground natural language prompts into control actions to achieve tasks in an open world. However, when applied to a long-horizon collaborative task, this formulation results in excessive prompting for initiating or clarifying robot actions at every step of the task. We propose Language-driven Intention Tracking (LIT), leveraging LLMs and VLMs to model the human user's long-term behavior and to predict the next human intention to guide the robot for proactive collaboration. We demonstrate smooth coordination between a LIT-based collaborative robot and the human user in collaborative cooking tasks.
- Zhe Huang (57 papers)
- John Pohovey (3 papers)
- Ananya Yammanuru (4 papers)
- Katherine Driggs-Campbell (77 papers)