VoicePilot: Harnessing LLMs as Speech Interfaces for Physically Assistive Robots (2404.04066v2)
Abstract: Physically assistive robots present an opportunity to significantly increase the well-being and independence of individuals with motor impairments or other forms of disability who are unable to complete activities of daily living. Speech interfaces, especially ones that utilize LLMs, can enable individuals to effectively and naturally communicate high-level commands and nuanced preferences to robots. Frameworks for integrating LLMs as interfaces to robots for high level task planning and code generation have been proposed, but fail to incorporate human-centric considerations which are essential while developing assistive interfaces. In this work, we present a framework for incorporating LLMs as speech interfaces for physically assistive robots, constructed iteratively with 3 stages of testing involving a feeding robot, culminating in an evaluation with 11 older adults at an independent living facility. We use both quantitative and qualitative data from the final study to validate our framework and additionally provide design guidelines for using LLMs as speech interfaces for assistive robots. Videos and supporting files are located on our project website: https://sites.google.com/andrew.cmu.edu/voicepilot/
- [n. d.]. ATLAS.ti — The #1 Software for Qualitative Data Analysis. https://atlasti.com/
- [n. d.]. Obi Feeding Robot. https://meetobi.com/
- [n. d.]. OpenAI API - GPT-3.5 Turbo. https://platform.openai.com/docs/models/gpt-3-5-turbo
- [n. d.]. OpenAI API - Whisper. https://platform.openai.com/docs/models/whisper
- [n. d.]. Porcupine Wake Word Python API. https://picovoice.ai/docs/api/porcupine-python/
- How to prompt your robot: A promptbook for manipulation skills with code as policies. In Towards Generalist Robots: Learning Paradigms for Scalable Skill Acquisition@ CoRL2023.
- Prevalence and causes of paralysis—United States, 2013. American journal of public health 106, 10 (2016), 1855–1857.
- Balancing efficiency and comfort in robot-assisted bite transfer. In 2022 International Conference on Robotics and Automation (ICRA). IEEE, 4757–4763.
- A community-centered design framework for robot-assisted feeding systems. In Proceedings of the 21st International ACM SIGACCESS Conference on Computers and Accessibility. 482–494.
- Is more autonomy always better? exploring preferences of users with mobility impairments in robot-assisted feeding. In Proceedings of the 2020 ACM/IEEE international conference on human-robot interaction. 181–190.
- An exploration of accessible remote tele-operation for assistive mobile manipulators in the home. In 2021 30th IEEE International Conference on Robot & Human Interactive Communication (RO-MAN). IEEE, 1202–1209.
- Robots for humanity: using assistive robotics to empower people with disabilities. IEEE Robotics & Automation Magazine 20, 1 (2013), 30–39.
- Mobile manipulation through an assistive home robot. In 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems. IEEE, 5313–5320.
- Control barrier functions for mechanical systems: Theory and application to robotic grasping. IEEE Transactions on Control Systems Technology 29, 2 (2019), 530–545.
- Health care robotics: qualitative exploration of key challenges and future directions. Journal of medical Internet research 20, 7 (2018), e10410.
- MPJM Dijkers. 2005. Quality of life of individuals with spinal cord injury: a review of conceptualization, measurement, and research findings. Journal of rehabilitation research and development 42, 3 (2005), 87.
- Health implications of physical activity in individuals with spinal cord injury: a literature review. Journal of health and human services administration 30 4 (2008), 468–502.
- Foundation models in robotics: Applications, challenges, and the future. arXiv preprint arXiv:2312.07843 (2023).
- Behavioral adaptation and late-life disability: a new spectrum for assessing public health impacts. American journal of public health 104, 2 (2014), e88–e94.
- Quality of life in adults with multiple sclerosis: a systematic review. BMJ open 10, 11 (2020), e041249.
- An Adaptable, Safe, and Portable Robot-Assisted Feeding System. In Companion of the 2024 ACM/IEEE International Conference on Human-Robot Interaction. 74–76.
- Emerging research fields in safety and ergonomics in industrial collaborative robotics: A systematic literature review. Robotics and Computer-Integrated Manufacturing 67 (2021), 101998.
- Assistive mobile manipulation for self-care tasks around the head. In 2014 IEEE Symposium on computational intelligence in robotic rehabilitation and assistive technologies (CIR2AT). IEEE, 16–25.
- The VoiceBot: a voice controlled robot arm. In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems. 183–192.
- Deploying and Evaluating LLMs to Program Service Mobile Robots. IEEE Robotics and Automation Letters (2024).
- Shared autonomy via hindsight optimization for teleoperation and teaming. The International Journal of Robotics Research 37, 7 (2018), 717–742.
- Feel the Bite: Robot-Assisted Inside-Mouth Bite Transfer using Robust Mouth Perception and Physical Interaction-Aware Control. In Proceedings of the 2024 ACM/IEEE International Conference on Human-Robot Interaction. 313–322.
- Real-World Robot Applications of Foundation Models: A Review. arXiv preprint arXiv:2402.05741 (2024).
- Can an LLM-Powered Socially Assistive Robot Effectively and Safely Deliver Cognitive Behavioral Therapy? A Study With University Students. arXiv preprint arXiv:2402.17937 (2024).
- Understanding Large-Language Model (LLM)-powered Human-Robot Interaction. arXiv preprint arXiv:2401.03217 (2024).
- Dusty: an assistive mobile manipulator that retrieves dropped objects for people with motor impairments. Disability and Rehabilitation: Assistive Technology 7, 2 (2012), 168–179.
- Comparative performance analysis of M-IMU/EMG and voice user interfaces for assistive robots. In 2017 International Conference on Rehabilitation Robotics (ICORR). IEEE, 1001–1006.
- Code as policies: Language model programs for embodied control. In 2023 IEEE International Conference on Robotics and Automation (ICRA). IEEE, 9493–9500.
- Language, Camera, Autonomy! Prompt-engineered Robot Control for Rapidly Evolving Deployment. In Companion of the 2024 ACM/IEEE International Conference on Human-Robot Interaction. 717–721.
- Design principles for robot-assisted feeding in social contexts. In Proceedings of the 2023 ACM/IEEE International Conference on Human-Robot Interaction. 24–33.
- Physically Assistive Robots: A Systematic Review of Mobile and Manipulator Robots That Physically Assist People with Disabilities. Annual Review of Control, Robotics, and Autonomous Systems 7 (2023).
- Independence in the Home: A Wearable Interface for a Person with Quadriplegia to Teleoperate a Mobile Manipulator. In Proceedings of the 2024 ACM/IEEE International Conference on Human-Robot Interaction. 542–551.
- HAT: Head-Worn Assistive Teleoperation of Mobile Manipulators. In 2023 IEEE International Conference on Robotics and Automation (ICRA). IEEE, 12542–12548.
- Abhilash Pandya. 2023. ChatGPT-enabled daVinci Surgical Robot prototype: advancements and limitations. Robotics 12, 4 (2023), 97.
- A multimodal anomaly detector for robot-assisted feeding using an lstm-based variational autoencoder. IEEE Robotics and Automation Letters 3, 3 (2018), 1544–1551.
- Voice control interface prototype for assistive robots for people living with upper limb disabilities. In 2019 IEEE 16th International Conference on Rehabilitation Robotics (ICORR). IEEE, 46–52.
- A voice control system for assistive robotic arms: preliminary usability tests on patients. In 2018 7th IEEE International Conference on Biomedical Robotics and Biomechatronics (Biorob). IEEE, 167–172.
- Vinitha Ranganeni. 2024. Customizing Tele-Operation Interfaces of Assistive Robots at Home with Occupational Therapists. In Companion of the 2024 ACM/IEEE International Conference on Human-Robot Interaction. 142–144.
- Evaluating Customization of Remote Tele-operation Interfaces for Assistive Robots. In 2023 32nd IEEE International Conference on Robot and Human Interactive Communication (RO-MAN). IEEE, 1633–1640.
- Constrained robot control using control barrier functions. In 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, 279–285.
- Defining aging in place: The intersectionality of space, person, and time. Innovation in aging 4, 4 (2020), igaa036.
- A systematic review of depression and anxiety measures used with individuals with spinal cord injury. Spinal Cord 47, 12 (2009), 841–851. https://doi.org/10.1038/sc.2009.93
- Progprompt: Generating situated robot task plans using large language models. In 2023 IEEE International Conference on Robotics and Automation (ICRA). IEEE, 11523–11530.
- What do older adults want from social robots? A qualitative research approach to human-robot interaction (HRI) studies. International Journal of Social Robotics 15, 3 (2023), 411–424.
- Chatgpt for robotics: Design principles and model abilities. arXiv preprint arXiv:2306.17582 (2023).
- Grid: A platform for general robot intelligence development. arXiv preprint arXiv:2310.00887 (2023).
- Tidybot: Personalized robot assistance with large language models. Autonomous Robots 47, 8 (2023), 1087–1102.
- Large language models for human-robot interaction: A review. Biomimetic Intelligence and Robotics (2023), 100131.