Large Language Models for Robotics: Opportunities, Challenges, and Perspectives (2401.04334v1)

Published 9 Jan 2024 in cs.RO and cs.AI

Abstract: LLMs have undergone significant expansion and have been increasingly integrated across various domains. Notably, in the realm of robot task planning, LLMs harness their advanced reasoning and language comprehension capabilities to formulate precise and efficient action plans based on natural language instructions. However, for embodied tasks, where robots interact with complex environments, text-only LLMs often face challenges due to a lack of compatibility with robotic visual perception. This study provides a comprehensive overview of the emerging integration of LLMs and multimodal LLMs into various robotic tasks. Additionally, we propose a framework that utilizes multimodal GPT-4V to enhance embodied task planning through the combination of natural language instructions and robot visual perceptions. Our results, based on diverse datasets, indicate that GPT-4V effectively enhances robot performance in embodied tasks. This extensive survey and evaluation of LLMs and multimodal LLMs across a variety of robotic tasks enriches the understanding of LLM-centric embodied intelligence and provides forward-looking insights toward bridging the gap in Human-Robot-Environment interaction.
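
The abstract describes a framework that feeds a multimodal model (GPT-4V) both a natural-language instruction and the robot's visual perception to produce an embodied task plan. As a rough illustration of that idea only, and not the paper's actual implementation, the sketch below shows one way to prompt a GPT-4V-class vision model with an instruction plus a camera frame and ask it for a step-by-step action plan; it assumes the OpenAI Python client, and the model name, prompt wording, and action vocabulary are illustrative placeholders.

```python
# Minimal sketch (not the paper's code): query a GPT-4V-style multimodal model
# with a task instruction and a robot camera image, and get back an action plan.
# Assumes the OpenAI Python client (v1.x); model name and prompt are illustrative.
import base64
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment


def plan_from_instruction_and_image(instruction: str, image_path: str) -> str:
    """Ask a multimodal LLM for a numbered, executable action plan."""
    with open(image_path, "rb") as f:
        image_b64 = base64.b64encode(f.read()).decode("utf-8")

    response = client.chat.completions.create(
        model="gpt-4o",  # placeholder for any GPT-4V-class vision model
        messages=[{
            "role": "user",
            "content": [
                {"type": "text",
                 "text": (f"Instruction: {instruction}\n"
                          "Using the attached view of the robot's workspace, "
                          "list a numbered sequence of primitive actions "
                          "(e.g. move_to, grasp, place) that completes the task.")},
                {"type": "image_url",
                 "image_url": {"url": f"data:image/jpeg;base64,{image_b64}"}},
            ],
        }],
    )
    return response.choices[0].message.content


# Example usage (hypothetical files and task):
# print(plan_from_instruction_and_image("put the apple in the bowl", "frame.jpg"))
```

In the paper's setting the returned plan would then be grounded against the robot's skill library and executed, with the visual input helping the model avoid actions that are infeasible in the observed scene.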

Authors (20)
  1. Jiaqi Wang (218 papers)
  2. Zihao Wu (100 papers)
  3. Yiwei Li (107 papers)
  4. Hanqi Jiang (27 papers)
  5. Peng Shu (34 papers)
  6. Enze Shi (13 papers)
  7. Huawen Hu (6 papers)
  8. Chong Ma (28 papers)
  9. Yiheng Liu (24 papers)
  10. Xuhui Wang (22 papers)
  11. Yincheng Yao (1 paper)
  12. Xuan Liu (94 papers)
  13. Huaqin Zhao (16 papers)
  14. Zhengliang Liu (91 papers)
  15. Haixing Dai (39 papers)
  16. Lin Zhao (227 papers)
  17. Bao Ge (17 papers)
  18. Xiang Li (1002 papers)
  19. Tianming Liu (161 papers)
  20. Shu Zhang (286 papers)
Citations (42)