Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
41 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

LLM as A Robotic Brain: Unifying Egocentric Memory and Control (2304.09349v4)

Published 19 Apr 2023 in cs.AI, cs.CL, and cs.RO

Abstract: Embodied AI focuses on the study and development of intelligent systems that possess a physical or virtual embodiment (i.e. robots) and are able to dynamically interact with their environment. Memory and control are the two essential parts of an embodied system and usually require separate frameworks to model each of them. In this paper, we propose a novel and generalizable framework called LLM-Brain: using Large-scale LLM as a robotic brain to unify egocentric memory and control. The LLM-Brain framework integrates multiple multimodal LLMs for robotic tasks, utilizing a zero-shot learning approach. All components within LLM-Brain communicate using natural language in closed-loop multi-round dialogues that encompass perception, planning, control, and memory. The core of the system is an embodied LLM to maintain egocentric memory and control the robot. We demonstrate LLM-Brain by examining two downstream tasks: active exploration and embodied question answering. The active exploration tasks require the robot to extensively explore an unknown environment within a limited number of actions. Meanwhile, the embodied question answering tasks necessitate that the robot answers questions based on observations acquired during prior explorations.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Jinjie Mai (12 papers)
  2. Jun Chen (374 papers)
  3. Bing Li (374 papers)
  4. Guocheng Qian (23 papers)
  5. Mohamed Elhoseiny (102 papers)
  6. Bernard Ghanem (255 papers)
Citations (29)
Youtube Logo Streamline Icon: https://streamlinehq.com