
Robots Can Multitask Too: Integrating a Memory Architecture and LLMs for Enhanced Cross-Task Robot Action Generation (2407.13505v2)

Published 18 Jul 2024 in cs.RO and cs.AI

Abstract: LLMs have recently been used in robot applications to ground LLM common-sense reasoning in the robot's perception and physical abilities. In humanoid robots, memory also plays a critical role in fostering real-world embodiment and facilitating long-term interactive capabilities, especially in multi-task setups where the robot must remember previous task states, environment states, and executed actions. In this paper, we address incorporating memory processes with LLMs for generating cross-task robot actions, while the robot effectively switches between tasks. Our proposed dual-layered architecture features two LLMs, utilizing their complementary skills of reasoning and following instructions, combined with a memory model inspired by human cognition. Our results show a significant improvement in performance over a baseline across five robotic tasks, demonstrating the potential of integrating memory with LLMs for combining the robot's action and perception for adaptive task execution.

Authors (7)
  1. Hassan Ali (24 papers)
  2. Philipp Allgeuer (33 papers)
  3. Carlo Mazzola (8 papers)
  4. Giulia Belgiovine (8 papers)
  5. Burak Can Kaplan (1 paper)
  6. Stefan Wermter (157 papers)
  7. Lukáš Gajdošech (9 papers)