Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
38 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Interactive Planning Using Large Language Models for Partially Observable Robotics Tasks (2312.06876v1)

Published 11 Dec 2023 in cs.RO and cs.AI

Abstract: Designing robotic agents to perform open vocabulary tasks has been the long-standing goal in robotics and AI. Recently, LLMs have achieved impressive results in creating robotic agents for performing open vocabulary tasks. However, planning for these tasks in the presence of uncertainties is challenging as it requires \enquote{chain-of-thought} reasoning, aggregating information from the environment, updating state estimates, and generating actions based on the updated state estimates. In this paper, we present an interactive planning technique for partially observable tasks using LLMs. In the proposed method, an LLM is used to collect missing information from the environment using a robot and infer the state of the underlying problem from collected observations while guiding the robot to perform the required actions. We also use a fine-tuned Llama 2 model via self-instruct and compare its performance against a pre-trained LLM like GPT-4. Results are demonstrated on several tasks in simulation as well as real-world environments. A video describing our work along with some results could be found here.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (8)
  1. Lingfeng Sun (25 papers)
  2. Devesh K. Jha (46 papers)
  3. Chiori Hori (21 papers)
  4. Siddarth Jain (13 papers)
  5. Radu Corcodel (8 papers)
  6. Xinghao Zhu (26 papers)
  7. Masayoshi Tomizuka (261 papers)
  8. Diego Romeres (45 papers)
Citations (10)