Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
169 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Interpreting and learning voice commands with a Large Language Model for a robot system (2407.21512v1)

Published 31 Jul 2024 in cs.RO, cs.CL, and cs.NE

Abstract: Robots are increasingly common in industry and daily life, such as in nursing homes where they can assist staff. A key challenge is developing intuitive interfaces for easy communication. The use of LLMs like GPT-4 has enhanced robot capabilities, allowing for real-time interaction and decision-making. This integration improves robots' adaptability and functionality. This project focuses on merging LLMs with databases to improve decision-making and enable knowledge acquisition for request interpretation problems.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (9)
  1. Voice in human–agent interaction: A survey. ACM Comput. Surv., 54(4), 2021. ISSN 0360-0300. 10.1145/3386867. URL https://doi.org/10.1145/3386867.
  2. Robots that can chat. Boston Dynamics Blog, 2023. URL https://bostondynamics.com/blog/robots-that-can-chat/. Accessed: 2024-02-04.
  3. Tiago: the modular robot that adapts to different research needs. 2016. URL https://api.semanticscholar.org/CorpusID:218478582.
  4. An intent-based approach for creating assistive robots’ control systems. CoRR, abs/2005.12106, 2020. URL https://arxiv.org/abs/2005.12106.
  5. Ros: an open-source robot operating system. In ICRA workshop on open source software, volume 3, page 5. Kobe, Japan, 2009.
  6. Winiarski, T. Meros: Sysml-based metamodel for ros-based systems. IEEE Access, 11:82802–82815, 2023. 10.1109/ACCESS.2023.3301727.
  7. Scheduling of a robot’s tasks with the tasker framework. IEEE Access, 8:161449–161471, 2020. 10.1109/ACCESS.2020.3020265.
  8. Dudek, W. Prudent management of interruptible tasks executed by a service robot. Ph.D. thesis, Warsaw University of Technology, 2021. URL https://robotyka.ia.pw.edu.pl/papers/phd_thesis_wd.pdf.
  9. The smach high-level executive [ros news]. IEEE Robotics & Automation Magazine, 17(4):18–20, 2010. 10.1109/MRA.2010.938836.
Citations (2)

Summary

We haven't generated a summary for this paper yet.