Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Collecting Interactive Multi-modal Datasets for Grounded Language Understanding (2211.06552v3)

Published 12 Nov 2022 in cs.CL and cs.AI

Abstract: Human intelligence can remarkably adapt quickly to new tasks and environments. Starting from a very young age, humans acquire new skills and learn how to solve new tasks either by imitating the behavior of others or by following provided natural language instructions. To facilitate research which can enable similar capabilities in machines, we made the following contributions (1) formalized the collaborative embodied agent using natural language task; (2) developed a tool for extensive and scalable data collection; and (3) collected the first dataset for interactive grounded language understanding.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (12)
  1. Shrestha Mohanty (12 papers)
  2. Negar Arabzadeh (28 papers)
  3. Milagro Teruel (6 papers)
  4. Yuxuan Sun (79 papers)
  5. Artem Zholus (17 papers)
  6. Alexey Skrynnik (22 papers)
  7. Mikhail Burtsev (27 papers)
  8. Kavya Srinet (13 papers)
  9. Aleksandr Panov (26 papers)
  10. Arthur Szlam (86 papers)
  11. Marc-Alexandre Côté (42 papers)
  12. Julia Kiseleva (33 papers)
Citations (13)

Summary

We haven't generated a summary for this paper yet.