Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

One-Shot Hierarchical Imitation Learning of Compound Visuomotor Tasks (1810.11043v1)

Published 25 Oct 2018 in cs.LG, cs.AI, cs.CV, cs.RO, and stat.ML

Abstract: We consider the problem of learning multi-stage vision-based tasks on a real robot from a single video of a human performing the task, while leveraging demonstration data of subtasks with other objects. This problem presents a number of major challenges. Video demonstrations without teleoperation are easy for humans to provide, but do not provide any direct supervision. Learning policies from raw pixels enables full generality but calls for large function approximators with many parameters to be learned. Finally, compound tasks can require impractical amounts of demonstration data, when treated as a monolithic skill. To address these challenges, we propose a method that learns both how to learn primitive behaviors from video demonstrations and how to dynamically compose these behaviors to perform multi-stage tasks by "watching" a human demonstrator. Our results on a simulated Sawyer robot and real PR2 robot illustrate our method for learning a variety of order fulfiLLMent and kitchen serving tasks with novel objects and raw pixel inputs.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Tianhe Yu (36 papers)
  2. Pieter Abbeel (372 papers)
  3. Sergey Levine (531 papers)
  4. Chelsea Finn (264 papers)
Citations (67)