Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Behavioral Cloning via Search in Video PreTraining Latent Space (2212.13326v2)

Published 27 Dec 2022 in cs.LG, cs.AI, and cs.CV

Abstract: Our aim is to build autonomous agents that can solve tasks in environments like Minecraft. To do so, we used an imitation learning-based approach. We formulate our control problem as a search problem over a dataset of experts' demonstrations, where the agent copies actions from a similar demonstration trajectory of image-action pairs. We perform a proximity search over the BASALT MineRL-dataset in the latent representation of a Video PreTraining model. The agent copies the actions from the expert trajectory as long as the distance between the state representations of the agent and the selected expert trajectory from the dataset do not diverge. Then the proximity search is repeated. Our approach can effectively recover meaningful demonstration trajectories and show human-like behavior of an agent in the Minecraft environment.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Federico Malato (7 papers)
  2. Florian Leopold (4 papers)
  3. Amogh Raut (4 papers)
  4. Andrew Melnik (33 papers)
  5. Ville Hautamäki (30 papers)
Citations (10)
Youtube Logo Streamline Icon: https://streamlinehq.com