Learning to Solve Voxel Building Embodied Tasks from Pixels and Natural Language Instructions (2211.00688v1)

Published 1 Nov 2022 in cs.AI and cs.CL

Abstract: The adoption of pre-trained LLMs to generate action plans for embodied agents is a promising research strategy. However, execution of instructions in real or simulated environments requires verification of the feasibility of actions as well as their relevance to the completion of a goal. We propose a new method that combines a LLM and reinforcement learning for the task of building objects in a Minecraft-like environment according to the natural language instructions. Our method first generates a set of consistently achievable sub-goals from the instructions and then completes associated sub-tasks with a pre-trained RL policy. The proposed method formed the RL baseline at the IGLU 2022 competition.

PDF Abstract

Summarize Bookmark Chat (Pro)

Authors (12)

Alexey Skrynnik (21 papers)
Zoya Volovikova (6 papers)
Marc-Alexandre Côté (42 papers)
Anton Voronov (7 papers)
Artem Zholus (17 papers)
Negar Arabzadeh (28 papers)
Shrestha Mohanty (12 papers)
Milagro Teruel (6 papers)
Ahmed Awadallah (27 papers)
Aleksandr Panov (24 papers)
Mikhail Burtsev (27 papers)
Julia Kiseleva (33 papers)

Citations (9)

View on Semantic Scholar

Learning to Solve Voxel Building Embodied Tasks from Pixels and Natural Language Instructions (2211.00688v1)

Related Papers