2000 character limit reached
Learning to Solve Voxel Building Embodied Tasks from Pixels and Natural Language Instructions (2211.00688v1)
Published 1 Nov 2022 in cs.AI and cs.CL
Abstract: The adoption of pre-trained LLMs to generate action plans for embodied agents is a promising research strategy. However, execution of instructions in real or simulated environments requires verification of the feasibility of actions as well as their relevance to the completion of a goal. We propose a new method that combines a LLM and reinforcement learning for the task of building objects in a Minecraft-like environment according to the natural language instructions. Our method first generates a set of consistently achievable sub-goals from the instructions and then completes associated sub-tasks with a pre-trained RL policy. The proposed method formed the RL baseline at the IGLU 2022 competition.
- Alexey Skrynnik (21 papers)
- Zoya Volovikova (6 papers)
- Marc-Alexandre Côté (42 papers)
- Anton Voronov (7 papers)
- Artem Zholus (17 papers)
- Negar Arabzadeh (28 papers)
- Shrestha Mohanty (12 papers)
- Milagro Teruel (6 papers)
- Ahmed Awadallah (27 papers)
- Aleksandr Panov (24 papers)
- Mikhail Burtsev (27 papers)
- Julia Kiseleva (33 papers)