Learning Dynamic Belief Graphs to Generalize on Text-Based Games (2002.09127v4)

Published 21 Feb 2020 in cs.CL and cs.LG

Abstract: Playing text-based games requires skills in processing natural language and sequential decision making. Achieving human-level performance on text-based games remains an open challenge, and prior research has largely relied on hand-crafted structured representations and heuristics. In this work, we investigate how an agent can plan and generalize in text-based games using graph-structured representations learned end-to-end from raw text. We propose a novel graph-aided transformer agent (GATA) that infers and updates latent belief graphs during planning to enable effective action selection by capturing the underlying game dynamics. GATA is trained using a combination of reinforcement and self-supervised learning. Our work demonstrates that the learned graph-based representations help agents converge to better policies than their text-only counterparts and facilitate effective generalization across game configurations. Experiments on 500+ unique games from the TextWorld suite show that our best agent outperforms text-based baselines by an average of 24.2%.

Authors (10)

Ashutosh Adhikari (5 papers)
Xingdi Yuan (46 papers)
Marc-Alexandre Côté (42 papers)
Mikuláš Zelinka (4 papers)
Marc-Antoine Rondeau (5 papers)
Romain Laroche (36 papers)
Pascal Poupart (80 papers)
Jian Tang (327 papers)
Adam Trischler (50 papers)
William L. Hamilton (46 papers)

Citations (77)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Learning Dynamic Belief Graphs to Generalize on Text-Based Games (2002.09127v4)

Summary

Related Papers