2000 character limit reached
Two Approaches to Building Collaborative, Task-Oriented Dialog Agents through Self-Play (2109.09597v1)
Published 20 Sep 2021 in cs.CL, cs.AI, and cs.GT
Abstract: Task-oriented dialog systems are often trained on human/human dialogs, such as collected from Wizard-of-Oz interfaces. However, human/human corpora are frequently too small for supervised training to be effective. This paper investigates two approaches to training agent-bots and user-bots through self-play, in which they autonomously explore an API environment, discovering communication strategies that enable them to solve the task. We give empirical results for both reinforcement learning and game-theoretic equilibrium finding.
- Arkady Arkhangorodsky (6 papers)
- Scot Fang (4 papers)
- Victoria Knight (1 paper)
- Ajay Nagesh (7 papers)
- Maria Ryskina (11 papers)
- Kevin Knight (29 papers)