Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
80 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Two Approaches to Building Collaborative, Task-Oriented Dialog Agents through Self-Play (2109.09597v1)

Published 20 Sep 2021 in cs.CL, cs.AI, and cs.GT

Abstract: Task-oriented dialog systems are often trained on human/human dialogs, such as collected from Wizard-of-Oz interfaces. However, human/human corpora are frequently too small for supervised training to be effective. This paper investigates two approaches to training agent-bots and user-bots through self-play, in which they autonomously explore an API environment, discovering communication strategies that enable them to solve the task. We give empirical results for both reinforcement learning and game-theoretic equilibrium finding.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Arkady Arkhangorodsky (6 papers)
  2. Scot Fang (4 papers)
  3. Victoria Knight (1 paper)
  4. Ajay Nagesh (7 papers)
  5. Maria Ryskina (11 papers)
  6. Kevin Knight (29 papers)