Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Graph Neural Network Policies and Imitation Learning for Multi-Domain Task-Oriented Dialogues (2210.05252v1)

Published 11 Oct 2022 in cs.CL

Abstract: Task-oriented dialogue systems are designed to achieve specific goals while conversing with humans. In practice, they may have to handle simultaneously several domains and tasks. The dialogue manager must therefore be able to take into account domain changes and plan over different domains/tasks in order to deal with multidomain dialogues. However, learning with reinforcement in such context becomes difficult because the state-action dimension is larger while the reward signal remains scarce. Our experimental results suggest that structured policies based on graph neural networks combined with different degrees of imitation learning can effectively handle multi-domain dialogues. The reported experiments underline the benefit of structured policies over standard policies.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Thibault Cordier (3 papers)
  2. Tanguy Urvoy (14 papers)
  3. Lina M. Rojas-Barahona (20 papers)
  4. Fabrice Lefèvre (8 papers)
Citations (2)