Meta Dialogue Policy Learning (2006.02588v1)

Published 3 Jun 2020 in cs.CL and cs.LG

Abstract: Dialog policy determines the next-step actions for agents and hence is central to a dialogue system. However, when migrated to novel domains with little data, a policy model can fail to adapt due to insufficient interactions with the new environment. We propose Deep Transferable Q-Network (DTQN) to utilize shareable low-level signals between domains, such as dialogue acts and slots. We decompose the state and action representation space into feature subspaces corresponding to these low-level components to facilitate cross-domain knowledge transfer. Furthermore, we embed DTQN in a meta-learning framework and introduce Meta-DTQN with a dual-replay mechanism to enable effective off-policy training and adaptation. In experiments, our model outperforms baseline models in terms of both success rate and dialogue efficiency on the multi-domain dialogue dataset MultiWOZ 2.0.

PDF Abstract

Summarize PDF Markdown Bookmark Chat (Pro)

Authors (4)

Yumo Xu (14 papers)
Chenguang Zhu (100 papers)
Baolin Peng (72 papers)
Michael Zeng (76 papers)

Citations (7)

View on Semantic Scholar

Meta Dialogue Policy Learning (2006.02588v1)

Related Papers