Interacting with Non-Cooperative User: A New Paradigm for Proactive Dialogue Policy (2204.07433v1)
Abstract: Proactive dialogue system is able to lead the conversation to a goal topic and has advantaged potential in bargain, persuasion and negotiation. Current corpus-based learning manner limits its practical application in real-world scenarios. To this end, we contribute to advance the study of the proactive dialogue policy to a more natural and challenging setting, i.e., interacting dynamically with users. Further, we call attention to the non-cooperative user behavior -- the user talks about off-path topics when he/she is not satisfied with the previous topics introduced by the agent. We argue that the targets of reaching the goal topic quickly and maintaining a high user satisfaction are not always converge, because the topics close to the goal and the topics user preferred may not be the same. Towards this issue, we propose a new solution named I-Pro that can learn Proactive policy in the Interactive setting. Specifically, we learn the trade-off via a learned goal weight, which consists of four factors (dialogue turn, goal completion difficulty, user satisfaction estimation, and cooperative degree). The experimental results demonstrate I-Pro significantly outperforms baselines in terms of effectiveness and interpretability.
- Wenqiang Lei (66 papers)
- Yao Zhang (537 papers)
- Feifan Song (14 papers)
- Hongru Liang (18 papers)
- Jiaxin Mao (47 papers)
- Jiancheng Lv (99 papers)
- Zhenglu Yang (20 papers)
- Tat-Seng Chua (359 papers)