Predict and Use Latent Patterns for Short-Text Conversation (2010.13982v2)

Published 27 Oct 2020 in cs.CL and cs.AI

Abstract: Many neural network models now achieve promising performance in chit-chat settings. Most of them rely on an encoder to understand the post and a decoder to generate the response. Without explicitly assigned semantics, these models lack fine-grained control over responses, because the semantic mapping between posts and responses is hidden inside the end-to-end model. Some previous works use sampled latent words as a controllable semantic form to steer the generated response around the word, but few attempt to use more complex semantic patterns to guide generation. In this paper, we propose using more detailed semantic forms, including latent responses and part-of-speech sequences sampled from the corresponding distributions, as controllable semantics to guide generation. Our results show that these richer semantics not only yield informative and diverse responses but also improve overall response quality, including fluency and coherence.



Authors (4)

Hung-Ting Chen
Yu-Chieh Chao
Ta-Hsuan Chao
Wei-Yun Ma
