Focus-Constrained Attention Mechanism for CVAE-based Response Generation (2009.12102v1)

Published 25 Sep 2020 in cs.CL

Abstract: To model diverse responses for a given post, one promising way is to introduce a latent variable into Seq2Seq models. The latent variable is supposed to capture the discourse-level information and encourage the informativeness of target responses. However, such discourse-level information is often too coarse for the decoder to utilize. To tackle this, our idea is to transform the coarse-grained discourse-level information into fine-grained word-level information. Specifically, we first measure the semantic concentration of the corresponding target response on the post words by introducing a fine-grained focus signal. Then, we propose a focus-constrained attention mechanism to take full advantage of the focus signal in aligning the input to the target response. The experimental results demonstrate that by exploiting the fine-grained signal, our model can generate more diverse and informative responses compared with several state-of-the-art models.
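The abstract's core idea — biasing the decoder's attention toward post words with high focus scores — can be sketched as follows. This is a minimal illustration, not the paper's exact formulation: the dot-product scorer, the additive log-focus bias, and the function name `focus_constrained_attention` are all assumptions made for the example.

```python
import numpy as np

def focus_constrained_attention(enc_states, dec_state, focus):
    """Hypothetical sketch of focus-constrained attention.

    enc_states: (T, d) encoder hidden states over the post words
    dec_state:  (d,)   current decoder hidden state
    focus:      (T,)   word-level focus scores in (0, 1]

    Standard attention logits are biased by the log of the focus
    signal, so words the target response concentrates on receive
    larger attention weights.
    """
    # One possible scorer: dot product between encoder and decoder states.
    logits = enc_states @ dec_state              # (T,)
    # Constrain by the fine-grained focus signal (assumed additive in log space).
    logits = logits + np.log(focus + 1e-9)
    # Softmax over post positions.
    weights = np.exp(logits - logits.max())
    weights /= weights.sum()
    # Focus-weighted context vector for the decoder.
    context = weights @ enc_states               # (d,)
    return weights, context

# Usage: with a zero decoder state, the weights reduce to the
# normalized focus scores, so the focused word dominates.
enc = np.eye(2)
w, ctx = focus_constrained_attention(enc, np.zeros(2), np.array([0.9, 0.1]))
```

With uniform base logits, the attention distribution simply mirrors the focus signal, which makes the mechanism's effect easy to verify in isolation.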

Authors (6)
  1. Zhi Cui (5 papers)
  2. Yanran Li (32 papers)
  3. Jiayi Zhang (160 papers)
  4. Jianwei Cui (18 papers)
  5. Chen Wei (72 papers)
  6. Bin Wang (751 papers)
Citations (7)
