Global Encoding for Abstractive Summarization (1805.03989v2)
Published 10 May 2018 in cs.CL, cs.AI, and cs.LG
Abstract: In neural abstractive summarization, the conventional sequence-to-sequence (seq2seq) model often suffers from repetition and semantic irrelevance. To tackle this problem, we propose a global encoding framework, which controls the information flow from the encoder to the decoder based on the global information of the source context. It uses a convolutional gated unit to perform global encoding, which improves the representations of the source-side information. Evaluations on both LCSTS and English Gigaword demonstrate that our model outperforms the baseline models, and the analysis shows that our model is capable of reducing repetition.
- Junyang Lin (99 papers)
- Xu Sun (194 papers)
- Shuming Ma (83 papers)
- Qi Su (58 papers)
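
The abstract describes a convolutional gated unit that filters encoder outputs before they reach the decoder. The following is a minimal sketch of that general idea, not the authors' architecture: it assumes a single per-dimension 1-D convolution over time with zero padding, followed by a sigmoid gate that scales each encoder state. The function name `conv_gate` and the parameters `kernel` and `bias` are hypothetical and do not come from the paper.

```python
import math


def sigmoid(x):
    """Logistic function mapping a real number into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def conv_gate(states, kernel, bias=0.0):
    """Gate encoder states with a convolution over neighbouring time steps.

    states: list of T vectors, each a list of d floats (encoder outputs).
    kernel: list of k scalar weights, k odd (hypothetical, shared across dims).
    Returns a list of T gated vectors with the same shape as `states`.
    """
    k = len(kernel)
    if k % 2 != 1:
        raise ValueError("kernel length must be odd")
    half = k // 2
    num_steps = len(states)
    dim = len(states[0])

    gated = []
    for t in range(num_steps):
        row = []
        for j in range(dim):
            # Convolve over the window centred on t; out-of-range steps
            # contribute zero (zero padding).
            acc = bias
            for offset in range(-half, half + 1):
                s = t + offset
                if 0 <= s < num_steps:
                    acc += kernel[offset + half] * states[s][j]
            # The gate depends on the local context, the output on the
            # original state at position t.
            row.append(states[t][j] * sigmoid(acc))
        gated.append(row)
    return gated


# Toy encoder outputs: 3 time steps, 2 dimensions each.
encoder_states = [[2.0, -4.0], [1.0, 0.0], [0.5, 3.0]]
filtered = conv_gate(encoder_states, kernel=[0.2, 0.5, 0.2])
```

Because the sigmoid gate lies strictly between 0 and 1, each gated value keeps the sign of the original state and never exceeds it in magnitude. In this sketch the gate can only attenuate information flowing to the decoder, never amplify it.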