Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

DeepChannel: Salience Estimation by Contrastive Learning for Extractive Document Summarization (1811.02394v2)

Published 6 Nov 2018 in cs.CL

Abstract: We propose DeepChannel, a robust, data-efficient, and interpretable neural model for extractive document summarization. Given any document-summary pair, we estimate a salience score, which is modeled using an attention-based deep neural network, to represent the salience degree of the summary for yielding the document. We devise a contrastive training strategy to learn the salience estimation network, and then use the learned salience score as a guide and iteratively extract the most salient sentences from the document as our generated summary. In experiments, our model not only achieves state-of-the-art ROUGE scores on CNN/Daily Mail dataset, but also shows strong robustness in the out-of-domain test on DUC2007 test set. Moreover, our model reaches a ROUGE-1 F-1 score of 39.41 on CNN/Daily Mail test set with merely $1 / 100$ training set, demonstrating a tremendous data efficiency.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Jiaxin Shi (53 papers)
  2. Chen Liang (140 papers)
  3. Lei Hou (127 papers)
  4. Juanzi Li (144 papers)
  5. Zhiyuan Liu (433 papers)
  6. Hanwang Zhang (161 papers)
Citations (28)