TED: A Pretrained Unsupervised Summarization Model with Theme Modeling and Denoising (2001.00725v3)

Published 3 Jan 2020 in cs.CL

Abstract: Text summarization aims to extract essential information from a piece of text and transform it into a concise version. Existing unsupervised abstractive summarization models rely on recurrent neural network frameworks, while the recently proposed transformer exhibits much greater capability. Moreover, most previous summarization models ignore the abundant unlabeled corpora available for pretraining. To address these issues, we propose TED, a transformer-based unsupervised abstractive summarization system pretrained on large-scale data. We first leverage the lead bias in news articles to pretrain the model on millions of unlabeled documents. Next, we finetune TED on target domains through theme modeling and a denoising autoencoder to enhance the quality of generated summaries. Notably, TED outperforms all unsupervised abstractive baselines on the NYT, CNN/DM, and English Gigaword datasets, which span a variety of document styles. Further analysis shows that the summaries generated by TED are highly abstractive and that each component of TED's objective function is highly effective.
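
To make the two central ideas concrete, here is a minimal sketch, not the authors' code, of how lead-bias pretraining pairs and denoising-autoencoder corruption might look. It assumes the lead (first few sentences) of an unlabeled news article serves as the pseudo-summary target and the remainder as the encoder input; the function names (`lead_bias_pair`, `corrupt`), the naive regex sentence splitter, and the specific noise parameters are illustrative assumptions, not details from the paper.

```python
import random
import re


def split_sentences(text):
    # Naive sentence splitter for illustration; a real pipeline
    # would use a proper sentence tokenizer.
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def lead_bias_pair(article, lead_k=3):
    """Build a (source, target) pretraining pair from one unlabeled article.

    Assumption: the first `lead_k` sentences act as the pseudo-summary
    target, and the rest of the article is the model input.
    """
    sents = split_sentences(article)
    if len(sents) <= lead_k:  # too short to yield a useful pair
        return None
    target = " ".join(sents[:lead_k])
    source = " ".join(sents[lead_k:])
    return source, target


def corrupt(tokens, delete_p=0.1, shuffle_window=3, seed=None):
    """Denoising-autoencoder-style corruption (illustrative only):
    randomly drop tokens, then lightly shuffle nearby positions.
    The model would be trained to reconstruct the clean sequence."""
    rng = random.Random(seed)
    kept = [t for t in tokens if rng.random() > delete_p]
    out = kept[:]
    for i in range(len(out)):
        j = min(len(out) - 1, i + rng.randint(0, shuffle_window - 1))
        out[i], out[j] = out[j], out[i]
    return out
```

With pairs built this way over millions of news articles, a standard transformer encoder-decoder can be pretrained with an ordinary sequence-to-sequence loss before the theme-modeling and denoising finetuning described above.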

Authors (6)
  1. Ziyi Yang (77 papers)
  2. Chenguang Zhu (100 papers)
  3. Robert Gmyr (20 papers)
  4. Michael Zeng (76 papers)
  5. Xuedong Huang (22 papers)
  6. Eric Darve (72 papers)
Citations (59)