
Unsupervised Summarization for Chat Logs with Topic-Oriented Ranking and Context-Aware Auto-Encoders (2012.07300v2)

Published 14 Dec 2020 in cs.CL

Abstract: Automatic chat summarization can help people quickly grasp important information from numerous chat messages. Unlike conventional documents, chat logs usually have fragmented and evolving topics. In addition, these logs contain a quantity of elliptical and interrogative sentences, which make the chat summarization highly context dependent. In this work, we propose a novel unsupervised framework called RankAE to perform chat summarization without employing manually labeled data. RankAE consists of a topic-oriented ranking strategy that selects topic utterances according to centrality and diversity simultaneously, as well as a denoising auto-encoder that is carefully designed to generate succinct but context-informative summaries based on the selected utterances. To evaluate the proposed method, we collect a large-scale dataset of chat logs from a customer service environment and build an annotated set only for model evaluation. Experimental results show that RankAE significantly outperforms other unsupervised methods and is able to generate high-quality summaries in terms of relevance and topic coverage.
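The abstract describes selecting topic utterances by centrality and diversity at the same time, but gives no formula. As a rough illustration only, the sketch below uses a greedy, MMR-style selection over bag-of-words cosine similarity. This is an assumed stand-in, not the RankAE ranking strategy itself; the function names, the `trade_off` parameter, and the scoring rule are all hypothetical.

```python
import math
from collections import Counter


def bag_of_words(text):
    """Lowercased whitespace tokens with counts (a deliberately crude representation)."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two token-count vectors."""
    dot = sum(count * b[token] for token, count in a.items() if token in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def select_utterances(utterances, k=2, trade_off=0.7):
    """Greedily pick up to k utterance indices, balancing centrality and diversity.

    Assumed scoring rule (not taken from the paper):
        score = trade_off * centrality - (1 - trade_off) * redundancy
    where centrality is the mean similarity to all other utterances and
    redundancy is the highest similarity to an already selected utterance.
    """
    vectors = [bag_of_words(u) for u in utterances]
    n = len(vectors)
    sim = [[cosine(vectors[i], vectors[j]) for j in range(n)] for i in range(n)]
    centrality = [
        sum(sim[i][j] for j in range(n) if j != i) / max(n - 1, 1)
        for i in range(n)
    ]

    selected = []
    while len(selected) < min(k, n):
        best_index, best_score = None, -math.inf
        for i in range(n):
            if i in selected:
                continue
            redundancy = max((sim[i][j] for j in selected), default=0.0)
            score = trade_off * centrality[i] - (1 - trade_off) * redundancy
            if score > best_score:
                best_index, best_score = i, score
        selected.append(best_index)
    return sorted(selected)  # restore chat order
```

A lower `trade_off` puts more weight on diversity, so near-duplicate utterances are less likely to be selected together; a higher value favors utterances that resemble most of the chat.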

Authors (9)
  1. Yicheng Zou (20 papers)
  2. Jun Lin (87 papers)
  3. Lujun Zhao (4 papers)
  4. Yangyang Kang (32 papers)
  5. Zhuoren Jiang (24 papers)
  6. Changlong Sun (37 papers)
  7. Qi Zhang (785 papers)
  8. Xuanjing Huang (287 papers)
  9. Xiaozhong Liu (71 papers)
Citations (24)
