Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
41 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

SWING: Balancing Coverage and Faithfulness for Dialogue Summarization (2301.10483v1)

Published 25 Jan 2023 in cs.CL

Abstract: Missing information is a common issue of dialogue summarization where some information in the reference summaries is not covered in the generated summaries. To address this issue, we propose to utilize natural language inference (NLI) models to improve coverage while avoiding introducing factual inconsistencies. Specifically, we use NLI to compute fine-grained training signals to encourage the model to generate content in the reference summaries that have not been covered, as well as to distinguish between factually consistent and inconsistent generated sentences. Experiments on the DialogSum and SAMSum datasets confirm the effectiveness of the proposed approach in balancing coverage and faithfulness, validated with automatic metrics and human evaluations. Additionally, we compute the correlation between commonly used automatic metrics with human judgments in terms of three different dimensions regarding coverage and factual consistency to provide insight into the most suitable metric for evaluating dialogue summaries.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (8)
  1. Kung-Hsiang Huang (22 papers)
  2. Siffi Singh (7 papers)
  3. Xiaofei Ma (31 papers)
  4. Wei Xiao (100 papers)
  5. Feng Nan (22 papers)
  6. Nicholas Dingwall (3 papers)
  7. William Yang Wang (254 papers)
  8. Kathleen McKeown (85 papers)
Citations (11)