
Tackling Hallucinations in Neural Chart Summarization (2308.00399v1)

Published 1 Aug 2023 in cs.CL and cs.LG

Abstract: Hallucinations in text generation occur when the system produces text that is not grounded in the input. In this work, we tackle the problem of hallucinations in neural chart summarization. Our analysis shows that the target side of chart summarization training datasets often contains additional information that is not present in the input, leading to hallucinations. We propose a natural language inference (NLI) based method to preprocess the training data and show through human evaluation that our method significantly reduces hallucinations. We also find that shortening long-distance dependencies in the input sequence and adding chart-related information such as the title and legends improve overall performance.
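The abstract does not spell out how the NLI-based preprocessing works, so the following is only a minimal sketch of one plausible reading: treat the linearized chart data as the premise, treat each reference-summary sentence as a hypothesis, and drop sentences whose entailment score falls below a threshold. The function names, the sentence-level granularity, the threshold value, and the token-overlap scorer are all assumptions made for illustration; in practice the scorer would be replaced by a real NLI model.

```python
from typing import Callable, List


def toy_entailment_score(premise: str, hypothesis: str) -> float:
    """Hypothetical stand-in for an NLI model (NOT a real entailment model).

    Returns the fraction of hypothesis tokens that also occur in the premise.
    """
    premise_tokens = set(premise.lower().split())
    hypothesis_tokens = hypothesis.lower().split()
    if not hypothesis_tokens:
        return 0.0
    matched = sum(token in premise_tokens for token in hypothesis_tokens)
    return matched / len(hypothesis_tokens)


def filter_summary(
    chart_text: str,
    summary_sentences: List[str],
    score: Callable[[str, str], float] = toy_entailment_score,
    threshold: float = 0.5,  # assumed value, not taken from the paper
) -> List[str]:
    """Keep only the summary sentences that the scorer judges to be grounded
    in the linearized chart text."""
    return [s for s in summary_sentences if score(chart_text, s) >= threshold]


# Illustrative data: a linearized chart and a two-sentence reference summary.
chart = "title sales by year 2020 100 2021 150"
reference = ["sales 2021 150", "the pandemic caused the drop"]
cleaned = filter_summary(chart, reference)
```

Here the second sentence introduces information that is absent from the chart, so the filter removes it from the training target. Passing the scorer in as a parameter keeps the filtering logic independent of whichever NLI model is actually used.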

Authors (4)
  1. Saad Obaid ul Islam (2 papers)
  2. Iza Škrjanec (3 papers)
  3. Ondřej Dušek (78 papers)
  4. Vera Demberg (48 papers)
Citations (5)