Alleviating Hallucinations of Large Language Models through Induced Hallucinations (2312.15710v2)

Published 25 Dec 2023 in cs.CL and cs.AI

Abstract: Despite their impressive capabilities, LLMs have been observed to generate responses that include inaccurate or fabricated information, a phenomenon commonly known as "hallucination". In this work, we propose a simple Induce-then-Contrast Decoding (ICD) strategy to alleviate hallucinations. We first construct a factually weak LLM by inducing hallucinations from the original LLMs. Then, we penalize these induced hallucinations during decoding to enhance the factuality of the generated content. Concretely, we determine the final next-token predictions by amplifying the predictions from the original model and downplaying the induced untruthful predictions via contrastive decoding. Experimental results on both discrimination-based and generation-based hallucination evaluation benchmarks, such as TruthfulQA and FActScore, demonstrate that our proposed ICD methods can effectively enhance the factuality of LLMs across various model sizes and families. For example, when equipped with ICD, Llama2-7B-Chat and Mistral-7B-Instruct achieve performance comparable to ChatGPT and GPT4 on TruthfulQA, respectively.
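
To make the contrastive step described in the abstract concrete, the minimal Python/PyTorch sketch below shows one plausible way to combine the two models' next-token distributions, in the style of contrastive decoding (reference 23). The function name, the contrast weight alpha, and the plausibility cutoff beta are illustrative assumptions rather than the paper's exact formulation or hyperparameters, and the hallucination-induced model is assumed to have been obtained separately (e.g., by fine-tuning the original model on fabricated answers).

import math
import torch
import torch.nn.functional as F

def icd_next_token_logits(base_logits, induced_logits, alpha=1.0, beta=0.1):
    # base_logits / induced_logits: (vocab_size,) next-token logits from the
    # original model and the hallucination-induced model, respectively.
    # alpha: contrast strength; beta: adaptive-plausibility cutoff.
    # Both defaults are illustrative, not values taken from the paper.
    base_logprobs = F.log_softmax(base_logits, dim=-1)
    induced_logprobs = F.log_softmax(induced_logits, dim=-1)

    # Keep only tokens the original model already finds plausible
    # (probability within a factor beta of its top token), so rare tokens
    # are not promoted merely because the induced model dislikes them.
    cutoff = base_logprobs.max() + math.log(beta)
    implausible = base_logprobs < cutoff

    # Amplify the original model's predictions and downplay the induced,
    # untruthful predictions, then mask out implausible tokens.
    contrasted = (1.0 + alpha) * base_logprobs - alpha * induced_logprobs
    return contrasted.masked_fill(implausible, float("-inf"))

# Toy demonstration with random logits standing in for two forward passes.
torch.manual_seed(0)
vocab_size = 32
base = torch.randn(vocab_size)
induced = torch.randn(vocab_size)
print("greedy token id:", icd_next_token_logits(base, induced).argmax().item())

In a full decoding loop the same combination would be applied at every step, with greedy selection or sampling over the contrasted logits.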

References (68)
  1. A multitask, multilingual, multimodal evaluation of chatgpt on reasoning, hallucination, and interactivity. arXiv preprint arXiv:2302.04023.
  2. Sparks of artificial general intelligence: Early experiments with gpt-4. arXiv preprint arXiv:2303.12712.
  3. Weak-to-strong generalization: Eliciting strong capabilities with weak supervision.
  4. Medusa: Simple framework for accelerating llm generation with multiple decoding heads. https://github.com/FasterDecoding/Medusa.
  5. Unveiling the siren’s song: Towards reliable fact-conflicting hallucination detection. arXiv preprint arXiv:2310.12086.
  6. Evaluating hallucinations in chinese large language models. arXiv preprint arXiv:2310.03368.
  7. Factool: Factuality detection in generative ai–a tool augmented framework for multi-task and multi-domain scenarios. arXiv preprint arXiv:2307.13528.
  8. Dola: Decoding by contrasting layers improves factuality in large language models. arXiv preprint arXiv:2309.03883.
  9. Scaling instruction-finetuned language models. arXiv preprint arXiv:2210.11416.
  10. OpenCompass Contributors. 2023. Opencompass: A universal evaluation platform for foundation models. https://github.com/open-compass/opencompass.
  11. Detecting and mitigating hallucinations in machine translation: Model internal workings alone do well, sentence similarity even better. arXiv preprint arXiv:2212.08597.
  12. Is chatgpt a highly fluent grammatical error correction system? a comprehensive evaluation. arXiv preprint arXiv:2304.01746.
  13. Lora: Low-rank adaptation of large language models. In International Conference on Learning Representations.
  14. Do large language models know about facts? arXiv preprint arXiv:2310.05177.
  15. Baseline defenses for adversarial attacks against aligned language models. arXiv preprint arXiv:2309.00614.
  16. Survey of hallucination in natural language generation. ACM Computing Surveys, 55(12):1–38.
  17. Mistral 7b. arXiv preprint arXiv:2310.06825.
  18. Is chatgpt a good translator? a preliminary study. arXiv preprint arXiv:2301.08745.
  19. Diederik P Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980.
  20. A survey on retrieval-augmented text generation. arXiv preprint arXiv:2202.01110.
  21. Halueval: A large-scale hallucination evaluation benchmark for large language models. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, pages 6449–6464.
  22. Inference-time intervention: Eliciting truthful answers from a language model. arXiv preprint arXiv:2306.03341.
  23. Contrastive decoding: Open-ended text generation as optimization. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 12286–12312.
  24. Textbooks are all you need ii: phi-1.5 technical report. arXiv preprint arXiv:2309.05463.
  25. Let’s verify step by step. arXiv preprint arXiv:2305.20050.
  26. Truthfulqa: Measuring how models mimic human falsehoods. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 3214–3252.
  27. DExperts: Decoding-time controlled text generation with experts and anti-experts. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pages 6691–6706.
  28. Selfcheckgpt: Zero-resource black-box hallucination detection for generative large language models. arXiv preprint arXiv:2303.08896.
  29. Sources of hallucination by large language models on inference tasks. arXiv preprint arXiv:2305.14552.
  30. Factscore: Fine-grained atomic evaluation of factual precision in long form text generation. arXiv preprint arXiv:2305.14251.
  31. Sean O’Brien and Mike Lewis. 2023. Contrastive decoding improves reasoning in large language models. arXiv preprint arXiv:2309.09117.
  32. OpenAI. 2023. Gpt-4 technical report. arXiv preprint arXiv:2303.08774.
  33. Training language models to follow instructions with human feedback. Advances in Neural Information Processing Systems, 35:27730–27744.
  34. Check your facts and try again: Improving large language models with external knowledge and automated feedback. arXiv preprint arXiv:2302.12813.
  35. Red teaming language models with language models. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, pages 3419–3448.
  36. Fine-tuning aligned language models compromises safety, even when users do not intend to! arXiv preprint arXiv:2310.03693.
  37. Hallucination reduction in long input text summarization. arXiv preprint arXiv:2309.16781.
  38. Delucionqa: Detecting hallucinations in domain-specific question answering. arXiv preprint arXiv:2312.05200.
  39. John Schulman. 2023. Reinforcement learning from human feedback: Progress and challenges.
  40. Large language models can be easily distracted by irrelevant context. In International Conference on Machine Learning, pages 31210–31227. PMLR.
  41. Trusting your evidence: Hallucinate less with context-aware decoding. arXiv preprint arXiv:2305.14739.
  42. Aligning large multimodal models with factually augmented rlhf. arXiv preprint arXiv:2309.14525.
  43. Ilya Sutskever. 2023. An observation on generalization.
  44. Bert rediscovers the classical nlp pipeline. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, pages 4593–4601.
  45. Fine-tuning language models for factuality. arXiv preprint arXiv:2311.08401.
  46. Llama 2: Open foundation and fine-tuned chat models. arXiv preprint arXiv:2307.09288.
  47. Med-halt: Medical domain hallucination test for large language models. arXiv preprint arXiv:2307.15343.
  48. Freshllms: Refreshing large language models with search engine augmentation. arXiv preprint arXiv:2310.03214.
  49. Histalign: Improving context dependency in language generation by aligning with history. arXiv preprint arXiv:2305.04782.
  50. Survey on factuality in large language models: Knowledge, retrieval and domain-specificity. arXiv preprint arXiv:2310.07521.
  51. Jailbroken: How does llm safety training fail? arXiv preprint arXiv:2307.02483.
  52. Transformers: State-of-the-art natural language processing. In Proceedings of the 2020 conference on empirical methods in natural language processing: system demonstrations, pages 38–45.
  53. Defending chatgpt against jailbreak attack via self-reminder.
  54. The rise and potential of large language model based agents: A survey. arXiv preprint arXiv:2309.07864.
  55. Baichuan 2: Open large-scale language models. arXiv preprint arXiv:2309.10305.
  56. Rlcd: Reinforcement learning from contrast distillation for language model alignment. arXiv preprint arXiv:2307.12950.
  57. Alignment for honesty. arXiv preprint arXiv:2312.07000.
  58. Llm lies: Hallucinations are not bugs, but features as adversarial examples. arXiv preprint arXiv:2310.01469.
  59. Surfacing biases in large language models using contrastive input decoding. arXiv preprint arXiv:2305.07378.
  60. Automatic hallucination assessment for aligned large language models via transferable adversarial attacks. arXiv preprint arXiv:2310.12516.
  61. SAC³: Reliable hallucination detection in black-box language models via semantic-aware cross-check consistency. arXiv preprint arXiv:2311.01740.
  62. Multi-task instruction tuning of llama for specific scenarios: A preliminary study on writing assistance. arXiv preprint arXiv:2305.13225.
  63. Siren’s song in the ai ocean: A survey on hallucination in large language models. arXiv preprint arXiv:2309.01219.
  64. A survey of large language models. arXiv preprint arXiv:2303.18223.
  65. Why does chatgpt fall short in providing truthful answers. arXiv preprint arXiv:2304.10513.
  66. Lima: Less is more for alignment. arXiv preprint arXiv:2305.11206.
  67. Mixture-of-experts with expert choice routing. Advances in Neural Information Processing Systems, 35:7103–7114.
  68. Universal and transferable adversarial attacks on aligned language models. arXiv preprint arXiv:2307.15043.
Authors (4)
  1. Yue Zhang (618 papers)
  2. Leyang Cui (50 papers)
  3. Wei Bi (62 papers)
  4. Shuming Shi (126 papers)
Citations (41)