KnowHalu: Hallucination Detection via Multi-Form Knowledge Based Factual Checking (2404.02935v1)

Published 3 Apr 2024 in cs.CL, cs.AI, and cs.LG

Abstract: This paper introduces KnowHalu, a novel approach for detecting hallucinations in text generated by LLMs, utilizing step-wise reasoning, multi-formulation queries, multi-form knowledge for factual checking, and a fusion-based detection mechanism. As LLMs are increasingly applied across various domains, ensuring that their outputs are not hallucinated is critical. Recognizing the limitations of existing approaches that either rely on the self-consistency check of LLMs or perform post-hoc fact-checking without considering the complexity of queries or the form of knowledge, KnowHalu proposes a two-phase process for hallucination detection. In the first phase, it identifies non-fabrication hallucinations: responses that, while factually correct, are irrelevant or non-specific to the query. The second phase, multi-form knowledge-based factual checking, contains five key steps: reasoning and query decomposition, knowledge retrieval, knowledge optimization, judgment generation, and judgment aggregation. Extensive evaluations demonstrate that KnowHalu significantly outperforms SOTA baselines in detecting hallucinations across diverse tasks, e.g., improving by 15.65% in QA tasks and 5.50% in summarization tasks, highlighting its effectiveness and versatility in detecting hallucinations in LLM-generated content.
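
The two-phase process described in the abstract can be sketched as a toy pipeline. This is a minimal illustration only: all function names, the keyword-overlap heuristics, and the key-value "knowledge base" are hypothetical stand-ins for the paper's LLM-driven prompting, retrieval, and judgment components.

```python
# Hypothetical sketch of KnowHalu's two-phase hallucination detection.
# The real system uses LLM prompting and a retrieval stack at each step;
# here every step is a toy heuristic so the control flow is visible.

GENERIC_ANSWERS = {"i don't know", "it depends", "that's a good question"}

def phase1_non_fabrication_check(query: str, answer: str) -> bool:
    """Phase 1: flag non-fabrication hallucinations, i.e. answers that are
    non-specific or evade the query. Toy heuristic: empty or generic reply."""
    a = answer.strip().lower()
    return a == "" or a in GENERIC_ANSWERS

def decompose(query: str) -> list[str]:
    # Step 1: reasoning and query decomposition (stubbed: one sub-query).
    return [query]

def retrieve(sub_query: str, kb: dict[str, str]) -> str:
    # Step 2: knowledge retrieval from a toy key-value knowledge base.
    return kb.get(sub_query, "")

def optimize(knowledge: str) -> dict[str, str]:
    # Step 3: knowledge optimization into multiple forms
    # (here: the full passage plus a crude first-sentence summary).
    return {"unstructured": knowledge, "summary": knowledge.split(".")[0]}

def judge(answer: str, forms: dict[str, str]) -> list[bool]:
    # Step 4: judgment generation: one support verdict per knowledge form
    # (toy check: does any answer token appear in the knowledge form?).
    tokens = answer.lower().split()
    return [any(t in form.lower() for t in tokens) for form in forms.values()]

def aggregate(judgments: list[bool]) -> bool:
    # Step 5: judgment aggregation (here: at least half the forms agree).
    return sum(judgments) >= len(judgments) / 2

def detect_hallucination(query: str, answer: str, kb: dict[str, str]) -> bool:
    """Returns True if the answer is judged hallucinated."""
    if phase1_non_fabrication_check(query, answer):
        return True  # non-specific / irrelevant response
    supported = []
    for sub_query in decompose(query):
        forms = optimize(retrieve(sub_query, kb))
        supported.append(aggregate(judge(answer, forms)))
    return not all(supported)

if __name__ == "__main__":
    kb = {"Who wrote Hamlet?": "William Shakespeare wrote Hamlet. "
                               "It was first performed around 1600."}
    print(detect_hallucination("Who wrote Hamlet?", "William Shakespeare", kb))
    print(detect_hallucination("Who wrote Hamlet?", "Christopher Marlowe", kb))
    print(detect_hallucination("Who wrote Hamlet?", "I don't know", kb))
```

The point of the sketch is the structure, not the heuristics: each of the five steps is an independent, swappable stage, and Phase 1 short-circuits before any retrieval happens, which is how the paper separates relevance failures from factual failures.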

Authors (6)
  1. Jiawei Zhang (529 papers)
  2. Chejian Xu (18 papers)
  3. Yu Gai (9 papers)
  4. Freddy Lecue (36 papers)
  5. Dawn Song (229 papers)
  6. Bo Li (1107 papers)
Citations (5)