
Augmenting LLMs with Knowledge: A survey on hallucination prevention (2309.16459v1)

Published 28 Sep 2023 in cs.CL, cs.AI, and cs.LG

Abstract: Large pre-trained language models have demonstrated their proficiency in storing factual knowledge within their parameters and achieving remarkable results when fine-tuned for downstream natural language processing tasks. Nonetheless, their capacity to access and manipulate knowledge with precision remains constrained, resulting in performance disparities on knowledge-intensive tasks when compared to task-specific architectures. Additionally, the challenges of providing provenance for model decisions and maintaining up-to-date world knowledge persist as open research frontiers. To address these limitations, the integration of pre-trained models with differentiable access mechanisms to explicit non-parametric memory emerges as a promising solution. This survey delves into the realm of language models (LMs) augmented with the ability to tap into external knowledge sources, including external knowledge bases and search engines. While adhering to the standard objective of predicting missing tokens, these augmented LMs leverage diverse, possibly non-parametric external modules to augment their contextual processing capabilities, departing from the conventional language modeling paradigm. Through an exploration of current advancements in augmenting LLMs with knowledge, this work concludes that this emerging research direction holds the potential to address prevalent issues in traditional LMs, such as hallucinations, un-grounded responses, and scalability challenges.
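The retrieve-then-read pattern summarized in the abstract can be made concrete with a small sketch. The snippet below is a minimal, self-contained illustration of conditioning generation on passages fetched from a non-parametric memory; the toy corpus, the bag-of-words retriever, and the generate() stub are illustrative assumptions for this page, not components described in the paper.

```python
# Minimal sketch of the retrieve-then-read pattern: a non-parametric memory
# (here a toy in-memory corpus) is queried at inference time and the retrieved
# passages are prepended to the LM prompt. All names are illustrative.
import math
from collections import Counter

corpus = [
    "The Eiffel Tower is located in Paris and was completed in 1889.",
    "Retrieval-augmented generation conditions a language model on retrieved text.",
    "Dense passage retrieval encodes questions and passages into a shared vector space.",
]

def bow(text: str) -> Counter:
    """Crude bag-of-words 'embedding'; a real system would use a dense encoder."""
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def retrieve(query: str, k: int = 2) -> list[str]:
    """Return the k passages most similar to the query (the memory lookup step)."""
    q = bow(query)
    return sorted(corpus, key=lambda p: cosine(q, bow(p)), reverse=True)[:k]

def generate(prompt: str) -> str:
    """Placeholder for any seq2seq or decoder-only LM; here it just echoes the prompt."""
    return f"[LM would generate an answer grounded in]\n{prompt}"

question = "Where is the Eiffel Tower?"
context = "\n".join(retrieve(question))
print(generate(f"Context:\n{context}\n\nQuestion: {question}\nAnswer:"))
```

In an actual retrieval-augmented system the Counter-based scoring would be replaced by a dense retriever over an indexed corpus, and generate() by a trained language model conditioned on the retrieved passages.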
