
Survey on Factuality in Large Language Models: Knowledge, Retrieval and Domain-Specificity (2310.07521v3)

Published 11 Oct 2023 in cs.CL

Abstract: This survey addresses the crucial issue of factuality in LLMs. As LLMs find applications across diverse domains, the reliability and accuracy of their outputs become vital. We define the Factuality Issue as the probability that LLMs produce content inconsistent with established facts. We first delve into the implications of these inaccuracies, highlighting the potential consequences and challenges posed by factual errors in LLM outputs. Subsequently, we analyze the mechanisms through which LLMs store and process facts, seeking the primary causes of factual errors. Our discussion then transitions to methodologies for evaluating LLM factuality, emphasizing key metrics, benchmarks, and studies. We further explore strategies for enhancing LLM factuality, including approaches tailored for specific domains. We focus on two primary LLM configurations, standalone LLMs and retrieval-augmented LLMs that utilize external data, and detail their unique challenges and potential enhancements. Our survey offers a structured guide for researchers aiming to fortify the factual reliability of LLMs.

Survey on Factuality in LLMs: Knowledge, Retrieval and Domain-Specificity

The paper "Survey on Factuality in LLMs: Knowledge, Retrieval and Domain-Specificity" is a comprehensive examination of the factual reliability of LLMs. As LLMs become integral to various applications, ensuring their output is factually accurate is crucial. This paper systematically explores the concerns regarding factuality in LLMs, presenting a detailed analysis of the mechanisms and strategies involved in enhancing their factual accuracy.

The research discusses the "factuality issue," defined as the likelihood of LLMs generating content inconsistent with established facts. It highlights the implications of these inaccuracies, shedding light on the potential challenges and consequences posed by factual errors in LLM-generated outputs. The authors provide a structured examination of methodologies for evaluating LLM factuality, placing emphasis on key metrics, benchmarks, and recent studies. Various strategies for improving factual accuracy, particularly through domain-specific approaches, are discussed.
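Many of the evaluation methodologies surveyed ultimately reduce to scoring model outputs against reference answers from a benchmark. As a minimal, hypothetical illustration (the function names, normalization scheme, and toy benchmark below are assumptions for exposition, not the paper's own protocol), a simple factual-accuracy metric over QA pairs might look like:

```python
def normalize(text: str) -> str:
    """Lowercase and drop punctuation so matching is lenient."""
    return "".join(ch for ch in text.lower().strip() if ch.isalnum() or ch.isspace())

def factual_accuracy(model_fn, benchmark) -> float:
    """Fraction of questions whose model answer contains the reference fact."""
    hits = 0
    for question, reference in benchmark:
        answer = normalize(model_fn(question))
        if normalize(reference) in answer:
            hits += 1
    return hits / len(benchmark)

# Toy benchmark and a stand-in "model", for demonstration only.
toy_benchmark = [
    ("What is the capital of France?", "Paris"),
    ("Who wrote Hamlet?", "Shakespeare"),
]
toy_model = lambda q: "Paris is the capital." if "France" in q else "I am not sure."

print(factual_accuracy(toy_model, toy_benchmark))  # 0.5
```

Real benchmarks discussed in the survey (e.g., TruthfulQA, Natural Questions) pair such string-matching scores with human or model-based judgments, since containment checks miss paraphrases and reward lucky substrings.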

The paper scrutinizes two main LLM configurations: standalone LLMs and retrieval-augmented LLMs. Standalone LLMs operate independently without external data inputs, whereas retrieval-augmented versions harness external data to refine their outputs. Each configuration comes with its set of challenges and opportunities for enhancement. The paper systematically reviews methods aimed at improving the factuality of LLMs in both settings, providing a valuable resource for researchers aiming to enhance the reliability of LLMs.
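The retrieval-augmented configuration can be sketched as a pipeline that scores an external corpus against the user query and prepends the best-matching passages to the prompt before generation. The sketch below uses naive keyword overlap for ranking; the corpus, scoring function, and prompt template are illustrative assumptions (production systems typically use dense embedding retrievers), not the paper's specific method:

```python
def retrieve(query: str, corpus: list[str], k: int = 2) -> list[str]:
    """Rank passages by word overlap with the query; real systems use dense embeddings."""
    q_words = set(query.lower().split())
    scored = sorted(corpus, key=lambda p: len(q_words & set(p.lower().split())), reverse=True)
    return scored[:k]

def build_prompt(query: str, corpus: list[str]) -> str:
    """Prepend retrieved evidence so the LLM can ground its answer in external data."""
    evidence = "\n".join(retrieve(query, corpus))
    return f"Context:\n{evidence}\n\nQuestion: {query}\nAnswer:"

corpus = [
    "The Eiffel Tower is located in Paris.",
    "Mount Everest is the highest mountain on Earth.",
    "Paris is the capital of France.",
]
prompt = build_prompt("Where is the Eiffel Tower located?", corpus)
print(prompt)
```

A standalone LLM answers the question from its parametric memory alone; the retrieval-augmented variant answers from the constructed prompt, which shifts the factuality problem toward retrieval quality and the model's faithfulness to the supplied context.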

The authors also focus on domain-specific LLM applications. These involve tailoring LLMs to specific domains, such as medicine, finance, and law, where factual accuracy is particularly critical. The survey discusses various domain-specific enhancements that improve the factual reliability of LLMs, offering insight into how these tailored solutions can lead to more accurate and dependable model outputs in specialized fields.

Beyond individual methodologies, the survey emphasizes the importance of a holistic approach to addressing factual inaccuracies in LLMs. By synthesizing research from different domains and approaches, it provides a cohesive guide for fortifying the factual reliability of these models, ensuring they can serve as reliable tools in various academic and practical applications.

In conclusion, this paper serves as a resource for understanding and enhancing the factual accuracy of LLMs. It addresses a pivotal issue in the broader application of AI, providing researchers with the necessary insights and methodologies to develop more factually accurate models. The survey's comprehensive approach to evaluating, analyzing, and improving LLM factuality exemplifies the collaborative efforts required to ensure the ongoing utility and trustworthiness of these advanced computational tools.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (312)
  1. Knowledge Graph Based Synthetic Corpus Generation for Knowledge-Enhanced Language Model Pre-training. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Association for Computational Linguistics, Online, 3554–3565. https://doi.org/10.18653/v1/2021.naacl-main.278
  2. A Review on Language Models as Knowledge Bases. arXiv:2204.06031 [cs.CL]
  3. Falcon-40B: an open large language model with state-of-the-art performance. (2023).
  4. The Fact Extraction and VERification Over Unstructured and Structured information (FEVEROUS) Shared Task. In Proceedings of the Fourth Workshop on Fact Extraction and VERification (FEVER). Association for Computational Linguistics, Dominican Republic, 1–13. https://doi.org/10.18653/v1/2021.fever-1.1
  5. Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection. arXiv preprint arXiv:2310.11511 (2023). https://arxiv.org/abs/2310.11511
  6. Amos Azaria and Tom Mitchell. 2023. The internal state of an llm knows when its lying. arXiv preprint arXiv:2304.13734 (2023).
  7. Knowledge-Augmented Language Model Prompting for Zero-Shot Knowledge Graph Question Answering. arXiv:2306.04136 [cs.CL]
  8. HouYi: An open-source large language model specially designed for renewable energy and carbon neutrality field. arXiv preprint arXiv:2308.01414 (2023).
  9. Satanjeev Banerjee and Alon Lavie. 2005. METEOR: An Automatic Metric for MT Evaluation with Improved Correlation with Human Judgments. In Proceedings of the ACL Workshop on Intrinsic and Extrinsic Evaluation Measures for Machine Translation and/or Summarization.
  10. DISC-MedLLM: Bridging General Large Language Models and Real-World Medical Consultation. arXiv preprint arXiv:2308.14346 (2023).
  11. On the Dangers of Stochastic Parrots: Can Language Models Be Too Big?. In Proceedings of the 2021 ACM Conference on Fairness, Accountability, and Transparency (Virtual Event, Canada) (FAccT ’21). Association for Computing Machinery, New York, NY, USA, 610–623. https://doi.org/10.1145/3442188.3445922
  12. Semantic Parsing on Freebase from Question-Answer Pairs. In Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, Seattle, Washington, USA, 1533–1544. https://aclanthology.org/D13-1160
  13. The Reversal Curse: LLMs trained on ”A is B” fail to learn ”B is A”. arXiv:2309.12288 [cs.CL]
  14. OceanGPT: A Large Language Model for Ocean Science Tasks. arXiv:2310.02031 [cs.CL]
  15. GPT-Neo: Large Scale Autoregressive Language Modeling with Mesh-Tensorflow. https://doi.org/10.5281/zenodo.5297715 If you use this software, please cite it using these metadata..
  16. Improving language models by retrieving from trillions of tokens. arXiv:2112.04426 [cs.CL]
  17. Language Models are Few-Shot Learners. arXiv:2005.14165 [cs.CL]
  18. Sparks of Artificial General Intelligence: Early experiments with GPT-4. arXiv:2303.12712 [cs.CL]
  19. Hallucinated but Factual! Inspecting the Factuality of Hallucinations in Abstractive Summarization. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, Dublin, Ireland, 3340–3354. https://doi.org/10.18653/v1/2022.acl-long.236
  20. Evaluation of Text Generation: A Survey. In arXiv preprint arXiv:2006.14799.
  21. A Survey on Evaluation of Large Language Models. arXiv preprint arXiv:2307.03109 (2023).
  22. Harrison Chase. 2022. LangChain. https://github.com/langchain-ai/langchain
  23. PURR: Efficiently Editing Language Model Hallucinations by Denoising Language Model Corruptions. arXiv preprint arXiv:2305.14908 (2023).
  24. Rich Knowledge Sources Bring Complex Knowledge Conflicts: Recalibrating Models to Reflect Conflicting Evidence. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing. 2292–2307.
  25. Benchmarking Large Language Models in Retrieval-Augmented Generation. arXiv:2309.01431 [cs.CL]
  26. Beyond Factuality: A Comprehensive Evaluation of Large Language Models as Knowledge Generators. arXiv preprint arXiv:2310.07289 (2023).
  27. Evaluating Large Language Models Trained on Code. arXiv:2107.03374 [cs.LG]
  28. Recall and Learn: Fine-tuning Deep Pretrained Language Models with Less Forgetting. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP). 7870–7881.
  29. Open Question Answering over Tables and Text. arXiv:2010.10439 [cs.CL]
  30. Journey to the Center of the Knowledge Neurons: Discoveries of Language-Independent Knowledge Neurons and Degenerate Knowledge Neurons. arXiv:2308.13198 [cs.CL]
  31. Phoenix: Democratizing ChatGPT across Languages. arXiv preprint arXiv:2304.10453 (2023).
  32. FacTool: Factuality Detection in Generative AI – A Tool Augmented Framework for Multi-Task and Multi-Domain Scenarios. arXiv:2307.13528 [cs.CL]
  33. Vicuna: An Open-Source Chatbot Impressing GPT-4 with 90%* ChatGPT Quality. https://lmsys.org/blog/2023-03-30-vicuna/
  34. Palm: Scaling language modeling with pathways. arXiv preprint arXiv:2204.02311 (2022).
  35. DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models. arXiv preprint arXiv:2309.03883 (2023).
  36. Scaling Instruction-Finetuned Language Models. arXiv:2210.11416 [cs.LG]
  37. Think you have Solved Question Answering? Try ARC, the AI2 Reasoning Challenge. arXiv:1803.05457 [cs.AI]
  38. Claude. 2023. Introducing Claude. https://www.anthropic.com/index/introducing-claude
  39. Training verifiers to solve math word problems. arXiv preprint arXiv:2110.14168 (2021).
  40. Jacob Cohen. 1960. A coefficient of agreement for nominal scales. Educational and psychological measurement 20, 1 (1960), 37–46.
  41. Evaluating the Ripple Effects of Knowledge Editing in Language Models. arXiv:2307.12976 [cs.CL]
  42. LM vs LM: Detecting Factual Errors via Cross Examination. arXiv:2305.13281 [cs.CL]
  43. Together Computer. 2023. RedPajama: An Open Source Recipe to Reproduce LLaMA training dataset. https://github.com/togethercomputer/RedPajama-Data
  44. Chatlaw: Open-source legal large language model with integrated external knowledge bases. arXiv preprint arXiv:2306.16092 (2023).
  45. ChatLaw: Open-Source Legal Large Language Model with Integrated External Knowledge Bases. arXiv:2306.16092 [cs.CL]
  46. Efficient and Effective Text Encoding for Chinese LLaMA and Alpaca. arXiv preprint arXiv:2304.08177 (2023). https://arxiv.org/abs/2304.08177
  47. Curation. 2020. Curation Corpus Base.
  48. Hallucination is the last thing you need. arXiv:2306.11520 [cs.CL]
  49. Knowledge Neurons in Pretrained Transformers. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, Dublin, Ireland, 8493–8502. https://doi.org/10.18653/v1/2022.acl-long.581
  50. Editing Factual Knowledge in Language Models. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, Online and Punta Cana, Dominican Republic, 6491–6506. https://doi.org/10.18653/v1/2021.emnlp-main.522
  51. Mention Memory: incorporating textual knowledge into Transformers through entity mention attention. In International Conference on Learning Representations. https://openreview.net/forum?id=OY1A8ejQgEX
  52. Learning A Foundation Language Model for Geoscience Knowledge Understanding and Utilization. arXiv preprint arXiv:2306.05064 (2023).
  53. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). Association for Computational Linguistics, Minneapolis, Minnesota, 4171–4186. https://doi.org/10.18653/v1/N19-1423
  54. Chain-of-Verification Reduces Hallucination in Large Language Models. arXiv preprint arXiv:2309.11495 (2023).
  55. Wizard of Wikipedia: Knowledge-Powered Conversational agents. arXiv:1811.01241 [cs.CL]
  56. Calibrating Factual Knowledge in Pretrained Language Models. In Findings of the Association for Computational Linguistics: EMNLP 2022. Association for Computational Linguistics, Abu Dhabi, United Arab Emirates, 5937–5947. https://doi.org/10.18653/v1/2022.findings-emnlp.438
  57. A Survey on In-context Learning. arXiv:2301.00234 [cs.CL]
  58. Improving Factuality and Reasoning in Language Models through Multiagent Debate. arXiv:2305.14325 [cs.CL]
  59. Fool Me Twice: Entailment from Wikipedia Gamification. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Association for Computational Linguistics, Online, 352–365. https://doi.org/10.18653/v1/2021.naacl-main.32
  60. T-REx: A Large Scale Alignment of Natural Language with Knowledge Base Triples. In Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018). European Language Resources Association (ELRA), Miyazaki, Japan.
  61. RAGAS: Automated Evaluation of Retrieval Augmented Generation. arXiv:2309.15217 [cs.CL]
  62. Scaling Language Models: Methods, Analysis & Insights from Training Gopher.
  63. ELI5: Long Form Question Answering. In Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, Florence, Italy, 3558–3567. https://doi.org/10.18653/v1/P19-1346
  64. GrammarGPT: Exploring Open-Source LLMs for Native Chinese Grammatical Error Correction with Supervised Fine-Tuning. arXiv preprint arXiv:2307.13923 (2023).
  65. Robert M Fano and David Hawkins. 1961. Transmission of information: A statistical theory of communications. American Journal of Physics 29, 11 (1961), 793–794.
  66. LawBench: Benchmarking Legal Knowledge of Large Language Models. arXiv preprint arXiv:2309.16289 (2023).
  67. WinoQueer: A Community-in-the-Loop Benchmark for Anti-LGBTQ+ Bias in Large Language Models. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 9126–9140.
  68. IIRC: A Dataset of Incomplete Information Reading Comprehension Questions. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP). Association for Computational Linguistics, Online, 1137–1147. https://doi.org/10.18653/v1/2020.emnlp-main.86
  69. Gptscore: Evaluate as you desire. arXiv preprint arXiv:2302.04166 (2023).
  70. Are Large Language Models Reliable Judges? A Study on the Factuality Evaluation Capabilities of LLMs. arXiv:2311.00681 [cs.CL]
  71. Entities as Experts: Sparse Memory Access with Entity Supervision. arXiv:2004.07202 [cs.CL]
  72. Bias and Fairness in Large Language Models: A Survey. arXiv preprint arXiv:2309.00770 (2023).
  73. Grounded response generation task at dstc7. In AAAI Dialog System Technology Challenges Workshop.
  74. The Pile: An 800GB Dataset of Diverse Text for Language Modeling. arXiv:2101.00027 [cs.CL]
  75. Rarr: Researching and revising what language models say, using language models. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). 16477–16508.
  76. Enabling Large Language Models to Generate Text with Citations. arXiv:2305.14627 [cs.CL]
  77. RealToxicityPrompts: Evaluating Neural Toxic Degeneration in Language Models. In Findings of the Association for Computational Linguistics: EMNLP 2020. Association for Computational Linguistics, Online, 3356–3369. https://doi.org/10.18653/v1/2020.findings-emnlp.301
  78. Dissecting Recall of Factual Associations in Auto-Regressive Language Models. CoRR abs/2304.14767 (2023). https://doi.org/10.48550/arXiv.2304.14767 arXiv:2304.14767
  79. Transformer Feed-Forward Layers Build Predictions by Promoting Concepts in the Vocabulary Space. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, Abu Dhabi, United Arab Emirates, 30–45. https://aclanthology.org/2022.emnlp-main.3
  80. Did Aristotle Use a Laptop? A Question Answering Benchmark with Implicit Reasoning Strategies. Transactions of the Association for Computational Linguistics 9 (2021), 346–361. https://doi.org/10.1162/tacl_a_00370
  81. Did aristotle use a laptop? a question answering benchmark with implicit reasoning strategies. Transactions of the Association for Computational Linguistics 9 (2021), 346–361.
  82. Transformer Feed-Forward Layers Are Key-Value Memories. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, Online and Punta Cana, Dominican Republic, 5484–5495. https://doi.org/10.18653/v1/2021.emnlp-main.446
  83. An Empirical Investigation of Catastrophic Forgetting in Gradient-Based Neural Networks. arXiv:1312.6211 [stat.ML]
  84. Google. 2023. Bard. bard.google.com (2023).
  85. CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing. arXiv:2305.11738 [cs.CL]
  86. Cohortgpt: An enhanced gpt for participant recruitment in clinical study. arXiv preprint arXiv:2307.11346 (2023).
  87. Legalbench: A collaboratively built benchmark for measuring legal reasoning in large language models. arXiv preprint arXiv:2308.11462 (2023).
  88. INFOTABS: Inference on Tables as Semi-structured Data. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, Online, 2309–2324. https://doi.org/10.18653/v1/2020.acl-main.210
  89. INFOTABS: Inference on tables as semi-structured data. arXiv preprint arXiv:2005.06117 (2020).
  90. WikiAsp: A Dataset for Multi-domain Aspect-based Summarization. Transactions of the Association for Computational Linguistics 9 (2021), 211–225. https://doi.org/10.1162/tacl_a_00362
  91. Rethinking with Retrieval: Faithful Large Language Model Inference. arXiv:2301.00303 [cs.CL]
  92. Measuring Massive Multitask Language Understanding. Proceedings of the International Conference on Learning Representations (ICLR) (2021).
  93. Diagnostic accuracy of differential-diagnosis lists generated by generative pretrained transformer 3 chatbot for clinical vignettes with common chief complaints: A pilot study. International journal of environmental research and public health 20, 4 (2023), 3378.
  94. Constructing A Multi-hop QA Dataset for Comprehensive Evaluation of Reasoning Steps. In Proceedings of the 28th International Conference on Computational Linguistics. International Committee on Computational Linguistics, Barcelona, Spain (Online), 6609–6625. https://doi.org/10.18653/v1/2020.coling-main.580
  95. MISGENDERED: Limits of Large Language Models in Understanding Pronouns. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, Toronto, Canada, 5352–5367. https://doi.org/10.18653/v1/2023.acl-long.293
  96. Wenpin Hou and Zhicheng Ji. 2023. GeneTuring tests GPT models in genomics. bioRxiv (2023).
  97. Parameter-Efficient Transfer Learning for NLP. arXiv:1902.00751 [cs.LG]
  98. Do Large Language Models Know about Facts? arXiv preprint arXiv:2310.05177 (2023).
  99. BSChecker for Fine-grained Hallucination Detection. (2023). https://github.com/amazon-science/bschecker-for-fine-grained-hallucination-detection
  100. Jie Huang and Kevin Chen-Chuan Chang. 2022. Towards Reasoning in Large Language Models: A Survey. arXiv preprint arXiv:2212.10403 (2022).
  101. Jie Huang and Kevin Chen-Chuan Chang. 2023. Towards Reasoning in Large Language Models: A Survey. arXiv:2212.10403 [cs.CL]
  102. A Survey on Hallucination in Large Language Models: Principles, Taxonomy, Challenges, and Open Questions. ArXiv abs/2311.05232 (2023). https://api.semanticscholar.org/CorpusID:265067168
  103. Lawyer LLaMA Technical Report. arXiv preprint arXiv:2305.15062 (2023).
  104. C-Eval: A Multi-Level Multi-Discipline Chinese Evaluation Suite for Foundation Models. arXiv preprint arXiv:2305.08322 (2023).
  105. Transformer-Patcher: One Mistake Worth One Neuron. In The Eleventh International Conference on Learning Representations. https://openreview.net/forum?id=4oYUGeGBPm
  106. Unsupervised dense information retrieval with contrastive learning. arXiv preprint arXiv:2112.09118 (2021).
  107. Gautier Izacard and Edouard Grave. 2021a. Distilling Knowledge from Reader to Retriever for Question Answering. In ICLR 2021 - 9th International Conference on Learning Representations. Vienna, Austria.
  108. Gautier Izacard and Edouard Grave. 2021b. Leveraging Passage Retrieval with Generative Models for Open Domain Question Answering. In Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume. Association for Computational Linguistics, Online, 874–880. https://doi.org/10.18653/v1/2021.eacl-main.74
  109. Atlas: Few-shot Learning with Retrieval Augmented Language Models. arXiv:2208.03299 [cs.CL]
  110. Survey of Hallucination in Natural Language Generation. ACM Comput. Surv. 55, 12, Article 248 (mar 2023), 38 pages. https://doi.org/10.1145/3571730
  111. Survey of hallucination in natural language generation. Comput. Surveys 55, 12 (2023), 1–38.
  112. TempQuestions: A Benchmark for Temporal Question Answering. In Companion Proceedings of the The Web Conference 2018 (Lyon, France) (WWW ’18). International World Wide Web Conferences Steering Committee, Republic and Canton of Geneva, CHE, 1057–1062. https://doi.org/10.1145/3184558.3191536
  113. StructGPT: A general framework for Large Language Model to Reason on Structured Data. arXiv preprint arXiv:2305.09645. https://arxiv.org/pdf/2305.09645.pdf
  114. FreebaseQA: A New Factoid QA Data Set Matching Trivia-Style Question-Answer Pairs with Freebase. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). Association for Computational Linguistics, Minneapolis, Minnesota, 318–323. https://doi.org/10.18653/v1/N19-1028
  115. HoVer: A Dataset for Many-Hop Fact Extraction And Claim Verification. In Findings of the Association for Computational Linguistics: EMNLP 2020. Association for Computational Linguistics, Online, 3441–3460. https://doi.org/10.18653/v1/2020.findings-emnlp.309
  116. Active Retrieval Augmented Generation. arXiv:2305.06983 [cs.CL]
  117. Genegpt: Augmenting large language models with domain tools for improved access to biomedical information. ArXiv (2023).
  118. TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, Vancouver, Canada.
  119. Language Models (Mostly) Know What They Know. arXiv:2207.05221 [cs.CL]
  120. Large language models struggle to learn long-tail knowledge. In International Conference on Machine Learning. PMLR, 15696–15707.
  121. KALA: Knowledge-Augmented Language Model Adaptation. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Association for Computational Linguistics, Seattle, United States, 5144–5167. https://doi.org/10.18653/v1/2022.naacl-main.379
  122. RealTime QA: What’s the Answer Right Now? https://arxiv.org/abs/2207.13332
  123. Hey AI, Can You Solve Complex Tasks by Talking to Agents? arXiv:2110.08542 [cs.CL]
  124. Decomposed Prompting: A Modular Approach for Solving Complex Tasks. arXiv:2210.02406 [cs.CL]
  125. Understanding Catastrophic Forgetting in Language Models via Implicit Inference. arXiv:2309.10105 [cs.CL]
  126. Performance of ChatGPT on USMLE: Potential for AI-assisted medical education using large language models. PLoS digital health 2, 2 (2023), e0000198.
  127. Natural Questions: a Benchmark for Question Answering Research. Transactions of the Association of Computational Linguistics (2019).
  128. Internet-augmented language models through few-shot prompting for open-domain question answering. arXiv:2203.05115 [cs.CL]
  129. Neural Text Generation from Structured Data with Application to the Biography Domain. In Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, Austin, Texas, 1203–1213. https://doi.org/10.18653/v1/D16-1128
  130. Deduplicating Training Data Makes Language Models Better. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, Dublin, Ireland, 8424–8445. https://doi.org/10.18653/v1/2022.acl-long.577
  131. Factuality enhanced language models for open-ended text generation. Advances in Neural Information Processing Systems 35 (2022), 34586–34599.
  132. Douglas B. Lenat. 1995. CYC: A Large-Scale Investment in Knowledge Infrastructure. Commun. ACM 38, 11 (nov 1995), 33–38. https://doi.org/10.1145/219717.219745
  133. Zero-Shot Relation Extraction via Reading Comprehension. In Proceedings of the 21st Conference on Computational Natural Language Learning (CoNLL 2017). Association for Computational Linguistics, Vancouver, Canada, 333–342. https://doi.org/10.18653/v1/K17-1034
  134. Large Language Models with Controllable Working Memory. In Findings of the Association for Computational Linguistics: ACL 2023. Association for Computational Linguistics, Toronto, Canada, 1774–1793. https://doi.org/10.18653/v1/2023.findings-acl.112
  135. MultiSpanQA: A Dataset for Multi-Span Question Answering. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Association for Computational Linguistics, Seattle, United States, 1250–1260. https://doi.org/10.18653/v1/2022.naacl-main.90
  136. HaluEval: A Large-Scale Hallucination Evaluation Benchmark for Large Language Models. arXiv:2305.11747 [cs.CL]
  137. Huatuo-26M, a Large-scale Chinese Medical QA Dataset. arXiv preprint arXiv:2305.01526 (2023).
  138. Inference-Time Intervention: Eliciting Truthful Answers from a Language Model. arXiv:2306.03341 [cs.LG]
  139. Contrastive decoding: Open-ended text generation as optimization. arXiv preprint arXiv:2210.15097 (2022).
  140. A Survey on Truth Discovery. SIGKDD Explor. 17, 2 (2015), 1–16. https://doi.org/10.1145/2897350.2897352
  141. ChatDoctor: A Medical Chat Model Fine-Tuned on a Large Language Model Meta-AI (LLaMA) Using Medical Domain Knowledge. Cureus 15, 6 (2023).
  142. EcomGPT: Instruction-tuning Large Language Model with Chain-of-Task Tasks for E-commerce. arXiv preprint arXiv:2308.06966 (2023).
  143. Decoupled Context Processing for Context Augmented Language Modeling. In Advances in Neural Information Processing Systems, Alice H. Oh, Alekh Agarwal, Danielle Belgrave, and Kyunghyun Cho (Eds.). https://openreview.net/forum?id=02dbnEbEFn
  144. Chin-Yew Lin. 2004. ROUGE: A Package for Automatic Evaluation of Summaries. In Text Summarization Branches Out.
  145. Teaching models to express their uncertainty in words. arXiv preprint arXiv:2205.14334 (2022).
  146. TruthfulQA: Measuring How Models Mimic Human Falsehoods. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, Dublin, Ireland, 3214–3252. https://doi.org/10.18653/v1/2022.acl-long.229
  147. Yen-Ting Lin and Yun-Nung Chen. 2023. LLM-Eval: Unified Multi-Dimensional Automatic Evaluation for Open-Domain Conversations with Large Language Models. arXiv preprint arXiv:2305.13711 (2023).
  148. Domain Specialization as the Key to Make Large Language Models Disruptive: A Comprehensive Survey. arXiv:2305.18703 [cs.CL]
  149. Beyond One-Model-Fits-All: A Survey of Domain Specialization for Large Language Models. arXiv preprint arXiv:2305.18703 (2023).
  150. We’re Afraid Language Models Aren’t Modeling Ambiguity. arXiv preprint arXiv:2304.14399 (2023).
  151. Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning. arXiv:2306.14565 [cs.CV]
  152. Jerry Liu. 2022. LlamaIndex. https://doi.org/10.5281/zenodo.1234
  153. Lost in the middle: How language models use long contexts. arXiv preprint arXiv:2307.03172 (2023).
  154. Revisiting the Gold Standard: Grounding Summarization Evaluation with Robust Human Evaluation. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, Toronto, Canada, 4140–4170. https://doi.org/10.18653/v1/2023.acl-long.228
  155. RoBERTa: A Robustly Optimized BERT Pretraining Approach. arXiv preprint arXiv:1907.11692 (2019).
  156. Deid-gpt: Zero-shot medical text de-identification by gpt-4. arXiv preprint arXiv:2303.11032 (2023).
  157. MolXPT: Wrapping Molecules with Text for Generative Pre-training. arXiv preprint arXiv:2305.10688 (2023).
  158. Entity-Based Knowledge Conflicts in Question Answering. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing. 7052–7063.
  159. Towards an Automatic Turing Test: Learning to Evaluate Dialogue Responses. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics.
  160. Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering. In The 36th Conference on Neural Information Processing Systems (NeurIPS).
  161. A rigorous study of integrated gradients method and extensions to internal neuron attributions. In International Conference on Machine Learning. PMLR, 14485–14508.
  162. SAIL: Search-Augmented Instruction Learning. arXiv preprint arXiv:2305.15225 (2023).
  163. BioGPT: generative pre-trained transformer for biomedical text generation and mining. Briefings in Bioinformatics 23, 6 (2022), bbac409.
  164. An Empirical Study of Catastrophic Forgetting in Large Language Models During Continual Fine-tuning. arXiv:2308.08747 [cs.CL]
  165. Stable Beluga models. [https://huggingface.co/stabilityai/StableBeluga2](https://huggingface.co/stabilityai/StableBeluga2)
  166. When Not to Trust Language Models: Investigating Effectiveness of Parametric and Non-Parametric Memories. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, Toronto, Canada, 9802–9822. https://doi.org/10.18653/v1/2023.acl-long.546
  167. SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models. arXiv:2303.08896 [cs.CL]
  168. How Decoding Strategies Affect the Verifiability of Generated Text. In Findings of the Association for Computational Linguistics: EMNLP 2020. 223–235.
  169. On Faithfulness and Factuality in Abstractive Summarization. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. Association for Computational Linguistics, Online, 1906–1919. https://doi.org/10.18653/v1/2020.acl-main.173
  170. A PROPOSAL FOR THE DARTMOUTH SUMMER RESEARCH PROJECT ON ARTIFICIAL INTELLIGENCE. http://www-formal.stanford.edu/jmc/history/dartmouth/dartmouth.html. http://www-formal.stanford.edu/jmc/history/dartmouth/dartmouth.html
  171. Alan Melikdjanian. 2018. Captain Disillusion's Escape from the USSR. https://www.youtube.com/watch?v=MaDz0FCxzR8 (Feb 2018).
  172. Locating and Editing Factual Associations in GPT. Advances in Neural Information Processing Systems 35 (2022). https://openreview.net/forum?id=-h6WAS6eE4
  173. Teaching language models to support answers with verified quotes. arXiv:2203.11147 [cs.CL]
  174. Pointer sentinel mixture models. arXiv preprint arXiv:1609.07843 (2016).
  175. Microsoft. 2023. Bing Chat. https://www.bing.com/new (2023).
  176. George A. Miller. 1992. WordNet: A Lexical Database for English. In Speech and Natural Language: Proceedings of a Workshop Held at Harriman, New York, February 23-26, 1992.
  177. Introduction to WordNet: An On-line Lexical Database. International Journal of Lexicography 3, 4 (12 1990), 235–244. https://doi.org/10.1093/ijl/3.4.235
  178. FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation. arXiv preprint arXiv:2305.14251 (2023). https://arxiv.org/abs/2305.14251
  179. AmbigQA: Answering Ambiguous Open-domain Questions. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP). Association for Computational Linguistics, Online, 5783–5797. https://doi.org/10.18653/v1/2020.emnlp-main.466
  180. Fast Model Editing at Scale. In International Conference on Learning Representations. https://openreview.net/forum?id=0DcZxeWfOPt
  181. Memory-Based Model Editing at Scale. In International Conference on Machine Learning.
  182. SKILL: Structured Knowledge Infusion for Large Language Models. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Association for Computational Linguistics, Seattle, United States, 1581–1588. https://doi.org/10.18653/v1/2022.naacl-main.113
  183. Crosslingual generalization through multitask finetuning. arXiv preprint arXiv:2211.01786 (2022).
  184. WebGPT: Browser-assisted question-answering with human feedback. arXiv:2112.09332 [cs.CL]
  185. DisentQA: Disentangling Parametric and Contextual Knowledge with Counterfactual Question Answering. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, Toronto, Canada, 10056–10070. https://doi.org/10.18653/v1/2023.acl-long.559
  186. Allen Newell and Herbert A. Simon. 1976. Computer Science as Empirical Inquiry: Symbols and Search. Commun. ACM 19, 3 (mar 1976), 113–126. https://doi.org/10.1145/360018.360022
  187. Ha-Thanh Nguyen. 2023. A Brief Report on LawGPT 1.0: A Virtual Legal Assistant Based on GPT-3. arXiv preprint arXiv:2302.05729 (2023).
  188. Capabilities of GPT-4 on Medical Challenge Problems. arXiv:2303.13375 [cs.CL]
  189. OpenAI. 2022a. GPT-3.5 - OpenAI. https://platform.openai.com/docs/models/gpt-3-5 (2022).
  190. OpenAI. 2022b. Introducing chatgpt. https://openai.com/blog/chatgpt (2022).
  191. OpenAI. 2023. GPT-4 technical report. arXiv preprint arXiv:2303.08774 (2023).
  192. Fine-Tuning or Retrieval? Comparing Knowledge Injection in LLMs. arXiv:2312.05934 [cs.AI]
  193. Fact-Checking Complex Claims with Program-Guided Reasoning. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, Toronto, Canada, 6981–7004. https://doi.org/10.18653/v1/2023.acl-long.386
  194. Unifying Large Language Models and Knowledge Graphs: A Roadmap. arXiv:2306.08302 [cs.CL]
  195. On the Risk of Misinformation Pollution with Large Language Models. arXiv preprint arXiv:2305.13661 (2023).
  196. The LAMBADA dataset: Word prediction requiring a broad discourse context. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, Berlin, Germany, 1525–1534. https://doi.org/10.18653/v1/P16-1144
  197. Bleu: a Method for Automatic Evaluation of Machine Translation. In Proceedings of the 40th Annual Meeting of the Association for Computational Linguistics.
  198. Daniel Park. 2023. Open-LLM-Leaderboard-Report. https://github.com/dsdanielpark/Open-LLM-Leaderboard-Report
  199. Check Your Facts and Try Again: Improving Large Language Models with External Knowledge and Automated Feedback. arXiv:2302.12813 [cs.CL]
  200. KILT: a Benchmark for Knowledge Intensive Language Tasks. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Association for Computational Linguistics, Online, 2523–2544. https://doi.org/10.18653/v1/2021.naacl-main.200
  201. Language Models as Knowledge Bases?. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP). Association for Computational Linguistics, Hong Kong, China, 2463–2473. https://doi.org/10.18653/v1/D19-1250
  202. Language Models as Knowledge Bases? arXiv:1909.01066 [cs.CL]
  203. Pouya Pezeshkpour. 2023. Measuring and Modifying Factual Knowledge in Large Language Models. arXiv:2306.06264 [cs.CL]
  204. Pranshu Verma and Will Oremus. 2023. ChatGPT invented a sexual harassment scandal and named a real law prof as the accused. The Washington Post (2023). https://www.washingtonpost.com/technology/2023/04/05/chatgpt-lies/
  205. Summarization is (Almost) Dead. arXiv:2309.09558 [cs.CL]
  206. FoodGPT: A Large Language Model in Food Testing Domain with Incremental Pre-training and Knowledge Graph Prompt. arXiv preprint arXiv:2308.10173 (2023).
  207. WebCPM: Interactive Web Search for Chinese Long-form Question Answering. arXiv:2305.06849 [cs.CL]
  208. Language models are unsupervised multitask learners. OpenAI blog 1, 8 (2019), 9.
  209. Direct Preference Optimization: Your Language Model is Secretly a Reward Model. arXiv:2305.18290 [cs.LG]
  210. Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer. Journal of Machine Learning Research 21, 140 (2020), 1–67. http://jmlr.org/papers/v21/20-074.html
  211. Measuring attribution in natural language generation models. Computational Linguistics (2023), 1–66.
  212. A Survey of Hallucination in Large Foundation Models. arXiv:2309.05922 [cs.AI]
  213. Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation. arXiv:2307.11019 [cs.CL]
  214. Large-scale chemical language representations capture molecular structure and properties. Nature Machine Intelligence 4, 12 (2022), 1256–1264.
  215. Unsupervised Improvement of Factual Knowledge in Language Models. In Proceedings of the 17th Conference of the European Chapter of the Association for Computational Linguistics. Association for Computational Linguistics, Dubrovnik, Croatia, 2960–2969. https://doi.org/10.18653/v1/2023.eacl-main.215
  216. Leo Sands. 2023. ChatGPT falsely told voters their mayor was jailed for bribery. He may sue. The Washington Post (2023). https://www.washingtonpost.com/technology/2023/04/06/chatgpt-australia-mayor-lawsuit-lies/
  217. Explaining Legal Concepts with Augmented Large Language Models (GPT-4). arXiv preprint arXiv:2306.09525 (2023).
  218. BLOOM: A 176b-parameter open-access multilingual language model. arXiv preprint arXiv:2211.05100 (2022).
  219. BLEURT: Learning Robust Metrics for Text Generation. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics.
  220. Mintaka: A Complex, Natural, and Multilingual Dataset for End-to-End Question Answering. In Proceedings of the 29th International Conference on Computational Linguistics. International Committee on Computational Linguistics, Gyeongju, Republic of Korea, 1604–1619. https://aclanthology.org/2022.coling-1.138
  221. When flue meets flang: Benchmarks and large pre-trained language model for financial domain. arXiv preprint arXiv:2211.00083 (2022).
  222. ChatGPT and other large language models are double-edged swords. Radiology 307, 2 (2023), e230163.
  223. Large language models can be easily distracted by irrelevant context. In International Conference on Machine Learning. PMLR, 31210–31227.
  224. REPLUG: Retrieval-Augmented Black-Box Language Models. arXiv:2301.12652 [cs.CL]
  225. Reflexion: Language Agents with Verbal Reinforcement Learning. arXiv:2303.11366 [cs.AI]
  226. Megatron-lm: Training multi-billion parameter language models using model parallelism. arXiv preprint arXiv:1909.08053 (2019).
  227. ALFWorld: Aligning Text and Embodied Environments for Interactive Learning. arXiv:2010.03768 [cs.CL]
  228. End-to-end training of multi-document reader and retriever for open-domain question answering. Advances in Neural Information Processing Systems 34 (2021), 25968–25981.
  229. Large language models encode clinical knowledge. Nature (2023), 1–9.
  230. Mpnet: Masked and permuted pre-training for language understanding. Advances in Neural Information Processing Systems 33 (2020), 16857–16867.
  231. Conceptnet 5.5: An open multilingual graph of general knowledge. In Proceedings of the AAAI conference on artificial intelligence, Vol. 31.
  232. Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models. Transactions on Machine Learning Research (2023). https://openreview.net/forum?id=uyTL5Bvosj
  233. ASQA: Factoid Questions Meet Long-Form Answers. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics, Abu Dhabi, United Arab Emirates, 8273–8288. https://aclanthology.org/2022.emnlp-main.566
  234. BeamSearchQA: Large Language Models are Strong Zero-Shot QA Solver. arXiv:2305.14766 [cs.CL]
  235. Head-to-Tail: How Knowledgeable are Large Language Models (LLM)? AKA Will LLMs Replace Knowledge Graphs? arXiv preprint arXiv:2308.10168 (2023).
  236. Contrastive learning reduces hallucination in conversations. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 37. 13618–13626.
  237. Aligning Large Multimodal Models with Factually Augmented RLHF. arXiv:2309.14525 [cs.CV]
  238. Evaluating the Factual Consistency of Large Language Models Through News Summarization. In Findings of the Association for Computational Linguistics: ACL 2023. Association for Computational Linguistics, Toronto, Canada, 5220–5255. https://aclanthology.org/2023.findings-acl.322
  239. MedChatZH: a Better Medical Adviser Learns from Better Instructions. arXiv preprint arXiv:2309.01114 (2023).
  240. Can ChatGPT Replace Traditional KBQA Models? An In-depth Analysis of the Question Answering Performance of the GPT LLM Family. arXiv:2303.07992 [cs.CL]
  241. Aligning Factual Consistency for Clinical Studies Summarization through Reinforcement Learning. In Proceedings of the 5th Clinical Natural Language Processing Workshop. Association for Computational Linguistics, Toronto, Canada, 48–58. https://doi.org/10.18653/v1/2023.clinicalnlp-1.7
  242. CONFIT: Toward Faithful Dialogue Summarization with Linguistically-Informed Contrastive Fine-tuning. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. Association for Computational Linguistics, Seattle, United States, 5657–5668. https://doi.org/10.18653/v1/2022.naacl-main.415
  243. Stanford Alpaca: An Instruction-following LLaMA model. https://github.com/tatsu-lab/stanford_alpaca.
  244. InternLM Team. 2023a. Internlm: A multilingual language model with progressively enhanced capabilities.
  245. MosaicML NLP Team. 2023b. Introducing MPT-30B: Raising the bar for open-source foundation models. www.mosaicml.com/blog/mpt-30b Accessed: 2023-06-22.
  246. Large language models in medicine. Nature medicine (2023), 1–11.
  247. James Thorne and Andreas Vlachos. 2021. Evidence-based Factual Error Correction. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers). 3298–3309.
  248. FEVER: a large-scale dataset for Fact Extraction and VERification. arXiv:1803.05355 [cs.CL]
  249. Together. 2023. Releasing 3B and 7B RedPajama-INCITE family of models including base, instruction-tuned & chat models. https://www.together.xyz/blog/redpajama-models-v1.
  250. LLaMA: Open and Efficient Foundation Language Models. arXiv:2302.13971 [cs.CL]
  251. Llama 2: Open Foundation and Fine-Tuned Chat Models. arXiv:2307.09288 [cs.CL]
  252. MuSiQue: Multihop Questions via Single-hop Question Composition. Transactions of the Association for Computational Linguistics 10 (2022), 539–554. https://doi.org/10.1162/tacl_a_00475
  253. Interleaving Retrieval with Chain-of-Thought Reasoning for Knowledge-Intensive Multi-Step Questions. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, Toronto, Canada, 10014–10037. https://doi.org/10.18653/v1/2023.acl-long.557
  254. A Stitch in Time Saves Nine: Detecting and Mitigating Hallucinations of LLMs by Validating Low-Confidence Generation. arXiv:2307.03987 [cs.CL]
  255. BioMedLM: a domain-specific large language model for biomedical text. MosaicML (2022). Accessed: Dec 23, 2022.
  256. Denny Vrandečić and Markus Krötzsch. 2014. Wikidata: a free collaborative knowledgebase. Commun. ACM 57, 10 (2014), 78–85.
  257. FreshLLMs: Refreshing Large Language Models with Search Engine Augmentation. arXiv:2310.03214 [cs.CL]
  258. G-MAP: General Memory-Augmented Pre-trained Language Model for Domain Tasks. arXiv:2212.03613 [cs.CL]
  259. Evaluating Open Question Answering Evaluation. arXiv:2305.12421 [cs.CL]
  260. Can Generative Pre-trained Language Models Serve As Knowledge Bases for Closed-book QA?. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers). Association for Computational Linguistics, Online, 3241–3251. https://doi.org/10.18653/v1/2021.acl-long.251
  261. Knowledgeable Salient Span Mask for Enhancing Language Models as Knowledge Base. In Natural Language Processing and Chinese Computing, Fei Liu, Nan Duan, Qingting Xu, and Yu Hong (Eds.). Springer Nature Switzerland, Cham, 444–456.
  262. RFiD: Towards Rational Fusion-in-Decoder for Open-Domain Question Answering. arXiv:2305.17041 [cs.CL]
  263. HuaTuo: Tuning LLaMA Model with Chinese Medical Knowledge. arXiv:2304.06975 [cs.CL]
  264. CMB: A Comprehensive Medical Benchmark in Chinese. arXiv preprint arXiv:2308.08833 (2023).
  265. Self-Consistency Improves Chain of Thought Reasoning in Language Models. arXiv:2203.11171 [cs.CL]
  266. Resolving Knowledge Conflicts in Large Language Models. arXiv preprint arXiv:2310.00935 (2023).
  267. Augmenting Black-box LLMs with Medical Textbooks for Clinical Question Answering. arXiv preprint arXiv:2309.02233 (2023).
  268. Preserving In-Context Learning ability in Large Language Model Fine-tuning. arXiv preprint arXiv:2211.00635 (2022).
  269. Aligning Large Language Models with Human: A Survey. arXiv preprint arXiv:2307.12966 (2023).
  270. Chain of Thought Prompting Elicits Reasoning in Large Language Models. In Advances in Neural Information Processing Systems, Alice H. Oh, Alekh Agarwal, Danielle Belgrave, and Kyunghyun Cho (Eds.). https://openreview.net/forum?id=_VjQlMeSB_J
  271. Constructing Datasets for Multi-hop Reading Comprehension Across Documents. Transactions of the Association for Computational Linguistics 6 (05 2018), 287–302. https://doi.org/10.1162/tacl_a_00021
  272. ”According to …” Prompting Language Models Improves Quoting from Pre-Training Data. arXiv:2305.13252 [cs.CL]
  273. ChatHome: Development and Evaluation of a Domain-Specific Language Model for Home Renovation. arXiv preprint arXiv:2307.15290 (2023).
  274. BloombergGPT: A Large Language Model for Finance. arXiv:2303.17564 [cs.LG]
  275. Adaptive Chameleon or Stubborn Sloth: Unraveling the Behavior of Large Language Models in Knowledge Clashes. arXiv:2305.13300 [cs.CL]
  276. PIXIU: A Large Language Model, Instruction Data and Evaluation Benchmark for Finance. arXiv preprint arXiv:2306.05443 (2023).
  277. DoctorGLM: Fine-tuning your Chinese Doctor is not a Herculean Task. arXiv:2304.01097 [cs.CL]
  278. Doctorglm: Fine-tuning your chinese doctor is not a herculean task. arXiv preprint arXiv:2304.01097 (2023).
  279. Ming Xu. 2023. MedicalGPT: Training Medical GPT Model. https://github.com/shibing624/MedicalGPT.
  280. Improving Factual Consistency for Knowledge-Grounded Dialogue Systems via Knowledge Enhancement and Alignment. arXiv:2310.08372 [cs.CL]
  281. Baichuan 2: Open Large-scale Language Models. arXiv preprint arXiv:2309.10305 (2023).
  282. ChatGPT is not Enough: Enhancing Large Language Models with Knowledge Graphs for Fact-aware Language Modeling. arXiv:2306.11489 [cs.CL]
  283. Zhongjing: Enhancing the Chinese Medical Capabilities of Large Language Model through Expert Feedback and Real-world Multi-turn Dialogue. arXiv preprint arXiv:2308.03549 (2023).
  284. HotpotQA: A Dataset for Diverse, Explainable Multi-hop Question Answering. arXiv:1809.09600 [cs.CL]
  285. ReAct: Synergizing Reasoning and Acting in Language Models. arXiv:2210.03629 [cs.CL]
  286. Editing Large Language Models: Problems, Methods, and Opportunities. arXiv:2305.13172 [cs.CL]
  287. Cognitive Mirage: A Review of Hallucinations in Large Language Models. arXiv:2309.06794 [cs.CL]
  288. Do Large Language Models Know What They Don’t Know?. In Findings of the Association for Computational Linguistics: ACL 2023. Association for Computational Linguistics, Toronto, Canada, 8653–8665. https://doi.org/10.18653/v1/2023.findings-acl.551
  289. Generate rather than retrieve: Large language models are strong context generators. In International Conference for Learning Representation (ICLR).
  290. Bartscore: Evaluating generated text as text generation. Advances in Neural Information Processing Systems 34 (2021), 27263–27277.
  291. Automatic Evaluation of Attribution by Large Language Models. arXiv preprint arXiv:2305.06311 (2023).
  292. Almanac: Retrieval-Augmented Language Models for Clinical Medicine. arXiv:2303.01229 [cs.CL]
  293. Glm-130b: An open bilingual pre-trained model. arXiv preprint arXiv:2210.02414 (2022).
  294. Investigating the Catastrophic Forgetting in Multimodal Large Language Models. arXiv preprint arXiv:2309.10313 (2023).
  295. HuatuoGPT, towards Taming Language Model to Be a Doctor. arXiv preprint arXiv:2305.15075 (2023).
  296. Fengshenbang 1.0: Being the Foundation of Chinese Cognitive Intelligence. CoRR abs/2209.02970 (2022).
  297. How language model hallucinations can snowball. arXiv preprint arXiv:2305.13534 (2023).
  298. Applications of transformer-based language models in bioinformatics: a survey. Bioinformatics Advances 3, 1 (01 2023), vbad001. https://doi.org/10.1093/bioadv/vbad001
  299. Mitigating Language Model Hallucination with Interactive Question-Knowledge Alignment. arXiv:2305.13669 [cs.CL]
  300. Opt: Open pre-trained transformer language models. arXiv preprint arXiv:2205.01068 (2022).
  301. BERTScore: Evaluating Text Generation with BERT. In International Conference on Learning Representations.
  302. BERTScore: Evaluating Text Generation with BERT. In International Conference on Learning Representations. https://openreview.net/forum?id=SkeHuCVFDr
  303. Interpretable Unified Language Checking. arXiv:2304.03728 [cs.CL]
  304. Siren’s Song in the AI Ocean: A Survey on Hallucination in Large Language Models. arXiv:2309.01219 [cs.CL]
  305. Plug-and-Play Knowledge Injection for Pre-trained Language Models. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Association for Computational Linguistics, Toronto, Canada, 10641–10658. https://aclanthology.org/2023.acl-long.594
  306. ”What do others think?”: Task-Oriented Conversational Modeling with Subjective Knowledge. arXiv:2305.12091 [cs.CL]
  307. A Survey of Large Language Models. arXiv preprint arXiv:2303.18223 (2023). http://arxiv.org/abs/2303.18223
  308. AGIEval: A Human-Centric Benchmark for Evaluating Foundation Models. arXiv:2304.06364 [cs.CL]
  309. MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions. arXiv:2305.14795 [cs.CL]
  310. LIMA: Less Is More for Alignment. arXiv:2305.11206 [cs.CL]
  311. A Dataset for Document Grounded Conversations. In Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing.
  312. Context-faithful Prompting for Large Language Models. arXiv preprint arXiv:2303.11315 (2023).
Authors (16)
  1. Cunxiang Wang
  2. Xiaoze Liu
  3. Yuanhao Yue
  4. Xiangru Tang
  5. Tianhang Zhang
  6. Cheng Jiayang
  7. Yunzhi Yao
  8. Wenyang Gao
  9. Xuming Hu
  10. Zehan Qi
  11. Yidong Wang
  12. Linyi Yang
  13. Jindong Wang
  14. Xing Xie
  15. Zheng Zhang
  16. Yue Zhang
Citations (131)