A Survey of Large Language Models Attribution (2311.03731v2)

Published 7 Nov 2023 in cs.CL

Abstract: Open-domain generative systems have gained significant attention in the field of conversational AI (e.g., generative search engines). This paper presents a comprehensive review of the attribution mechanisms employed by these systems, particularly LLMs. Though attribution or citation improve the factuality and verifiability, issues like ambiguous knowledge reservoirs, inherent biases, and the drawbacks of excessive attribution can hinder the effectiveness of these systems. The aim of this survey is to provide valuable insights for researchers, aiding in the refinement of attribution methodologies to enhance the reliability and veracity of responses generated by open-domain generative systems. We believe that this field is still in its early stages; hence, we maintain a repository to keep track of ongoing studies at https://github.com/HITsz-TMG/awesome-LLM-attributions.

References (84)
  1. Do language models know when they’re hallucinating references? CoRR, abs/2305.18248.
  2. Palm 2 technical report. ArXiv, abs/2305.10403.
  3. Anonymous. 2023. Learning to plan and generate text with citations. In Submitted to The Twelfth International Conference on Learning Representations. Under review.
  4. Self-rag: Learning to retrieve, generate, and critique through self-reflection. CoRR, abs/2310.11511.
  5. Amos Azaria and Tom M. Mitchell. 2023. The internal state of an LLM knows when it's lying. CoRR, abs/2304.13734.
  6. Training a helpful and harmless assistant with reinforcement learning from human feedback. ArXiv, abs/2204.05862.
  7. Attributed question answering: Evaluation and modeling for attributed large language models. CoRR, abs/2212.08037.
  8. Improving language models by retrieving from trillions of tokens. In International Conference on Machine Learning, ICML 2022, 17-23 July 2022, Baltimore, Maryland, USA, volume 162 of Proceedings of Machine Learning Research, pages 2206–2240. PMLR.
  9. A large annotated corpus for learning natural language inference. In Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, EMNLP 2015, Lisbon, Portugal, September 17-21, 2015, pages 632–642. The Association for Computational Linguistics.
  10. Sergey Brin and Lawrence Page. 1998. The anatomy of a large-scale hypertextual web search engine. Comput. Networks, 30:107–117.
  11. Reading wikipedia to answer open-domain questions. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics, ACL 2017, Vancouver, Canada, July 30 - August 4, Volume 1: Long Papers, pages 1870–1879. Association for Computational Linguistics.
  12. Understanding retrieval augmentation for long-form question answering.
  13. Complex claim verification with evidence retrieved in the wild. CoRR, abs/2305.11859.
  14. Factool: Factuality detection in generative AI - A tool augmented framework for multi-task and multi-domain scenarios. CoRR, abs/2307.13528.
  15. Quantifying and attributing the hallucination of large language models via association analysis. CoRR, abs/2309.05217.
  16. Finding news citations for wikipedia. In Proceedings of the 25th ACM International Conference on Information and Knowledge Management, CIKM 2016, Indianapolis, IN, USA, October 24-28, 2016, pages 337–346. ACM.
  17. Citebench: A benchmark for scientific citation text generation.
  18. RARR: researching and revising what language models say, using language models. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), ACL 2023, Toronto, Canada, July 9-14, 2023, pages 16477–16508. Association for Computational Linguistics.
  19. Enabling large language models to generate text with citations. CoRR, abs/2305.14627.
  20. Improving alignment of dialogue agents via targeted human judgements. CoRR, abs/2209.14375.
  21. Learning to fake it: Limited responses and fabricated references provided by chatgpt for medical questions. Mayo Clinic Proceedings: Digital Health, 1(3):226–234.
  22. Nianlong Gu and Richard H. R. Hahnloser. 2022. Controllable citation text generation. CoRR, abs/2211.07066.
  23. Textbooks are all you need. CoRR, abs/2306.11644.
  24. A survey on automated fact-checking. Trans. Assoc. Comput. Linguistics, 10:178–206.
  25. Understanding in-context learning via supportive pretraining data. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), ACL 2023, Toronto, Canada, July 9-14, 2023, pages 12660–12673. Association for Computational Linguistics.
  26. Xiaochuang Han and Yulia Tsvetkov. 2022. ORCA: interpreting prompted language models via locating supporting data evidence in the ocean of pretraining data. CoRR, abs/2205.12600.
  27. Rethinking with retrieval: Faithful large language model inference. CoRR, abs/2301.00303.
  28. Jie Huang and Kevin Chen-Chuan Chang. 2023. Citation: A key to building responsible and accountable large language models. CoRR, abs/2307.02185.
  29. Retrieving supporting evidence for generative question answering. CoRR, abs/2309.11392.
  30. Gautier Izacard and Edouard Grave. 2021. Leveraging passage retrieval with generative models for open domain question answering. In Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume, EACL 2021, Online, April 19 - 23, 2021, pages 874–880. Association for Computational Linguistics.
  31. Alon Jacovi and Yoav Goldberg. 2020. Towards faithfully interpretable NLP systems: How should we define and evaluate faithfulness? In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pages 4198–4205, Online. Association for Computational Linguistics.
  32. 1-pager: One pass answer generation and evidence retrieval. CoRR, abs/2310.16568.
  33. Survey of hallucination in natural language generation. ACM Comput. Surv., 55(12):248:1–248:38.
  34. HAGRID: A human-llm collaborative dataset for generative information-seeking with attribution. arXiv:2307.16883.
  35. Wice: Real-world entailment for claims in wikipedia. CoRR, abs/2303.01432.
  36. LAMBADA: Backward chaining for automated reasoning in natural language. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 6547–6568, Toronto, Canada. Association for Computational Linguistics.
  37. Omar Khattab and Matei Zaharia. 2020. Colbert: Efficient and effective passage search via contextualized late interaction over BERT. In Proceedings of the 43rd International ACM SIGIR conference on research and development in Information Retrieval, SIGIR 2020, Virtual Event, China, July 25-30, 2020, pages 39–48. ACM.
  38. Rethinking explainability as a dialogue: A practitioner’s perspective. CoRR, abs/2202.01875.
  39. Towards reliable and fluent large language models: Incorporating feedback learning loops in qa systems. arXiv preprint arXiv:2309.06384.
  40. Latent retrieval for weakly supervised open domain question answering. In Proceedings of the 57th Conference of the Association for Computational Linguistics, ACL 2019, Florence, Italy, July 28- August 2, 2019, Volume 1: Long Papers, pages 6086–6096. Association for Computational Linguistics.
  41. Retrieval-augmented generation for knowledge-intensive NLP tasks. In Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020, December 6-12, 2020, virtual.
  42. Unifying model explainability and robustness for joint text classification and rationale extraction. In Thirty-Sixth AAAI Conference on Artificial Intelligence, AAAI 2022, Thirty-Fourth Conference on Innovative Applications of Artificial Intelligence, IAAI 2022, The Twelveth Symposium on Educational Advances in Artificial Intelligence, EAAI 2022 Virtual Event, February 22 - March 1, 2022, pages 10947–10955. AAAI Press.
  43. Llatrieval: Llm-verified retrieval for verifiable generation. arXiv preprint arXiv:2311.07838.
  44. Towards verifiable generation: A benchmark for knowledge-aware language model attribution.
  45. Establishing trustworthiness: Rethinking tasks and model evaluation.
  46. Frederick Liu and Besim Avci. 2019. Incorporating priors with feature attribution on text classification. In Proceedings of the 57th Conference of the Association for Computational Linguistics, ACL 2019, Florence, Italy, July 28- August 2, 2019, Volume 1: Long Papers, pages 6274–6283. Association for Computational Linguistics.
  47. Evaluating verifiability in generative search engines. ArXiv, abs/2304.09848.
  48. Expertqa: Expert-curated questions and attributed answers. ArXiv, abs/2309.07852.
  49. Selfcheckgpt: Zero-resource black-box hallucination detection for generative large language models. CoRR, abs/2303.08896.
  50. Teaching language models to support answers with verified quotes. ArXiv, abs/2203.11147.
  51. Factscore: Fine-grained atomic evaluation of factual precision in long form text generation. CoRR, abs/2305.14251.
  52. Evaluating and modeling attribution for cross-lingual question answering. CoRR, abs/2305.14332.
  53. Webgpt: Browser-assisted question-answering with human feedback. arXiv preprint arXiv:2112.09332.
  54. OpenAI. 2022. Chatgpt: Optimizing language models for dialogue.
  55. OpenAI. 2023. Gpt-4 technical report. ArXiv, abs/2303.08774.
  56. Training language models to follow instructions with human feedback. ArXiv, abs/2203.02155.
  57. The PageRank citation ranking: Bringing order to the web. In The Web Conference.
  58. The refinedweb dataset for falcon llm: Outperforming curated corpora with web data, and web data only. ArXiv, abs/2306.01116.
  59. Denis Peskoff and Brandon Stewart. 2023. Credible without credit: Domain experts assess generative language models. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), ACL 2023, Toronto, Canada, July 9-14, 2023, pages 427–438. Association for Computational Linguistics.
  60. Improving wikipedia verifiability with AI. CoRR, abs/2207.06220.
  61. The ROOTS search tool: Data transparency for llms. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics: System Demonstrations, ACL 2023, Toronto, Canada, July 10-12, 2023, pages 304–314. Association for Computational Linguistics.
  62. Webbrain: Learning to generate factually correct articles for queries by grounding on large web corpus. CoRR, abs/2304.04358.
  63. WebCPM: Interactive web search for Chinese long-form question answering. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 8968–8988, Toronto, Canada. Association for Computational Linguistics.
  64. Measuring attribution in natural language generation models. CoRR, abs/2112.12870.
  65. A survey of hallucination in large foundation models. ArXiv, abs/2309.05922.
  66. Smartbook: Ai-assisted situation report generation. CoRR, abs/2303.14337.
  67. SEMQA: semi-extractive multi-source question answering. CoRR, abs/2311.04886.
  68. Retrieval augmentation reduces hallucination in conversation. In Findings of the Association for Computational Linguistics: EMNLP 2021, Virtual Event / Punta Cana, Dominican Republic, 16-20 November, 2021, pages 3784–3803. Association for Computational Linguistics.
  69. Recitation-augmented language models. In The Eleventh International Conference on Learning Representations, ICLR 2023, Kigali, Rwanda, May 1-5, 2023. OpenReview.net.
  70. Transformer memory as a differentiable search index. In NeurIPS.
  71. Lamda: Language models for dialog applications. ArXiv, abs/2201.08239.
  72. FEVER: a large-scale dataset for fact extraction and verification. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2018, New Orleans, Louisiana, USA, June 1-6, 2018, Volume 1 (Long Papers), pages 809–819. Association for Computational Linguistics.
  73. Haoran Wang and Kai Shu. 2023. Explainable claim verification via knowledge-grounded reasoning with large language models.
  74. " according to…" prompting language models improves quoting from pre-training data. arXiv preprint arXiv:2305.13252.
  75. Towards generating citation sentences for multiple references with intent control. CoRR, abs/2112.01332.
  76. Adaptive chameleon or stubborn sloth: Unraveling the behavior of large language models in knowledge clashes. ArXiv, abs/2305.13300.
  77. Automatic generation of citation texts in scholarly papers: A pilot study. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, ACL 2020, Online, July 5-10, 2020, pages 6181–6190. Association for Computational Linguistics.
  78. Search-in-the-chain: Towards the accurate, credible and traceable content generation for complex knowledge-intensive tasks. CoRR, abs/2304.14732.
  79. Cognitive mirage: A review of hallucinations in large language models. ArXiv, abs/2309.06794.
  80. Effective large language model adaptation for improved grounding. arXiv preprint arXiv:2311.09533.
  81. Automatic evaluation of attribution by large language models. CoRR, abs/2305.06311.
  82. Mitigating language model hallucination with interactive question-knowledge alignment. CoRR, abs/2305.13669.
  83. Siren’s song in the ai ocean: A survey on hallucination in large language models. ArXiv, abs/2309.01219.
  84. Chatgpt hallucinates when attributing answers.

Summary

  • The paper presents a systematic taxonomy of attribution methods in LLMs to reduce hallucinations.
  • It analyzes various datasets and techniques, including direct generation and post-retrieval approaches, to provide verifiable citations.
  • Evaluation combines human judgment with automatic metrics, using citation precision, recall, and F1-score to assess whether cited sources actually support the generated claims.

A Survey of Large Language Models Attribution: A Comprehensive Overview

The paper "A Survey of LLMs Attribution" proposes an extensive exploration into the mechanisms and methodologies that underpin attribution within LLMs. Attribution, within the context of LLMs, pertains to the provision of source citations for generated content, thus enhancing factual accuracy and credibility. This paper delineates the current landscape of attribution methods, highlighting the challenges and advances within this domain.

Introduction and Motivation

The paper outlines the growing prominence of open-domain generative systems powered by LLMs. Despite the improved capabilities of these models, a persistent challenge remains: the "hallucination" problem, where outputs state distorted facts without credible sourcing. Attribution is presented as a practical way to mitigate hallucination: by emitting citations, LLMs provide verifiable sources, allowing both developers and users to assess the factuality of the generated content.

Core Classification and Methodological Approaches

The paper employs a systematic taxonomy to categorize sources, datasets, and methods for attribution in LLMs.

  1. Sources of Attribution:
    • Pre-training Data: Attribution is inherently linked to the vast corpora from which LLMs learn. Identifying specific data subsets that influence model decisions can enhance the interpretability of these systems.
    • Out-of-model Knowledge: The inclusion of external databases such as the web or knowledge graphs enhances model reliability by anchoring generated content to verifiable sources.
  2. Datasets for Attribution:
    • Datasets play a key role in evaluating attribution. This paper references various datasets that have been adapted for examining the accuracy of citation mechanisms within LLMs. These datasets vary in domain focus and citation granularity, ranging from general open-domain datasets to domain-specific ones.
  3. Approaches to Attribution (a minimal sketch of all three workflows appears after this list):
    • Direct Generated Attribution: The model draws on its own parametric knowledge to produce citations; this approach struggles with accuracy and relevance.
    • Post-retrieval Answering: Retrieved information forms the basis of the generated answer. While effective, this requires clear boundaries between retrieved and inherent model knowledge.
    • Post-Generation Attribution: Retrieval is performed after the answer is generated, so that claims can be checked against and grounded in retrieved evidence.
  4. Attribution Systems:
    • Systems such as GopherCite and WebGPT implement real-time retrieval to improve accuracy and remain up-to-date with reliable references. These systems showcase the integration of external web sources and underscore the benefits of reinforcement learning for refining citation processes.

Evaluation and Challenges

Attribution is evaluated through both human judgment and automatic metrics. Studies report citation recall, precision, and F1-score to measure how well cited references support the generated text; a toy computation of these metrics is sketched below. Open challenges include citation granularity, mistaken synthesis of sources, hallucination, excessive citation, bias, and evolving web content. A fine balance is required: attribution must be accurate without sacrificing the model's generative flexibility.
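As a toy illustration (not the survey's exact evaluation protocol), citation precision, recall, and F1 for a single answer can be computed from the set of sources the model cited and the set that human judges deemed supporting:

```python
# cited: sources the model attached; supporting: sources judges found
# to actually back the answer's claims. Both are assumption-level inputs.

def citation_prf(cited: set, supporting: set):
    tp = len(cited & supporting)                    # correctly cited sources
    precision = tp / len(cited) if cited else 0.0   # cited AND supporting
    recall = tp / len(supporting) if supporting else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    return precision, recall, f1

# Example: model cites {1, 2, 4}; judges find {2, 4, 5} supporting
# -> precision 2/3, recall 2/3, F1 2/3.
print(citation_prf({1, 2, 4}, {2, 4, 5}))
```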

Implications and Future Directions

The implications of this research are multifaceted:

  • Practical Impact: Improved citation mechanisms bolster the trust in LLM outputs, fostering reliability in applications across knowledge-intensive fields.
  • Theoretical Impact: Theoretical advancements in attribution can guide the refinement of LLM architectures and training processes.

Looking forward, the authors advocate for continuous updates to LLMs, akin to the real-time data refreshes seen in search engines. This progression ensures outputs remain timely and relevant. Additionally, a call is made for establishing robust frameworks to consistently evaluate source credibility.

Conclusion

This detailed examination of attribution within LLMs provides critical insights into improving model reliability. By systematically addressing attribution challenges and implementing structured approaches, the surveyed frameworks aim not only to enhance interpretability but also to advance the trustworthiness and practical utility of LLMs in real-world applications. As this field continues to evolve, robust attribution mechanisms will likely serve as a cornerstone for the integrity and advancement of AI systems.
