
HALO: An Ontology for Representing and Categorizing Hallucinations in Large Language Models (2312.05209v2)

Published 8 Dec 2023 in cs.AI and cs.CL

Abstract: Recent progress in generative AI, including LLMs like ChatGPT, has opened up significant opportunities in fields ranging from natural language processing to knowledge discovery and data mining. However, there is also a growing awareness that these models can be prone to problems such as making information up, or 'hallucinations', and faulty reasoning on seemingly simple problems. Because of the popularity of models like ChatGPT, both academic scholars and citizen scientists have documented hallucinations of several different types and severity. Despite this body of work, a formal model for describing and representing these hallucinations (with relevant metadata) at a fine-grained level is still lacking. In this paper, we address this gap by presenting the Hallucination Ontology, or HALO, a formal, extensible ontology written in OWL that currently offers support for six different types of hallucinations known to arise in LLMs, along with support for provenance and experimental metadata. We also collect and publish a dataset containing hallucinations that we inductively gathered across multiple independent Web sources, and show that HALO can be successfully used to model this dataset and answer competency questions.


Summary

  • The paper presents HALO, a structured ontology that standardizes the categorization of hallucinations in large language models.
  • It divides the framework into two modules—hallucination and metadata—to capture diverse error types and essential experimental details.
  • Validation with real-world datasets shows HALO’s effectiveness in modeling hallucination instances and enabling comparative analysis across AI systems.

Introduction

The proliferation of generative AI systems, particularly LLMs, has led to remarkable breakthroughs across numerous applications. Alongside these advances, however, a critical issue has emerged: these models can exhibit faulty reasoning or generate fabricated information, a phenomenon often referred to as "hallucinations". The paper introduces the Hallucination Ontology (HALO), a formal framework for representing and categorizing hallucination instances in generative models, giving researchers a standardized tool for systematic analysis and documentation.

Hallucination Challenges in AI

Hallucinations in AI raise substantial concerns, such as the potential for misinformation and the risk of undue reliance on the output of these systems. Despite widespread documentation of hallucinations across various platforms and studies, there has been a lack of formalized vocabulary or ontology to describe these occurrences systematically. This absence hinders empirical research and analysis, as data on hallucinations are often scattered and inconsistently described.

HALO fills this gap, offering a structured, openly licensed ontology written in OWL and adhering to the FAIR principles. The ontology supports six known types of hallucinations, with the flexibility to expand as new hallucination patterns emerge. It emphasizes extensibility and the inclusion of metadata, such as provenance and experimental details, facilitating comparisons between different models and hallucination instances.
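
To make this concrete, here is a minimal sketch in Python (using rdflib) of how an OWL ontology in HALO's style might declare hallucination categories as an extensible class hierarchy. The namespace and class names below are illustrative assumptions, not the published HALO terms.

from rdflib import Graph, Namespace, RDF, RDFS
from rdflib.namespace import OWL

# Placeholder namespace; the real HALO IRI may differ.
HALO = Namespace("https://example.org/halo#")

g = Graph()
g.bind("halo", HALO)

# A root class, plus hypothetical subtypes standing in for the six categories.
g.add((HALO.Hallucination, RDF.type, OWL.Class))
for subtype in ("FactualHallucination", "ReasoningHallucination"):
    g.add((HALO[subtype], RDF.type, OWL.Class))
    g.add((HALO[subtype], RDFS.subClassOf, HALO.Hallucination))

print(g.serialize(format="turtle"))

New categories can be accommodated by declaring further subclasses, which is the extensibility property the paper emphasizes.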

HALO's Design and Features

HALO consists of two primary modules, the Hallucination Module and the Metadata Module, which separate the diverse categories of hallucinations from the more standard concepts used to capture experimental data. This division allows the ontology to adapt and expand as further research uncovers additional categories or subtypes of hallucinations.

The ontology connects hallucination instances to external classes and aligns with the latest findings in AI research, aiming for a broad scope and interoperability with published vocabularies. Additionally, HALO supports metadata representation, crucial for cross-analysis and understanding the context of each hallucination, including details such as the specific LLM that generated the error, the date of occurrence, and the source of detection.
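
As a hedged sketch of what such a record could look like, the following rdflib snippet attaches that kind of metadata to a single hallucination instance; every property name here (generatedBy, recordedOn, detectedVia) is a hypothetical stand-in for HALO's actual vocabulary.

from rdflib import Graph, Literal, Namespace, RDF
from rdflib.namespace import XSD

HALO = Namespace("https://example.org/halo#")  # illustrative namespace
EX = Namespace("https://example.org/data#")    # sample instance data

g = Graph()
# One hallucination instance with model, date, and detection-source metadata.
g.add((EX.h1, RDF.type, HALO.FactualHallucination))
g.add((EX.h1, HALO.generatedBy, Literal("ChatGPT")))
g.add((EX.h1, HALO.recordedOn, Literal("2023-05-17", datatype=XSD.date)))
g.add((EX.h1, HALO.detectedVia, Literal("social media post")))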

Evaluation and Implications

Using a dataset compiled from multiple independent web sources, HALO was tested on its ability to model hallucination instances and answer competency questions about them. The results showed that HALO models the dataset successfully and answers these questions accurately, demonstrating its practical utility.
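
Competency questions over an OWL ontology are typically posed as SPARQL queries. The sketch below, again using illustrative halo: terms and made-up sample data rather than the paper's actual vocabulary and dataset, answers one such question: which models produced factual hallucinations, and how many?

from rdflib import Graph

g = Graph()
g.parse(data="""
@prefix halo: <https://example.org/halo#> .
@prefix ex:   <https://example.org/data#> .
ex:h1 a halo:FactualHallucination ; halo:generatedBy "ChatGPT" .
ex:h2 a halo:FactualHallucination ; halo:generatedBy "ChatGPT" .
""", format="turtle")

query = """
PREFIX halo: <https://example.org/halo#>
SELECT ?model (COUNT(?h) AS ?n)
WHERE { ?h a halo:FactualHallucination ; halo:generatedBy ?model . }
GROUP BY ?model
"""
for row in g.query(query):
    print(row.model, row.n)  # e.g. ChatGPT 2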

The development of HALO is a step toward comprehensively understanding and mitigating hallucinations in AI, with the potential to inform future improvements in generative models. By enabling standardized documentation and analysis, researchers can systematically study hallucinations and contribute to the ongoing refinement of AI systems.
