Fakes of Varying Shades: How Warning Affects Human Perception and Engagement Regarding LLM Hallucinations (2404.03745v3)
Abstract: The widespread adoption and transformative effects of LLMs have sparked concerns about their capacity to produce inaccurate and fictitious content, referred to as 'hallucinations'. Given the potential risks associated with hallucinations, humans should be able to identify them. This research examines human perception of LLM hallucinations by systematically varying the degree of hallucination (genuine, minor hallucination, major hallucination) and its interaction with a warning of potential inaccuracies (absent vs. present). Participants (N=419) recruited from Prolific rated the perceived accuracy of, and engaged with (e.g., like, dislike, share), content presented in a Q/A format. Participants ranked content as truthful in the order of genuine, minor hallucination, and major hallucination, and their engagement behaviors mirrored this pattern. More importantly, we observed that warnings improved the detection of hallucinations without significantly affecting the perceived truthfulness of genuine content. We conclude by offering insights for future tools to aid human detection of hallucinations. All survey materials, demographic questions, and post-session questions are available at: https://github.com/MahjabinNahar/fakes-of-varying-shades-survey-materials
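The abstract describes a 3 (hallucination degree) x 2 (warning) factorial design with perceived-accuracy ratings as a key outcome. Below is a minimal sketch of how such a design could be analyzed with a two-way ANOVA. Everything here is an assumption for illustration: the abstract does not specify the statistical procedure, the rating scale, the cell sizes, a between-subjects split, or the simulated effect pattern; column names (`degree`, `warning`, `rating`) are hypothetical.

```python
# Hypothetical sketch of a 3x2 factorial analysis (hallucination degree x warning).
# Simulated data only; the actual study's design details and analysis are not
# stated in the abstract and may differ.
import numpy as np
import pandas as pd
import statsmodels.api as sm
from statsmodels.formula.api import ols

rng = np.random.default_rng(0)
n_per_cell = 70  # ~419 participants spread over 6 cells (hypothetical split)

rows = []
for degree, base in [("genuine", 5.5), ("minor", 4.0), ("major", 2.5)]:
    for warning, shift in [("absent", 0.0), ("present", -0.5)]:
        # Simulated 1-7 perceived-accuracy ratings; here the warning lowers
        # ratings for hallucinated content only (hypothetical effect pattern
        # mirroring the abstract's finding).
        effect = shift if degree != "genuine" else 0.0
        ratings = np.clip(rng.normal(base + effect, 1.0, n_per_cell), 1, 7)
        rows += [{"degree": degree, "warning": warning, "rating": r} for r in ratings]

df = pd.DataFrame(rows)
model = ols("rating ~ C(degree) * C(warning)", data=df).fit()
print(sm.stats.anova_lm(model, typ=2))  # main effects and the degree x warning interaction
```

Under this sketch, the abstract's pattern would show up as a main effect of degree (genuine rated most accurate, major hallucination least), plus a degree-by-warning interaction: the warning shifts ratings for hallucinated content while leaving genuine content largely unaffected.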
Authors:
- Mahjabin Nahar
- Haeseung Seo
- Eun-Ju Lee
- Aiping Xiong
- Dongwon Lee