Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
80 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Mitigating Hallucination in Fictional Character Role-Play (2406.17260v2)

Published 25 Jun 2024 in cs.CL

Abstract: Role-playing has wide-ranging applications in customer support, embodied agents, and computational social science. The influence of parametric world knowledge of LLMs often causes role-playing characters to act out of character and to hallucinate about things outside the scope of their knowledge. In this work, we focus on the evaluation and mitigation of hallucination in fictional character role-play. We introduce a dataset with over 2,000 characters and 72,000 interviews, including 18,000 adversarial questions. We propose RoleFact, a role-playing method that mitigates hallucination by modulating the influence of parametric knowledge using a pre-calibrated confidence threshold. Experiments show that the proposed method improves the factual precision of generated responses by 18% for adversarial questions with a 44% reduction in temporal hallucination for time-sensitive interviews. The code and the dataset are available at https://github.com/NafisSadeq/rolefact.git.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Nafis Sadeq (6 papers)
  2. Zhouhang Xie (17 papers)
  3. Byungkyu Kang (3 papers)
  4. Prarit Lamba (4 papers)
  5. Xiang Gao (210 papers)
  6. Julian McAuley (238 papers)
Citations (3)
Github Logo Streamline Icon: https://streamlinehq.com