Studying Large Language Model Behaviors Under Context-Memory Conflicts With Real Documents (2404.16032v2)

Published 24 Apr 2024 in cs.LG

Abstract: Retrieval-augmented generation (RAG) mitigates many problems of fully parametric LLMs, such as temporal degradation, hallucinations, and lack of grounding. In RAG, the model's knowledge can be updated from documents provided in context. This leads to cases of conflict between the model's parametric knowledge and the contextual information, where the model may not always update its knowledge. Previous work studied context-memory knowledge conflicts by creating synthetic documents that contradict the model's correct parametric answers. We present a framework for studying such knowledge conflicts in a realistic setup. We update incorrect parametric knowledge using real conflicting documents. This reflects how knowledge conflicts arise in practice. In this realistic scenario, we find that knowledge updates fail less often than previously reported. In cases where the models still fail to update their answers, we find a parametric bias: the incorrect parametric answer appearing in context makes the knowledge update likelier to fail. These results suggest that the factual parametric knowledge of LLMs can negatively influence their reading abilities and behaviors. Our code is available at https://github.com/kortukov/realistic_knowledge_conflicts/ .

PDF Abstract

Summarize PDF Markdown Bookmark Chat (Pro)

Authors (4)

Evgenii Kortukov (6 papers)
Alexander Rubinstein (9 papers)
Elisa Nguyen (7 papers)
Seong Joon Oh (60 papers)

Citations (1)

View on Semantic Scholar

GitHub

GitHub - kortukov/realistic_knowledge_conflicts: Code for the paper "Studying Large Language Model Behaviors Under Realistic Knowledge Conflicts" (15 stars)

Tweets

https://twitter.com/EKortukov/status/1783564158518129001

Studying Large Language Model Behaviors Under Context-Memory Conflicts With Real Documents (2404.16032v2)

Related Papers

GitHub

Tweets