
DisentQA: Disentangling Parametric and Contextual Knowledge with Counterfactual Question Answering (2211.05655v1)

Published 10 Nov 2022 in cs.CL, cs.AI, and cs.LG

Abstract: Question answering models commonly have access to two sources of "knowledge" during inference time: (1) parametric knowledge - the factual knowledge encoded in the model weights, and (2) contextual knowledge - external knowledge (e.g., a Wikipedia passage) given to the model to generate a grounded answer. Having these two sources of knowledge entangled together is a core issue for generative QA models, as it is unclear whether the answer stems from the given non-parametric knowledge or not. This lack of clarity has implications for trust, interpretability and factuality. In this work, we propose a new paradigm in which QA models are trained to disentangle the two sources of knowledge. Using counterfactual data augmentation, we introduce a model that predicts two answers for a given question: one based on given contextual knowledge and one based on parametric knowledge. Our experiments on the Natural Questions dataset show that this approach improves the performance of QA models by making them more robust to knowledge conflicts between the two knowledge sources, while generating useful disentangled answers.
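
As a rough illustration of the counterfactual augmentation idea described in the abstract, the sketch below swaps the answer span in a passage for a substitute so that the contextual answer and the parametric answer diverge, then formats a two-answer target. The `QAExample` class, the `make_counterfactual` and `format_target` helpers, and the `contextual: ... | parametric: ...` target string are hypothetical names and formats chosen for this example; they are not taken from the paper's code.

```python
from dataclasses import dataclass


@dataclass
class QAExample:
    # Hypothetical container for one training example.
    question: str
    context: str
    contextual_answer: str   # answer supported by the passage
    parametric_answer: str   # answer the model is assumed to "know"


def make_counterfactual(example: QAExample, substitute: str) -> QAExample:
    """Replace the answer span in the context with a substitute answer.

    The contextual answer becomes the substitute, while the parametric
    answer is left unchanged, creating a deliberate knowledge conflict.
    """
    if example.contextual_answer not in example.context:
        raise ValueError("answer string not found in context")
    new_context = example.context.replace(example.contextual_answer, substitute)
    return QAExample(
        question=example.question,
        context=new_context,
        contextual_answer=substitute,
        parametric_answer=example.parametric_answer,
    )


def format_target(example: QAExample) -> str:
    # Assumed output format for a model that predicts both answers.
    return (
        f"contextual: {example.contextual_answer} | "
        f"parametric: {example.parametric_answer}"
    )


factual = QAExample(
    question="Who wrote Hamlet?",
    context="Hamlet is a tragedy written by William Shakespeare.",
    contextual_answer="William Shakespeare",
    parametric_answer="William Shakespeare",
)
counterfactual = make_counterfactual(factual, "Jane Austen")
print(counterfactual.context)
print(format_target(counterfactual))
```

Training on both the factual and the counterfactual version of an example is one plausible way to teach a model to keep the two answers apart; the simple string substitution here is a stand-in for whatever answer-replacement procedure is actually used.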

Authors (6)
  1. Ella Neeman (2 papers)
  2. Roee Aharoni (35 papers)
  3. Or Honovich (9 papers)
  4. Leshem Choshen (78 papers)
  5. Idan Szpektor (47 papers)
  6. Omri Abend (75 papers)
Citations (65)