Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

CREPE: Open-Domain Question Answering with False Presuppositions (2211.17257v1)

Published 30 Nov 2022 in cs.CL and cs.AI

Abstract: Information seeking users often pose questions with false presuppositions, especially when asking about unfamiliar topics. Most existing question answering (QA) datasets, in contrast, assume all questions have well defined answers. We introduce CREPE, a QA dataset containing a natural distribution of presupposition failures from online information-seeking forums. We find that 25% of questions contain false presuppositions, and provide annotations for these presuppositions and their corrections. Through extensive baseline experiments, we show that adaptations of existing open-domain QA models can find presuppositions moderately well, but struggle when predicting whether a presupposition is factually correct. This is in large part due to difficulty in retrieving relevant evidence passages from a large text corpus. CREPE provides a benchmark to study question answering in the wild, and our analyses provide avenues for future work in better modeling and further studying the task.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Xinyan Velocity Yu (10 papers)
  2. Sewon Min (45 papers)
  3. Luke Zettlemoyer (225 papers)
  4. Hannaneh Hajishirzi (176 papers)
Citations (39)

Summary

We haven't generated a summary for this paper yet.