Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
80 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Asking Clarification Questions to Handle Ambiguity in Open-Domain QA (2305.13808v2)

Published 23 May 2023 in cs.CL

Abstract: Ambiguous questions persist in open-domain question answering, because formulating a precise question with a unique answer is often challenging. Previously, Min et al. (2020) have tackled this issue by generating disambiguated questions for all possible interpretations of the ambiguous question. This can be effective, but not ideal for providing an answer to the user. Instead, we propose to ask a clarification question, where the user's response will help identify the interpretation that best aligns with the user's intention. We first present CAMBIGNQ, a dataset consisting of 5,654 ambiguous questions, each with relevant passages, possible answers, and a clarification question. The clarification questions were efficiently created by generating them using InstructGPT and manually revising them as necessary. We then define a pipeline of tasks and design appropriate evaluation metrics. Lastly, we achieve 61.3 F1 on ambiguity detection and 40.5 F1 on clarification-based QA, providing strong baselines for future work.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Dongryeol Lee (13 papers)
  2. Segwang Kim (4 papers)
  3. Minwoo Lee (31 papers)
  4. Hwanhee Lee (36 papers)
  5. Joonsuk Park (24 papers)
  6. Sang-Woo Lee (34 papers)
  7. Kyomin Jung (76 papers)
Citations (9)