Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

When to Read Documents or QA History: On Unified and Selective Open-domain QA (2306.04176v1)

Published 7 Jun 2023 in cs.CL and cs.AI

Abstract: This paper studies the problem of open-domain question answering, with the aim of answering a diverse range of questions leveraging knowledge resources. Two types of sources, QA-pair and document corpora, have been actively leveraged with the following complementary strength. The former is highly precise when the paraphrase of given question $q$ was seen and answered during training, often posed as a retrieval problem, while the latter generalizes better for unseen questions. A natural follow-up is thus leveraging both models, while a naive pipelining or integration approaches have failed to bring additional gains over either model alone. Our distinction is interpreting the problem as calibration, which estimates the confidence of predicted answers as an indicator to decide when to use a document or QA-pair corpus. The effectiveness of our method was validated on widely adopted benchmarks such as Natural Questions and TriviaQA.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Kyungjae Lee (37 papers)
  2. Sang-eun Han (3 papers)
  3. Seung-won Hwang (59 papers)
  4. Moontae Lee (54 papers)
Citations (4)

Summary

We haven't generated a summary for this paper yet.