MultiReQA: A Cross-Domain Evaluation for Retrieval Question Answering Models (2005.02507v1)

Published 5 May 2020 in cs.CL and cs.LG

Abstract: Retrieval question answering (ReQA) is the task of retrieving a sentence-level answer to a question from an open corpus (Ahmad et al., 2019). This paper presents MultiReQA, a new multi-domain ReQA evaluation suite composed of eight retrieval QA tasks drawn from publicly available QA datasets. We provide the first systematic retrieval-based evaluation over these datasets using two supervised neural models, based on fine-tuning BERT and USE-QA models respectively, as well as a surprisingly strong information retrieval baseline, BM25. Five of these tasks contain both training and test data, while three contain test data only. Performance on the five tasks with training data shows that while a general model covering all domains is achievable, the best performance is often obtained by training exclusively on in-domain data.
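
To make the BM25 baseline concrete: in the ReQA setup, every candidate answer sentence in the corpus is indexed, and each question is scored against all candidates. Below is a minimal sketch of such a sentence-level BM25 retriever. The paper does not specify an implementation; the rank_bm25 package, whitespace tokenization, and the toy corpus here are assumptions chosen purely for illustration.

```python
# Minimal sketch of a BM25 sentence-retrieval baseline for ReQA.
# Assumptions (not from the paper): the rank_bm25 package
# (pip install rank-bm25), lowercase whitespace tokenization,
# and a toy three-sentence corpus.
from rank_bm25 import BM25Okapi

# Candidate answer sentences; in MultiReQA these would come from
# the answer corpora of the eight QA datasets.
corpus = [
    "BM25 is a bag-of-words ranking function used in information retrieval.",
    "BERT is a transformer encoder pretrained with masked language modeling.",
    "USE-QA is a dual encoder trained for question-answer retrieval.",
]
tokenized_corpus = [sentence.lower().split() for sentence in corpus]
bm25 = BM25Okapi(tokenized_corpus)

query = "What ranking function does classical information retrieval use?"
tokenized_query = query.lower().split()

# Score every candidate sentence against the question and
# return the single best-matching answer sentence.
top_answer = bm25.get_top_n(tokenized_query, corpus, n=1)[0]
print(top_answer)
```

Because BM25 relies only on lexical overlap between the question and candidate sentences, its surprisingly strong performance in the paper suggests substantial term overlap between questions and answers in several of the evaluated domains.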

Authors (5)
  1. Mandy Guo (21 papers)
  2. Yinfei Yang (73 papers)
  3. Daniel Cer (28 papers)
  4. Qinlan Shen (6 papers)
  5. Noah Constant (32 papers)
Citations (43)