Open Domain Question Answering with A Unified Knowledge Interface (2110.08417v2)

Published 16 Oct 2021 in cs.CL and cs.AI

Abstract: The retriever-reader framework is popular for open-domain question answering (ODQA) due to its ability to use explicit knowledge. Although prior work has sought to increase the knowledge coverage by incorporating structured knowledge beyond text, accessing heterogeneous knowledge sources through a unified interface remains an open question. While data-to-text generation has the potential to serve as a universal interface for data and text, its feasibility for downstream tasks remains largely unknown. In this work, we bridge this gap and use the data-to-text method as a means for encoding structured knowledge for ODQA. Specifically, we propose a verbalizer-retriever-reader framework for ODQA over data and text where verbalized tables from Wikipedia and graphs from Wikidata are used as augmented knowledge sources. We show that our Unified Data and Text QA, UDT-QA, can effectively benefit from the expanded knowledge index, leading to large gains over text-only baselines. Notably, our approach sets the single-model state-of-the-art on Natural Questions. Furthermore, our analyses indicate that verbalized knowledge is preferred for answer reasoning for both adapted and hot-swap settings.

View on arXiv

Authors (5)

Kaixin Ma (35 papers)
Hao Cheng (190 papers)
Xiaodong Liu (162 papers)
Eric Nyberg (39 papers)
Jianfeng Gao (344 papers)

Citations (36)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Open Domain Question Answering with A Unified Knowledge Interface (2110.08417v2)

Summary

Related Papers