Unsupervised multiple choices question answering via universal corpus (2402.17333v1)
Published 27 Feb 2024 in cs.CL
Abstract: Unsupervised question answering is a promising yet challenging task that alleviates the burden of building large-scale annotated data in a new domain, which motivates us to study the unsupervised multiple-choice question answering (MCQA) problem. In this paper, we propose a novel framework designed to generate synthetic MCQA data based solely on contexts from the universal domain, without relying on any form of manual annotation. Possible answers are extracted and used to produce related questions; we then leverage both named entities (NEs) and knowledge graphs to discover plausible distractors and form complete synthetic samples. Experiments on multiple MCQA datasets demonstrate the effectiveness of our method.
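To make the pipeline concrete, here is a minimal sketch of the three steps the abstract describes, under stated assumptions: spaCy NER stands in for the paper's answer extractor, a simple cloze transformation stands in for question generation, and the hand-coded `KG_NEIGHBOURS` table is a hypothetical stand-in for a knowledge-graph lookup (e.g. ConceptNet). This is an illustration, not the authors' implementation.

```python
# Minimal sketch of the synthetic-MCQA pipeline described in the abstract.
# Assumptions: spaCy NER approximates answer extraction, a cloze transform
# approximates question generation, and KG_NEIGHBOURS is a hypothetical
# stand-in for a knowledge-graph (e.g. ConceptNet) distractor lookup.
# Requires: pip install spacy && python -m spacy download en_core_web_sm
import random
import spacy

nlp = spacy.load("en_core_web_sm")

# Hypothetical same-type candidates per NE label: plausible but incorrect.
KG_NEIGHBOURS = {
    "GPE": ["Paris", "Berlin", "Madrid", "Rome"],
    "PERSON": ["Marie Curie", "Alan Turing", "Ada Lovelace"],
    "DATE": ["1912", "1945", "1969"],
}

def make_mcqa(context: str, num_distractors: int = 3):
    """Turn a raw context into cloze-style MCQA samples with NE-typed distractors."""
    samples = []
    for ent in nlp(context).ents:                          # 1) extract possible answers
        question = context.replace(ent.text, "_____", 1)   # 2) cloze-style question
        pool = [c for c in KG_NEIGHBOURS.get(ent.label_, []) if c != ent.text]
        if len(pool) < num_distractors:                    # skip if too few same-type candidates
            continue
        options = random.sample(pool, num_distractors) + [ent.text]  # 3) distractors + answer
        random.shuffle(options)
        samples.append({"question": question, "options": options, "answer": ent.text})
    return samples

if __name__ == "__main__":
    for sample in make_mcqa("Einstein moved to Berlin in 1914."):
        print(sample)
```

Restricting distractors to candidates with the same NE type keeps the wrong options plausible but incorrect, which mirrors the paper's use of named entities and knowledge graphs to constrain distractor discovery.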
Authors: Qin Zhang, Hao Ge, Xiaojun Chen, Meng Fang