Reference-based Weak Supervision for Answer Sentence Selection using Web Data (2104.08943v1)

Published 18 Apr 2021 in cs.CL, cs.AI, and cs.IR

Abstract: Answer sentence selection (AS2) modeling requires annotated data, i.e., hand-labeled question-answer pairs. We present a strategy to collect weakly supervised answers for a question based on its reference to improve AS2 modeling. Specifically, we introduce Reference-based Weak Supervision (RWS), a fully automatic large-scale data pipeline that harvests high-quality weakly-supervised answers from abundant Web data requiring only a question-reference pair as input. We study the efficacy and robustness of RWS in the setting of TANDA, a recent state-of-the-art fine-tuning approach specialized for AS2. Our experiments indicate that the produced data consistently bolsters TANDA. We achieve the state of the art in terms of P@1, 90.1%, and MAP, 92.9%, on WikiQA.

PDF Abstract

Summarize Bookmark Chat (Pro)

Authors (3)

Vivek Krishnamurthy (1 paper)
Thuy Vu (13 papers)
Alessandro Moschitti (48 papers)

Citations (1)

View on Semantic Scholar

Reference-based Weak Supervision for Answer Sentence Selection using Web Data (2104.08943v1)

Related Papers