Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Learning to Ask Unanswerable Questions for Machine Reading Comprehension (1906.06045v1)

Published 14 Jun 2019 in cs.CL

Abstract: Machine reading comprehension with unanswerable questions is a challenging task. In this work, we propose a data augmentation technique by automatically generating relevant unanswerable questions according to an answerable question paired with its corresponding paragraph that contains the answer. We introduce a pair-to-sequence model for unanswerable question generation, which effectively captures the interactions between the question and the paragraph. We also present a way to construct training data for our question generation models by leveraging the existing reading comprehension dataset. Experimental results show that the pair-to-sequence model performs consistently better compared with the sequence-to-sequence baseline. We further use the automatically generated unanswerable questions as a means of data augmentation on the SQuAD 2.0 dataset, yielding 1.9 absolute F1 improvement with BERT-base model and 1.7 absolute F1 improvement with BERT-large model.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Haichao Zhu (9 papers)
  2. Li Dong (154 papers)
  3. Furu Wei (291 papers)
  4. Wenhui Wang (47 papers)
  5. Bing Qin (186 papers)
  6. Ting Liu (329 papers)
Citations (30)

Summary

We haven't generated a summary for this paper yet.