Analysis of "Learning to Ask: Neural Question Generation for Reading Comprehension"
The paper "Learning to Ask: Neural Question Generation for Reading Comprehension" by Du, Shao, and Cardie presents a neural approach to automatic question generation (QG): producing questions from text passages that test reading comprehension, a task with substantial applications in education and beyond.
The authors introduce an end-to-end sequence-to-sequence model with a global attention mechanism. This departs notably from previous QG methods, which relied largely on rule-based systems. The key innovation is learning the task directly from data, bypassing handcrafted rules and extensive NLP pipelines and thereby improving the system's adaptability and performance.
Methodology
The proposed model encodes the input sentence, and optionally its surrounding paragraph, with a bidirectional recurrent neural network (RNN) built from Long Short-Term Memory (LSTM) cells; an LSTM decoder with global attention then generates the question one word at a time. The attention mechanism lets the model focus on the most relevant parts of the input text when producing each output word. The architecture adapts techniques proven in neural machine translation and abstractive summarization to the distinct challenges of QG.
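To make the global attention step concrete, here is a minimal pure-Python sketch of Luong-style dot-product attention over toy vectors. All names and the toy numbers are hypothetical illustrations, not the paper's actual implementation, which also involves learned projection layers and trained LSTM states.

```python
import math

def softmax(xs):
    # Numerically stable softmax over a list of scores.
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def global_attention(decoder_state, encoder_states):
    """Score every encoder hidden state against the current decoder
    state, normalize the scores with softmax, and return the
    attention-weighted sum (the context vector) plus the weights."""
    scores = [dot(decoder_state, h) for h in encoder_states]
    weights = softmax(scores)
    dim = len(decoder_state)
    context = [sum(w * h[i] for w, h in zip(weights, encoder_states))
               for i in range(dim)]
    return context, weights

# Toy example: three 2-dimensional encoder states, one decoder state.
enc = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
dec = [1.0, 0.0]
ctx, w = global_attention(dec, enc)
```

States that align with the decoder query (here the first and third) receive higher weights, so the context vector is dominated by the most relevant parts of the input, which is exactly what lets the decoder "look back" at the source when emitting each question word.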
Two variations of the model are explored: one that processes sentence-level context and another that incorporates paragraph-level context. The latter aims to utilize broader contextual cues, although it introduces additional complexity.
Results
Evaluations were conducted on the Stanford Question Answering Dataset (SQuAD) using automatic metrics including BLEU, METEOR, and ROUGE-L. The proposed models significantly outperform baseline systems, including the strong rule-based overgenerate-and-rank system of Heilman and Smith. Notably, the neural QG system with pre-trained word embeddings achieves the best performance across multiple metrics, reflecting its efficacy in generating natural, grammatically sound, and challenging questions.
The paper also provides quantitative evidence that their system generates questions with greater syntactic and lexical divergence from the source text, fulfilling an important criterion for high-quality question generation. In human evaluations, the system's outputs received higher ratings for fluency, grammatical correctness, and difficulty compared to baseline outputs.
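The notion of lexical divergence from the source can be illustrated with a simple n-gram novelty score: the fraction of a question's n-grams that never appear in the source sentence. This is a hypothetical helper for intuition, not the paper's exact measurement; a question that merely copies source words scores low, while a rephrased question scores high.

```python
def ngrams(tokens, n):
    # All contiguous n-grams of a token list, as a set.
    return {tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)}

def novelty(question, source, n=2):
    """Fraction of the question's n-grams absent from the source.
    Higher values mean the question diverges more from the input
    instead of copying it verbatim."""
    q = ngrams(question.lower().split(), n)
    s = ngrams(source.lower().split(), n)
    if not q:
        return 0.0
    return len(q - s) / len(q)

source = "oxygen is used in cellular respiration"
copied = "oxygen is used in what"               # mostly copied wording
reworded = "what gas do cells need for respiration"  # rephrased

print(novelty(copied, source))    # 0.25: 1 of 4 bigrams is new
print(novelty(reworded, source))  # 1.0: every bigram is new
```

By this kind of measure, a system that paraphrases rather than copies, as the authors report theirs does, produces questions with higher novelty scores than template-style extraction.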
Implications and Future Directions
The implications of this paper are multifaceted. Practically, it shows potential for improved automated tools in educational technology, such as intelligent tutoring systems. Theoretically, it demonstrates the applicability of sequence-to-sequence models with attention mechanisms to NLP tasks beyond translation and summarization.
Future research might focus on better integrating paragraph-level context to improve question generation in more complex scenarios. Mechanisms such as copying and paraphrasing could also be explored to enhance the diversity and relevance of generated questions. The work further lays a foundation for adapting QG systems to specific domains, potentially incorporating domain knowledge to generate more contextually appropriate questions.
In summary, this paper provides a solid contribution to the field of NLP by effectively translating recent advances in neural network architectures into the domain of automatic question generation, achieving strong empirical results and opening pathways for continued research.