Knowledgeable Reader: Enhancing Cloze-Style Reading Comprehension with External Commonsense Knowledge
The paper "Knowledgeable Reader: Enhancing Cloze-Style Reading Comprehension with External Commonsense Knowledge" by Todor Mihaylov and Anette Frank from Heidelberg University proposes a neural reading comprehension model that improves performance on cloze-style tasks by incorporating external commonsense knowledge. The approach augments the usual document-to-question interaction with background knowledge that is not stated in the text, allowing the model to draw on information the document leaves implicit when inferring the answer.
Model Overview
The proposed model extends the Attention Sum Reader (AS Reader), a strong baseline for single-hop cloze-style reading comprehension. It adds a key-value memory that stores pre-selected commonsense facts and incorporates them into the context representations of the document and the question. Because the external knowledge is included explicitly, the model can fill information gaps in the document while also leaving a traceable record of which facts were attended to during inference.
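To make the base architecture concrete, the following is a minimal sketch of the AS Reader's pointer-sum answer prediction in PyTorch. The tensor shapes, function name, and dot-product scoring are illustrative assumptions, not the authors' exact implementation.

```python
# Minimal sketch of AS Reader-style answer prediction (pointer-sum attention).
import torch
import torch.nn.functional as F

def attention_sum(doc_enc, query_enc, doc_token_ids, candidate_ids):
    """
    doc_enc:        (batch, doc_len, hidden)  contextual document token encodings
    query_enc:      (batch, hidden)           single vector for the cloze question
    doc_token_ids:  (batch, doc_len)          vocabulary ids of document tokens
    candidate_ids:  (batch, n_cand)           vocabulary ids of answer candidates
    returns:        (batch, n_cand)           probability of each candidate
    """
    # Dot-product attention of the question over every document token.
    scores = torch.einsum('bth,bh->bt', doc_enc, query_enc)   # (batch, doc_len)
    probs = F.softmax(scores, dim=-1)                          # attention distribution

    # Pointer-sum: a candidate's score is the total attention mass
    # over all of its occurrences in the document.
    occurrence = (doc_token_ids.unsqueeze(1) == candidate_ids.unsqueeze(2)).float()
    cand_probs = torch.einsum('bct,bt->bc', occurrence, probs)
    return cand_probs / cand_probs.sum(dim=-1, keepdim=True).clamp(min=1e-8)
```

The key-value memory described in the paper enriches the token encodings that feed into this prediction step, as sketched in the next section.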
Knowledge Retrieval and Integration
A central component of the model is retrieving relevant facts from knowledge bases such as ConceptNet and WordNet and encoding them so that they are compatible with the contextual representations of document and question tokens. During reading, the model attends to these facts according to their relevance to each token and uses the attended knowledge to enrich the document and question representations. A combination mechanism merges the context and knowledge representations, and the enriched encodings then feed into the answer prediction step.
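The sketch below illustrates one plausible reading of this integration step: attending over retrieved fact keys with a token's contextual encoding as query, reading out the corresponding values, and merging the result back into the token representation. The gating mechanism and all names are illustrative assumptions rather than the paper's exact formulation.

```python
# Sketch of enriching a token representation with retrieved commonsense facts
# via key-value memory attention, followed by a learned combination (assumed gate).
import torch
import torch.nn as nn
import torch.nn.functional as F

class KnowledgeEnricher(nn.Module):
    def __init__(self, hidden):
        super().__init__()
        self.gate = nn.Linear(2 * hidden, hidden)  # combines context and knowledge

    def forward(self, token_ctx, fact_keys, fact_values, fact_mask):
        """
        token_ctx:   (batch, hidden)            contextual encoding of one token
        fact_keys:   (batch, n_facts, hidden)   encoded fact keys (e.g. subject + relation)
        fact_values: (batch, n_facts, hidden)   encoded fact values (e.g. object phrase)
        fact_mask:   (batch, n_facts)           1 for real facts, 0 for padding
        """
        # Attend over the retrieved facts with the token's context as query.
        scores = torch.einsum('bfh,bh->bf', fact_keys, token_ctx)
        scores = scores.masked_fill(fact_mask == 0, float('-inf'))
        attn = F.softmax(scores, dim=-1)                        # (batch, n_facts)
        knowledge = torch.einsum('bf,bfh->bh', attn, fact_values)

        # Gate controls how much external knowledge flows into the final representation.
        g = torch.sigmoid(self.gate(torch.cat([token_ctx, knowledge], dim=-1)))
        return g * knowledge + (1 - g) * token_ctx
```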
Empirical Results
The authors conducted extensive experiments on the Children's Book Test (CBT), evaluating on its two subsets: Common Nouns (CN) and Named Entities (NE). Incorporating commonsense knowledge clearly improved performance on the CN subset, which is harder for context-only readers than NE. The approach reduced the error rate markedly compared to context-only models, supporting the value of integrating external knowledge in cloze-style reading comprehension.
Component Analysis and Ablation Studies
To understand the contributions of individual components, the authors performed ablation studies and component-level analyses. Combining context-only and context-plus-knowledge representations in the document-to-question interaction gave the best predictions. Key-value memory strategies and fact-selection thresholds were also explored; the results show that how facts are stored and how many are admitted both affect how useful the retrieved knowledge turns out to be.
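As a rough illustration of one such combination, the snippet below merges document-to-question attention scores computed from context-only and knowledge-enriched encodings before normalising and pointer-summing over candidates. The additive merge is an assumption for illustration; the paper evaluates several combination variants.

```python
# Sketch of one ablation setting: merging attention scores from context-only
# and knowledge-enriched document encodings before answer prediction.
import torch
import torch.nn.functional as F

def combined_candidate_probs(scores_ctx, scores_ctx_kn, occurrence):
    """
    scores_ctx:    (batch, doc_len)          scores from context-only encodings
    scores_ctx_kn: (batch, doc_len)          scores from knowledge-enriched encodings
    occurrence:    (batch, n_cand, doc_len)  candidate occurrence mask
    """
    probs = F.softmax(scores_ctx + scores_ctx_kn, dim=-1)   # merge before normalising
    cand = torch.einsum('bct,bt->bc', occurrence, probs)
    return cand / cand.sum(dim=-1, keepdim=True).clamp(min=1e-8)
```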
Future Directions
The paper points to future work on refining how external commonsense knowledge is integrated into reading comprehension models. Because the attention-based architecture makes the model's use of knowledge transparent, the interaction between context and external knowledge sources can be analysed directly and improved in a targeted way. This line of inquiry also holds promise for other task settings, including textual entailment and broader question answering.
In conclusion, the work by Mihaylov and Frank is a substantial step forward for cloze-style reading comprehension through its systematic use of external commonsense knowledge. It opens pathways for addressing semantic gaps in natural language understanding by letting neural models draw on real-world knowledge that the input text leaves unstated.