
A Hybrid Neural Network Model for Commonsense Reasoning (1907.11983v1)

Published 27 Jul 2019 in cs.CL

Abstract: This paper proposes a hybrid neural network (HNN) model for commonsense reasoning. An HNN consists of two component models, a masked language model and a semantic similarity model, which share a BERT-based contextual encoder but use different model-specific input and output layers. HNN obtains new state-of-the-art results on three classic commonsense reasoning tasks, pushing the WNLI benchmark to 89%, the Winograd Schema Challenge (WSC) benchmark to 75.1%, and the PDP60 benchmark to 90.0%. An ablation study shows that language models and semantic similarity models are complementary approaches to commonsense reasoning, and HNN effectively combines the strengths of both. The code and pre-trained models will be publicly available at https://github.com/namisan/mt-dnn.

Authors (4)
  1. Pengcheng He (60 papers)
  2. Xiaodong Liu (162 papers)
  3. Weizhu Chen (128 papers)
  4. Jianfeng Gao (344 papers)
Citations (29)

Summary

A Hybrid Neural Network Model for Commonsense Reasoning

The paper "A Hybrid Neural Network Model for Commonsense Reasoning" presents a novel approach to enhance performance in commonsense reasoning tasks. The authors introduce a Hybrid Neural Network (HNN) model that synergizes two different methodologies: a masked language model (MLM) and a semantic similarity model (SSM). Both components share a BERT-based contextual encoder, yet they utilize distinct model-specific input and output layers. This combination effectively leverages the strengths of both approaches, leading to superior results.

Contribution and Methodology

Commonsense reasoning remains a significant challenge in Natural Language Understanding (NLU). To address this, the authors employ a hybrid approach that draws from two prevalent model categories:

  1. Masked Language Models (MLMs): These models score each candidate by replacing the ambiguous pronoun with that antecedent and computing the likelihood of the resulting sequence, then select the candidate with the highest likelihood.
  2. Semantic Similarity Models (SSMs): These models evaluate semantic relatedness between potential antecedents and pronouns within their context.
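The MLM-style candidate ranking described in the first item can be sketched as follows. This is a minimal illustration, not the paper's implementation: the masked-LM likelihood is replaced here by a toy stand-in scorer, whereas the paper uses a BERT-based masked language model to score each substituted sentence.

```python
def rank_candidates(template, candidates, score_fn):
    """Fill the pronoun slot in `template` with each candidate antecedent
    and return the candidate whose completed sentence scores highest."""
    return max(candidates, key=lambda c: score_fn(template.format(c)))


# Toy stand-in scorer (an assumption for illustration): a real
# implementation would sum masked-token log-probabilities from BERT.
def toy_score(sentence):
    return 1.0 if "the suitcase is too small" in sentence else 0.0


# Classic Winograd-style example: the pronoun slot is marked with {}.
template = ("The trophy doesn't fit in the suitcase "
            "because {} is too small.")
winner = rank_candidates(template, ["the trophy", "the suitcase"], toy_score)
# winner == "the suitcase"
```

The substitution trick sidesteps coreference resolution entirely: the language model's fluency judgment on the completed sentence serves as the disambiguation signal.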

The proposed HNN architecture combines these two methodologies, as illustrated in Figure 1 of the paper. This integration allows for diverse inductive biases, catering to both semantic coherence and contextual similarity.

Results and Evaluation

The HNN demonstrates significant advancements on three benchmarks for commonsense reasoning—WNLI, Winograd Schema Challenge (WSC), and PDP60—achieving accuracy improvements to 89%, 75.1%, and 90.0%, respectively. These results underscore the effectiveness of integrating MLMs and SSMs. An ablation study further elucidates that both component models provide complementary capabilities, as removing either component results in a measurable decline in performance.

Implications and Future Work

The implications of this research are twofold. Practically, the improved accuracies on challenging benchmarks suggest that combining language modeling with semantic similarity assessments may enhance AI's capabilities in nuanced reasoning tasks. Theoretically, it underscores the potential of hybrid models in capturing multiple facets of language understanding.

The paper also raises prospects for extending such hybrid models to tasks where large-scale pre-trained language models underperform. Future research may explore more complex reasoning scenarios, potentially incorporating additional neural network architectures or extending beyond commonsense reasoning to address varied NLU tasks.

Conclusion

This paper presents a comprehensive hybrid approach that advances the field of commonsense reasoning by effectively merging the strengths of masked language and semantic similarity models. By substantiating the HNN's robustness through empirical evidence, it paves the way for innovations in multi-faceted reasoning tasks, offering compelling avenues for both applied and theoretical exploration in AI development.
