
CausalBERT: Injecting Causal Knowledge Into Pre-trained Models with Minimal Supervision (2107.09852v2)

Published 21 Jul 2021 in cs.CL

Abstract: Recent work has shown success in incorporating pre-trained models like BERT to improve NLP systems. However, existing pre-trained models lack causal knowledge, which prevents today's NLP systems from thinking like humans. In this paper, we investigate the problem of injecting causal knowledge into pre-trained models. There are two fundamental problems: 1) how to collect various granularities of causal pairs from unstructured texts; 2) how to effectively inject causal knowledge into pre-trained models. To address these issues, we extend the idea of CausalBERT from previous studies and conduct experiments on various datasets to evaluate its effectiveness. In addition, we adopt a regularization-based method that adds an extra regularization term to preserve the previously learned knowledge while injecting causal knowledge. Extensive experiments on 7 datasets, including four causal pair classification tasks, two causal QA tasks, and a causal inference task, demonstrate that CausalBERT captures rich causal knowledge and outperforms all pre-trained model-based state-of-the-art methods, setting a new state of the art on the causal inference benchmark.
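The abstract does not spell out the exact form of the regularization term, so the snippet below is only a minimal sketch of one common way to realize "preserve the already learned knowledge while fine-tuning": an L2 penalty that anchors parameters to their pre-trained values during training on causal-pair classification. The model name, dataset/batch format, and `lambda_reg` value are illustrative assumptions, not the paper's reported setup.

```python
# Minimal sketch (not the paper's exact method): fine-tune BERT on causal-pair
# classification while adding an L2 penalty toward the original pre-trained
# weights, one way to preserve previously learned knowledge during injection.
# Assumes the Hugging Face `transformers` and `torch` packages are installed.
import torch
from transformers import BertForSequenceClassification

model = BertForSequenceClassification.from_pretrained(
    "bert-base-uncased", num_labels=2  # binary causal / non-causal pair labels (assumed)
)

# Snapshot of the pre-trained weights, used as the regularization anchor.
pretrained_params = {name: p.detach().clone() for name, p in model.named_parameters()}

optimizer = torch.optim.AdamW(model.parameters(), lr=2e-5)
lambda_reg = 0.01  # strength of the knowledge-preservation penalty (hypothetical value)

def training_step(batch):
    """One update: task loss on a causal-pair batch plus the anchoring penalty."""
    outputs = model(
        input_ids=batch["input_ids"],
        attention_mask=batch["attention_mask"],
        labels=batch["labels"],
    )
    # Extra regularization term: squared distance from the pre-trained weights.
    reg = sum(
        ((p - pretrained_params[name]) ** 2).sum()
        for name, p in model.named_parameters()
    )
    loss = outputs.loss + lambda_reg * reg
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```

The anchoring-to-pretrained-weights idea (in the spirit of L2-SP style regularizers) is chosen here purely for illustration; the paper's actual regularizer and training objective should be taken from the full text.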

Authors (5)
  1. Zhongyang Li (75 papers)
  2. Xiao Ding (38 papers)
  3. Kuo Liao (5 papers)
  4. Bing Qin (186 papers)
  5. Ting Liu (329 papers)
Citations (15)
