A Survey of Knowledge Enhanced Pre-trained Language Models (2211.05994v4)

Published 11 Nov 2022 in cs.CL

Abstract: Pre-trained Language Models (PLMs), which are trained on large text corpora via self-supervised learning, have yielded promising performance on various NLP tasks. However, although PLMs with huge numbers of parameters can capture rich knowledge from massive training text and benefit downstream tasks at the fine-tuning stage, they still have limitations such as poor reasoning ability due to the lack of external knowledge. Research has therefore been dedicated to incorporating knowledge into PLMs to tackle these issues. In this paper, we present a comprehensive review of Knowledge Enhanced Pre-trained Language Models (KE-PLMs) to provide a clear insight into this thriving field. We introduce appropriate taxonomies for Natural Language Understanding (NLU) and Natural Language Generation (NLG), respectively, to highlight these two main tasks of NLP. For NLU, we divide the types of knowledge into four categories: linguistic knowledge, text knowledge, knowledge graph (KG), and rule knowledge. The KE-PLMs for NLG are categorized into KG-based and retrieval-based methods. Finally, we point out some promising future directions for KE-PLMs.
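
To make the retrieval-based NLG category concrete, below is a minimal, illustrative sketch of the general idea: external text knowledge is retrieved for an input and prepended to the prompt before generation. The knowledge store, the bag-of-words scorer, and the generate() stub are all assumptions for illustration, not the survey's method or any specific model's API.

```python
# Illustrative sketch of retrieval-based knowledge enhancement for generation.
# All names here (KNOWLEDGE, score, retrieve, generate) are hypothetical.

from collections import Counter

# Toy external knowledge store (stand-in for a retrieval corpus or KG-derived text).
KNOWLEDGE = [
    "Marie Curie won Nobel Prizes in both Physics and Chemistry.",
    "The Eiffel Tower is located in Paris, France.",
    "BERT is a pre-trained language model introduced by Google in 2018.",
]

def score(query: str, passage: str) -> int:
    """Bag-of-words overlap between query and passage (a crude retriever)."""
    q = Counter(query.lower().split())
    p = Counter(passage.lower().split())
    return sum((q & p).values())

def retrieve(query: str, k: int = 1) -> list:
    """Return the top-k passages most similar to the query."""
    return sorted(KNOWLEDGE, key=lambda p: score(query, p), reverse=True)[:k]

def generate(prompt: str) -> str:
    """Placeholder for a PLM decoder; a real system would call the model here."""
    return f"[PLM output conditioned on]: {prompt}"

def knowledge_enhanced_generate(query: str) -> str:
    """Retrieve external knowledge and condition generation on it."""
    context = " ".join(retrieve(query))
    prompt = f"Context: {context}\nQuestion: {query}\nAnswer:"
    return generate(prompt)

if __name__ == "__main__":
    print(knowledge_enhanced_generate("Which prizes did Marie Curie win?"))
```

In practice the retriever would be a dense or sparse index over a large corpus and the generator a pre-trained language model, but the control flow (retrieve, then condition generation on the retrieved evidence) is the same.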

Authors (6)
  1. Linmei Hu (14 papers)
  2. Zeyi Liu (21 papers)
  3. Ziwang Zhao (4 papers)
  4. Lei Hou (127 papers)
  5. Liqiang Nie (191 papers)
  6. Juanzi Li (144 papers)
Citations (99)