Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
41 tokens/sec
GPT-4o
60 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
8 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

A Survey of Knowledge Enhanced Pre-trained Models (2110.00269v5)

Published 1 Oct 2021 in cs.CL and cs.AI

Abstract: Pre-trained LLMs learn informative word representations on a large-scale text corpus through self-supervised learning, which has achieved promising performance in fields of NLP after fine-tuning. These models, however, suffer from poor robustness and lack of interpretability. We refer to pre-trained LLMs with knowledge injection as knowledge-enhanced pre-trained LLMs (KEPLMs). These models demonstrate deep understanding and logical reasoning and introduce interpretability. In this survey, we provide a comprehensive overview of KEPLMs in NLP. We first discuss the advancements in pre-trained LLMs and knowledge representation learning. Then we systematically categorize existing KEPLMs from three different perspectives. Finally, we outline some potential directions of KEPLMs for future research.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Jian Yang (503 papers)
  2. Xinyu Hu (32 papers)
  3. Gang Xiao (18 papers)
  4. Yulong Shen (47 papers)
Citations (1)