Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
41 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

K-PLUG: Knowledge-injected Pre-trained Language Model for Natural Language Understanding and Generation in E-Commerce (2104.06960v2)

Published 14 Apr 2021 in cs.CL

Abstract: Existing pre-trained LLMs (PLMs) have demonstrated the effectiveness of self-supervised learning for a broad range of NLP tasks. However, most of them are not explicitly aware of domain-specific knowledge, which is essential for downstream tasks in many domains, such as tasks in e-commerce scenarios. In this paper, we propose K-PLUG, a knowledge-injected pre-trained LLM based on the encoder-decoder transformer that can be transferred to both natural language understanding and generation tasks. We verify our method in a diverse range of e-commerce scenarios that require domain-specific knowledge. Specifically, we propose five knowledge-aware self-supervised pre-training objectives to formulate the learning of domain-specific knowledge, including e-commerce domain-specific knowledge-bases, aspects of product entities, categories of product entities, and unique selling propositions of product entities. K-PLUG achieves new state-of-the-art results on a suite of domain-specific NLP tasks, including product knowledge base completion, abstractive product summarization, and multi-turn dialogue, significantly outperforms baselines across the board, which demonstrates that the proposed method effectively learns a diverse set of domain-specific knowledge for both language understanding and generation tasks.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (8)
  1. Song Xu (10 papers)
  2. Haoran Li (166 papers)
  3. Peng Yuan (29 papers)
  4. Yujia Wang (29 papers)
  5. Youzheng Wu (32 papers)
  6. Xiaodong He (162 papers)
  7. Ying Liu (256 papers)
  8. Bowen Zhou (141 papers)
Citations (23)