
Efficient Tuning of Large Language Models for Knowledge-Grounded Dialogue Generation (2504.07754v1)

Published 10 Apr 2025 in cs.CL

Abstract: LLMs demonstrate remarkable text comprehension and generation capabilities but often lack the ability to utilize up-to-date or domain-specific knowledge not included in their training data. To address this gap, we introduce KEDiT, an efficient method for fine-tuning LLMs for knowledge-grounded dialogue generation. KEDiT operates in two main phases: first, it employs an information bottleneck to compress retrieved knowledge into learnable parameters, retaining essential information while minimizing computational overhead. Second, a lightweight knowledge-aware adapter integrates these compressed knowledge vectors into the LLM during fine-tuning, updating less than 2% of the model parameters. The experimental results on the Wizard of Wikipedia and a newly constructed PubMed-Dialog dataset demonstrate that KEDiT excels in generating contextually relevant and informative responses, outperforming competitive baselines in automatic, LLM-based, and human evaluations. This approach effectively combines the strengths of pretrained LLMs with the adaptability needed for incorporating dynamic knowledge, presenting a scalable solution for fields such as medicine.
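
To make the two-phase design described in the abstract concrete, here is a minimal PyTorch sketch of the general idea: a compressor that distills encoded retrieved knowledge into a small set of learnable bottleneck vectors, and a lightweight adapter that fuses those vectors into a frozen LLM's hidden states so that only the adapter's parameters are trained. All class names, dimensions, and the cross-attention-based fusion are illustrative assumptions, not the paper's exact architecture.

```python
# Illustrative sketch of a two-phase knowledge-compression + adapter setup
# (hypothetical names and shapes; the paper's actual design may differ).
import torch
import torch.nn as nn


class KnowledgeCompressor(nn.Module):
    """Phase 1 (sketch): compress encoded retrieved knowledge into a fixed
    number of learnable vectors, an information-bottleneck-style step."""

    def __init__(self, hidden: int = 768, num_queries: int = 16, num_heads: int = 8):
        super().__init__()
        # Learnable queries act as the bottleneck: all knowledge must
        # pass through these few vectors.
        self.queries = nn.Parameter(torch.randn(num_queries, hidden) * 0.02)
        self.attn = nn.MultiheadAttention(hidden, num_heads, batch_first=True)

    def forward(self, knowledge_states: torch.Tensor) -> torch.Tensor:
        # knowledge_states: (batch, seq_len, hidden), e.g. encoder outputs
        # over retrieved passages.
        batch = knowledge_states.size(0)
        q = self.queries.unsqueeze(0).expand(batch, -1, -1)
        compressed, _ = self.attn(q, knowledge_states, knowledge_states)
        return compressed  # (batch, num_queries, hidden)


class KnowledgeAdapter(nn.Module):
    """Phase 2 (sketch): a lightweight adapter that injects the compressed
    knowledge into a frozen LLM layer's hidden states; only this module
    (and the compressor) would be updated during fine-tuning."""

    def __init__(self, hidden: int = 768, num_heads: int = 8):
        super().__init__()
        self.cross_attn = nn.MultiheadAttention(hidden, num_heads, batch_first=True)
        self.norm = nn.LayerNorm(hidden)

    def forward(self, hidden_states: torch.Tensor, knowledge: torch.Tensor) -> torch.Tensor:
        fused, _ = self.cross_attn(hidden_states, knowledge, knowledge)
        # Residual connection preserves the frozen base model's signal.
        return self.norm(hidden_states + fused)


if __name__ == "__main__":
    compressor, adapter = KnowledgeCompressor(), KnowledgeAdapter()
    knowledge = torch.randn(2, 128, 768)  # stand-in for encoded retrieved text
    hidden = torch.randn(2, 32, 768)      # stand-in for an LLM layer's states
    out = adapter(hidden, compressor(knowledge))
    print(out.shape)  # torch.Size([2, 32, 768])
```

Because the base LLM stays frozen and only the small compressor and adapter are trained, a setup like this keeps the trainable parameter count low, consistent with the abstract's claim of updating less than 2% of the model parameters.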

Authors (7)
  1. Bo Zhang
  2. Hui Ma
  3. Dailin Li
  4. Jian Ding
  5. Jian Wang
  6. Bo Xu
  7. Hongfei Lin