2000 character limit reached
Post-Training Dialogue Summarization using Pseudo-Paraphrasing (2204.13498v1)
Published 28 Apr 2022 in cs.CL
Abstract: Previous dialogue summarization techniques adapt LLMs pretrained on the narrative text by injecting dialogue-specific features into the models. These features either require additional knowledge to recognize or make the resulting models harder to tune. To bridge the format gap between dialogues and narrative summaries in dialogue summarization tasks, we propose to post-train pretrained LLMs (PLMs) to rephrase from dialogue to narratives. After that, the model is fine-tuned for dialogue summarization as usual. Comprehensive experiments show that our approach significantly improves vanilla PLMs on dialogue summarization and outperforms other SOTA models by the summary quality and implementation costs.
- Qi Jia (42 papers)
- Yizhu Liu (9 papers)
- Haifeng Tang (20 papers)
- Kenny Q. Zhu (50 papers)