
Improving the Robustness of Knowledge-Grounded Dialogue via Contrastive Learning (2401.04361v1)

Published 9 Jan 2024 in cs.CL and cs.AI

Abstract: Knowledge-grounded dialogue (KGD) learns to generate an informative response based on a given dialogue context and external knowledge (e.g., knowledge graphs; KGs). Recently, the emergence of LLMs and pre-training techniques has brought great success to knowledge-grounded dialogue. However, KGD systems built for real applications inevitably face various kinds of real-world noise. For example, the dialogue context might involve perturbations such as misspellings and abbreviations. In addition, KGs are typically incomplete and might contain erroneous and outdated facts. Such real-world noise challenges the robustness of KGD systems and hinders their application in the real world. In this paper, we propose an entity-based contrastive learning framework for improving the robustness of KGD. Specifically, we make use of the entity information in a KGD sample to create both its positive and negative samples, which involve semantic-irrelevant and semantic-relevant perturbations, respectively. The contrastive learning framework ensures the KGD model is aware of these two types of perturbations, so that it can generate informative responses from the potentially noisy inputs seen in real applications. Experimental results on three benchmark datasets show that our method achieves new state-of-the-art performance in terms of automatic evaluation scores, verifying its effectiveness and potential. Furthermore, we show that our method can generate better responses than comparison models in both the noisy and the few-shot settings.
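
The idea of entity-based positive and negative samples can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the character-swap misspelling, the entity-replacement rule, the single-negative InfoNCE-style loss, and the temperature value are all assumptions chosen for illustration, and the vectors stand in for whatever representations a KGD model would produce.

```python
import math

def make_positive(tokens, entities):
    """Assumed semantic-irrelevant perturbation: simulate a misspelling by
    swapping two inner characters of the first long non-entity token."""
    out = list(tokens)
    for i, tok in enumerate(out):
        if tok not in entities and len(tok) >= 4:
            out[i] = tok[0] + tok[2] + tok[1] + tok[3:]
            break
    return out

def make_negative(tokens, entities, replacement):
    """Assumed semantic-relevant perturbation: replace each entity mention
    with a different entity, which changes the meaning of the sample."""
    return [replacement if tok in entities else tok for tok in tokens]

def cosine(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def contrastive_loss(anchor, positive, negative, temperature=0.1):
    """InfoNCE-style loss with one positive and one negative (hypothetical
    setup): small when the anchor is closer to the positive."""
    pos = math.exp(cosine(anchor, positive) / temperature)
    neg = math.exp(cosine(anchor, negative) / temperature)
    return -math.log(pos / (pos + neg))

# Toy usage with a hypothetical utterance and entity set.
tokens = ["who", "directed", "Inception"]
entities = {"Inception"}
positive_tokens = make_positive(tokens, entities)
negative_tokens = make_negative(tokens, entities, "Titanic")
```

In this sketch the positive sample keeps the entity intact while perturbing surface form, and the negative sample keeps surface form while changing the entity, so minimizing the loss would pull the anchor toward the former and away from the latter.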

Authors (7)
  1. Jiaan Wang
  2. Jianfeng Qu
  3. Kexin Wang
  4. Zhixu Li
  5. Wen Hua
  6. Ximing Li
  7. An Liu