Enhancing Dialogue State Tracking Models through LLM-backed User-Agents Simulation (2405.13037v1)
Abstract: Dialogue State Tracking (DST) monitors the evolving dialogue state over the course of a conversation and plays a pivotal role in task-oriented dialogue systems. However, obtaining annotated data for the DST task is usually costly. In this paper, we focus on using LLMs to generate dialogue data in order to reduce dialogue collection and annotation costs. Specifically, GPT-4 is used to simulate the interaction between a user and an agent, generating thousands of dialogues annotated with DST labels. LLaMA 2 is then fine-tuned in two stages, on the generated data and on the real data, for DST prediction. Experimental results on two public DST benchmarks show that, with the generated dialogue data, our model performs better than the baseline trained solely on real data. In addition, our approach can adapt to the dynamic demands of real-world scenarios by swiftly generating dialogues in new domains. After replacing the dialogue segments of any domain with the corresponding generated ones, the model achieves performance comparable to that of the model trained on real data.
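To make the simulation idea in the abstract concrete, the following is a minimal sketch of a user-agent loop that emits a DST label (the accumulated slot-value state) after every turn. It is not the authors' implementation: in the paper both roles are played by GPT-4, whereas here `make_scripted_user` and `echo_agent` are hypothetical stand-ins so the example runs offline, and the MultiWOZ-style slot names are illustrative assumptions.

```python
from typing import Callable, Dict, List, Optional, Tuple

State = Dict[str, str]
UserFn = Callable[[List[str], State], Tuple[Optional[str], State]]
AgentFn = Callable[[List[str], State], str]


def simulate_dialogue(user_fn: UserFn, agent_fn: AgentFn, max_turns: int = 4) -> List[dict]:
    """Alternate user and agent turns, recording the dialogue state per turn."""
    history: List[str] = []
    state: State = {}
    turns: List[dict] = []
    for _ in range(max_turns):
        utterance, slot_updates = user_fn(history, state)
        if utterance is None:  # the simulated user has nothing more to say
            break
        # The DST label is cumulative: new slot values overwrite old ones.
        state = {**state, **slot_updates}
        history.append("USER: " + utterance)
        reply = agent_fn(history, state)
        history.append("AGENT: " + reply)
        turns.append({"user": utterance, "agent": reply, "state": dict(state)})
    return turns


def make_scripted_user(script: List[Tuple[str, State]]) -> UserFn:
    """Hypothetical stand-in for an LLM-simulated user: replays a fixed script."""
    it = iter(script)

    def user_fn(history: List[str], state: State) -> Tuple[Optional[str], State]:
        return next(it, (None, {}))

    return user_fn


def echo_agent(history: List[str], state: State) -> str:
    """Hypothetical stand-in for an LLM-simulated agent: restates the state."""
    return "Noted: " + ", ".join(f"{k}={v}" for k, v in sorted(state.items()))


script = [
    ("I need a cheap hotel.", {"hotel-pricerange": "cheap"}),
    ("In the north, please.", {"hotel-area": "north"}),
]
turns = simulate_dialogue(make_scripted_user(script), echo_agent)
for turn in turns:
    print(turn["state"])
```

Each record pairs the utterances of a turn with the full state up to that point, which is the shape of supervision a DST model is typically trained on. Swapping the two stubs for real LLM calls would be the natural next step, under whatever prompting scheme one chooses.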
- Cheng Niu
- Xingguang Wang
- Xuxin Cheng
- Juntong Song
- Tong Zhang