DFA-RAG: Conversational Semantic Router for Large Language Model with Definite Finite Automaton (2402.04411v2)
Abstract: This paper introduces the retrieval-augmented LLM with Definite Finite Automaton (DFA-RAG), a novel framework designed to enhance the capabilities of conversational agents using LLMs. Traditional LLMs face challenges in generating regulated and compliant responses in special scenarios with predetermined response guidelines, like emotional support and customer service. Our framework addresses these challenges by embedding a Definite Finite Automaton (DFA), learned from training dialogues, within the LLM. This structured approach acts as a semantic router which enables the LLM to adhere to a deterministic response pathway. The routing is achieved by the retrieval-augmentation generation (RAG) strategy, which carefully selects dialogue examples aligned with the current conversational context. The advantages of DFA-RAG include an interpretable structure through human-readable DFA, context-aware retrieval for responses in conversations, and plug-and-play compatibility with existing LLMs. Extensive benchmarks validate DFA-RAG's effectiveness, indicating its potential as a valuable contribution to the conversational agent.
- Survey on chatbot design techniques in speech conversation systems. International Journal of Advanced Computer Science and Applications, 6(7), 2015.
- Chatbots: History, technology, and applications. Machine Learning with Applications, 2:100006, 2020.
- Axelbrooke, S. Customer support on twitter (kaggle competition). https://www.kaggle.com/datasets/thoughtvector/customer-support-on-twitter, 2017.
- Task-optimized adapters for an end-to-end task-oriented dialogue system. arXiv preprint arXiv:2305.02468, 2023.
- Improving access to online health information with conversational agents: a randomized controlled experiment. Journal of medical Internet research, 18(1):e1, 2016.
- Improving language models by retrieving from trillions of tokens. In International conference on machine learning, pp. 2206–2240. PMLR, 2022.
- Multiwoz–a large-scale multi-domain wizard-of-oz dataset for task-oriented dialogue modelling. arXiv preprint arXiv:1810.00278, 2018.
- Semantically conditioned dialog response generation via hierarchical disentangled self-attention. arXiv preprint arXiv:1905.12866, 2019.
- Neural machine translation with contrastive translation memories. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, pp. 3591–3601, 2022.
- Lift yourself up: Retrieval-augmented text generation with self memory. arXiv preprint arXiv:2305.02437, 2023.
- Efficient task-oriented dialogue systems with response selection as an auxiliary task. arXiv preprint arXiv:2208.07097, 2022a.
- Efficient task-oriented dialogue systems with response selection as an auxiliary task. arXiv preprint arXiv:2208.07097, 2022b.
- Parameter-efficient fine-tuning of large-scale pre-trained language models. Nature Machine Intelligence, 5(3):220–235, 2023.
- A survey for in-context learning. arXiv preprint arXiv:2301.00234, 2022.
- Alpacafarm: A simulation framework for methods that learn from human feedback, 2023.
- Fantastic rewards and how to tame them: A case study on reward learning for task-oriented dialogue systems. In The Eleventh International Conference on Learning Representations, 2022.
- Galaxy: A generative pre-trained model for task-oriented dialog with semi-supervised learning and explicit policy injection. In Proceedings of the AAAI conference on artificial intelligence, volume 36, pp. 10749–10757, 2022.
- A survey on conversational agents/chatbots classification and design techniques. In Web, Artificial Intelligence and Network Applications: Proceedings of the Workshops of the 33rd International Conference on Advanced Information Networking and Applications (WAINA-2019) 33, pp. 946–956. Springer, 2019.
- Evaluating and informing the design of chatbots. In Proceedings of the 2018 designing interactive systems conference, pp. 895–906, 2018.
- Virtual assistance in any context: A taxonomy of design elements for domain-specific chatbots. Business & Information Systems Engineering, 62:211–225, 2020.
- Knowledge graph-augmented language models for knowledge-grounded dialogue generation. arXiv preprint arXiv:2305.18846, 2023.
- Demonstrate-search-predict: Composing retrieval and language models for knowledge-intensive nlp. arXiv preprint arXiv:2212.14024, 2022.
- Automata studies. Princeton, NJ, 1956.
- Augpt: Auxiliary tasks and data augmentation for end-to-end dialogue with pre-trained language models. arXiv preprint arXiv:2102.05126, 2021.
- Lee, Y. Improving end-to-end task-oriented dialog system with a simple auxiliary task. In Findings of the Association for Computational Linguistics: EMNLP 2021, pp. 1296–1303, 2021.
- Retrieval-augmented generation for knowledge-intensive nlp tasks. Advances in Neural Information Processing Systems, 33:9459–9474, 2020.
- Finding support examples for in-context learning. In Findings of the Association for Computational Linguistics: EMNLP 2023, pp. 6219–6235, 2023.
- Towards humanlike chatbots helping users cope with stressful situations. In Computational Collective Intelligence: 11th International Conference, ICCCI 2019, Hendaye, France, September 4–6, 2019, Proceedings, Part I 11, pp. 232–243. Springer, 2019.
- Minsky, M. Some universal elements for finite automata. Automata studies, 34:117–128, 1956.
- Ret-llm: Towards a general read-write memory for large language models. arXiv preprint arXiv:2305.14322, 2023.
- OpenAI. Gpt-4 technical report. 2023.
- Training language models to follow instructions with human feedback. Advances in Neural Information Processing Systems, 35:27730–27744, 2022.
- Causal-aware safe policy improvement for task-oriented dialogue. arXiv preprint arXiv:2103.06370, 2021.
- Technical aspects of developing chatbots for medical applications: scoping review. Journal of medical Internet research, 22(12):e19127, 2020.
- Selvi, J. Exploring overfitting risks in large language models. https://research.nccgroup.com/2023/05/22/exploring-overfitting-risks-in-large-language-models/, May 2023.
- Retrieval augmentation reduces hallucination in conversation. In Findings of the Association for Computational Linguistics: EMNLP 2021, pp. 3784–3803, 2021.
- Multi-task pre-training for plug-and-play task-oriented dialogue system. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pp. 4661–4676, 2022.
- Transferable dialogue systems and user simulators. arXiv preprint arXiv:2107.11904, 2021.
- Decodingtrust: A comprehensive assessment of trustworthiness in gpt models. Advances in Neural Information Processing Systems, 2023a.
- Modelling hierarchical structure between dialogue policy and natural language generator with option framework for task-oriented dialogue system. arXiv preprint arXiv:2006.06814, 2020a.
- Multi-domain dialogue acts and response co-generation. arXiv preprint arXiv:2004.12363, 2020b.
- Chatcad: Interactive computer-aided diagnosis on medical image using large language models. arXiv preprint arXiv:2302.07257, 2023b.
- Knowledgpt: Enhancing large language models with retrieval and storage access on knowledge bases. arXiv preprint arXiv:2308.11761, 2023c.
- Retrieval-augmented multimodal language modeling. arXiv preprint arXiv:2211.12561, 2022.
- Krls: Improving end-to-end response generation in task oriented dialog with reinforced keywords learning. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, pp. 12338–12358, 2023.
- Judging llm-as-a-judge with mt-bench and chatbot arena. arXiv preprint arXiv:2306.05685, 2023.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.