Papers
Topics
Authors
Recent
Search
2000 character limit reached

Synergizing In-context Learning with Hints for End-to-end Task-oriented Dialog Systems

Published 24 May 2024 in cs.CL | (2405.15585v3)

Abstract: End-to-end Task-Oriented Dialog (TOD) systems typically require extensive training datasets to perform well. In contrast, LLM based TOD systems can excel even with limited data due to their ability to learn tasks through in-context exemplars. However, these models lack alignment with the style of responses in training data and often generate comprehensive responses, making it difficult for users to grasp the information quickly. In response, we propose SyncTOD that synergizes LLMs with task-specific hints to improve alignment in low-data settings. SyncTOD employs small auxiliary models to provide hints and select exemplars for in-context prompts. With ChatGPT, SyncTOD achieves superior performance compared to LLM-based baselines and SoTA models in low-data settings, while retaining competitive performance in full-data settings.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (46)
  1. A multitask, multilingual, multimodal evaluation of chatgpt on reasoning, hallucination, and interactivity. ArXiv, abs/2302.04023.
  2. Multiwoz - a large-scale multi-domain wizard-of-oz dataset for task-oriented dialogue modelling. In Conference on Empirical Methods in Natural Language Processing.
  3. Scaling instruction-finetuned language models. ArXiv, abs/2210.11416.
  4. Key-value retrieval networks for task-oriented dialogue. ArXiv, abs/1705.05414.
  5. Google. 2023. Palm 2 technical report. ArXiv, abs/2305.10403.
  6. Realm: Retrieval-augmented language model pre-training. ArXiv, abs/2002.08909.
  7. Debertav3: Improving deberta using electra-style pre-training with gradient-disentangled embedding sharing.
  8. Fg2seq: Effectively encoding knowledge for end-to-end task-oriented dialog. ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 8029–8033.
  9. Task-oriented dialog generation with enhanced entity representation. In Interspeech.
  10. A simple language model for task-oriented dialogue. ArXiv, abs/2005.00796.
  11. In-context learning for few-shot dialogue state tracking. In Conference on Empirical Methods in Natural Language Processing.
  12. Autoregressive entity generation for end-to-end task-oriented dialog. ArXiv, abs/2209.08708.
  13. Vojtech Hudecek and Ondrej Dusek. 2023. Are llms all you need for task-oriented dialogue? ArXiv, abs/2304.06556.
  14. Prompting for explanations improves adversarial nli. is this true? {Yes} it is {true} because {it weakens superficial cues}. In Findings.
  15. Retrieval-augmented generation for knowledge-intensive nlp tasks. ArXiv, abs/2005.11401.
  16. Guiding large language models via directional stimulus prompting. ArXiv, abs/2302.11520.
  17. Rensis Likert. 1932. A technique for the measurement of attitude scales.
  18. Bitod: A bilingual multi-domain dataset for task-oriented dialogue modeling. arXiv preprint arXiv:2106.02787.
  19. What makes good in-context examples for gpt-3? In Workshop on Knowledge Extraction and Integration for Deep Learning Architectures; Deep Learning Inside Out.
  20. Ilya Loshchilov and Frank Hutter. 2017. Decoupled weight decay regularization. In International Conference on Learning Representations.
  21. Learning knowledge bases with parameters for task-oriented dialogue systems. ArXiv, abs/2009.13656.
  22. Mem2seq: Effectively incorporating knowledge bases into end-to-end task-oriented dialog systems. ArXiv, abs/1804.08217.
  23. Using in-context learning to improve dialogue safety. ArXiv, abs/2302.00871.
  24. Rodrigo Nogueira and Kyunghyun Cho. 2019. Passage re-ranking with bert. ArXiv, abs/1901.04085.
  25. Bleu: a method for automatic evaluation of machine translation. In Annual Meeting of the Association for Computational Linguistics.
  26. Entity-consistent end-to-end task-oriented dialogue system with kb retriever. ArXiv, abs/1909.06762.
  27. End-to-end task-oriented dialogue: A survey of tasks, methods, and future directions.
  28. Dynamic fusion network for multi-domain end-to-end task-oriented dialog. In Annual Meeting of the Association for Computational Linguistics.
  29. Constraint based knowledge base distillation in end-to-end task oriented dialogs. ArXiv, abs/2109.07396.
  30. In-context retrieval-augmented language models. Transactions of the Association for Computational Linguistics, 11:1316–1331.
  31. A network-based end-to-end trainable task-oriented dialogue system. In Conference of the European Chapter of the Association for Computational Linguistics.
  32. Dialokg: Knowledge-structure aware task-oriented dialogue generation. ArXiv, abs/2204.09149.
  33. Replug: Retrieval-augmented black-box language models. ArXiv, abs/2301.12652.
  34. Q-tod: A query-driven task-oriented dialogue system. In Conference on Empirical Methods in Natural Language Processing.
  35. Llama 2: Open foundation and fine-tuned chat models. ArXiv, abs/2307.09288.
  36. Multi-grained knowledge retrieval for end-to-end task-oriented dialog. In Annual Meeting of the Association for Computational Linguistics.
  37. Learning to retrieve in-context examples for large language models. ArXiv, abs/2307.07164.
  38. Sequence-to-sequence learning for task-oriented dialogue with dialogue state representation. In International Conference on Computational Linguistics.
  39. Global-to-local memory pointer networks for task-oriented dialogue. ArXiv, abs/1901.04713.
  40. Graphmemdialog: Optimizing end-to-end task-oriented dialog systems using graph memory networks. In AAAI Conference on Artificial Intelligence.
  41. C-pack: Packaged resources to advance general chinese embedding.
  42. Unifiedskg: Unifying and multi-tasking structured knowledge grounding with text-to-text language models. ArXiv, abs/2201.05966.
  43. Pomdp-based statistical spoken dialog systems: A review. Proceedings of the IEEE, 101:1160–1179.
  44. Enhancing performance on seen and unseen dialogue scenarios using retrieval-augmented end-to-end task-oriented system. In SIGDIAL Conferences.
  45. Retrieve anything to augment large language models. ArXiv, abs/2310.07554.
  46. Chatbot arena: Benchmarking llms in the wild with elo ratings. https://lmsys.org/blog/2023-05-03-arena/.
Citations (1)

Summary

Paper to Video (Beta)

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Continue Learning

We haven't generated follow-up questions for this paper yet.

Collections

Sign up for free to add this paper to one or more collections.

Tweets

Sign up for free to view the 3 tweets with 32 likes about this paper.