BlendX: Complex Multi-Intent Detection with Blended Patterns
Abstract: Task-oriented dialogue (TOD) systems are commonly designed with the presumption that each utterance represents a single intent. However, this assumption may not accurately reflect real-world situations, where users frequently express multiple intents within a single utterance. While there is an emerging interest in multi-intent detection (MID), existing in-domain datasets such as MixATIS and MixSNIPS have limitations in their formulation. To address these issues, we present BlendX, a suite of refined datasets featuring more diverse patterns than their predecessors, elevating both their complexity and diversity. For dataset construction, we utilize both rule-based heuristics and a generative tool, OpenAI's ChatGPT, which is augmented with a similarity-driven strategy for utterance selection. To ensure the quality of the proposed datasets, we also introduce three novel metrics that assess the statistical properties of an utterance related to word count, conjunction use, and pronoun use. Extensive experiments on BlendX reveal that state-of-the-art MID models struggle with the challenges posed by the new datasets, highlighting the need to reexamine the current state of the MID field. The dataset is available at https://github.com/HYU-NLP/BlendX.