Joint Intent Detection and Slot Filling via a Bi-directional Interrelated Model
This essay explores the research conducted by Haihong E, Peiqing Niu, Zhongfu Chen, and Meina Song on the joint tasks of intent detection (ID) and slot filling (SF) in spoken language understanding (SLU) systems. The authors introduce a bi-directional interrelated model, the SF-ID network, designed to exploit the interdependencies between the two tasks and thereby improve semantic frame accuracy and overall task performance.
The SF-ID network comprises two subnets: the SF subnet, which incorporates intent information into slot filling, and the ID subnet, which integrates slot information into intent detection. This bi-directional mechanism binds the two tasks together so that each reinforces the other. A distinguishing feature of the model is its iteration mechanism, which strengthens the connection between intent and slots over repeated rounds. Experimental evaluations show performance improvements over prevailing models on two benchmark datasets, ATIS and Snips.
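The interplay between the two subnets can be illustrated with a minimal sketch. The additive fusion, the tanh nonlinearity, and the function names below are illustrative assumptions for clarity, not the paper's exact equations:

```python
import numpy as np

def sf_subnet(slot_ctx, intent_ctx):
    """SF subnet sketch: fuse intent information into every slot position
    and emit per-token slot reinforce vectors. (Additive fusion here is an
    assumption; the paper uses learned combinations.)"""
    fused = np.tanh(slot_ctx + intent_ctx)      # (T, d): intent broadcast over tokens
    weights = np.exp(fused.sum(axis=1))         # unnormalized per-token scores
    weights = weights / weights.sum()           # (T,): normalized attention weights
    return weights[:, None] * fused             # (T, d): slot reinforce vectors

def id_subnet(slot_reinforce, intent_ctx):
    """ID subnet sketch: fold aggregated slot information back into the
    utterance-level intent representation."""
    return np.tanh(intent_ctx + slot_reinforce.sum(axis=0))  # (d,)

# Toy inputs standing in for the contexts a real encoder would produce.
T, d = 5, 8                                     # sequence length, hidden width
rng = np.random.default_rng(0)
slot_ctx = rng.standard_normal((T, d))          # per-token slot context
intent_ctx = rng.standard_normal(d)             # utterance-level intent context

r_slot = sf_subnet(slot_ctx, intent_ctx)        # intent informs slot filling
r_intent = id_subnet(r_slot, intent_ctx)        # slots inform intent detection
```

The point of the sketch is the data flow, not the arithmetic: intent information enters the slot representation, and the resulting slot reinforce vectors flow back into the intent representation.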
Model Architecture
The SF-ID network builds on a bi-directional LSTM (BLSTM) encoder, using an attention mechanism to extract context representations for both slots and intent. The attention mechanism captures dependencies among words, improving how semantic meaning is derived from user utterances. The model operates in two modes, SF-First and ID-First, which dictate the order in which the subnets process information. Crucially, the iteration mechanism refines the relationship between slot and intent information over repeated rounds, dynamically adjusting the reinforce vectors and ultimately improving prediction accuracy. The authors also add a CRF layer, which is particularly advantageous for the sequence-labeling nature of the SF task and improves the reliability of slot predictions.
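The context-extraction step can be sketched as attention over the BLSTM hidden states. Dot-product scoring below is a stand-in for the paper's learned attention weights, and the toy `hidden` matrix stands in for real BLSTM outputs:

```python
import numpy as np

def softmax(x):
    e = np.exp(x - x.max())
    return e / e.sum()

def slot_context(hidden):
    """Per-token slot context: each position attends over all hidden states.
    (Dot-product scoring is an assumption; the paper learns the weights.)"""
    scores = hidden @ hidden.T                      # (T, T): pairwise scores
    weights = np.apply_along_axis(softmax, 1, scores)
    return weights @ hidden                         # (T, d): one context per token

def intent_context(hidden):
    """Utterance-level intent context: attention-pool the sequence into one vector."""
    scores = hidden.sum(axis=1)                     # (T,): per-token scores
    return softmax(scores) @ hidden                 # (d,): pooled context

T, d = 6, 4                                         # tokens, hidden width
rng = np.random.default_rng(1)
hidden = rng.standard_normal((T, d))                # stands in for BLSTM hidden states

c_slot = slot_context(hidden)                       # feeds the SF subnet
c_intent = intent_context(hidden)                   # feeds the ID subnet
```

Slot filling needs a context per token (it labels every word), while intent detection needs a single pooled context for the whole utterance, which is why the two functions return different shapes.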
Results and Analysis
The proposed model demonstrates superior performance on the ATIS and Snips datasets, with improved slot-filling F1 scores, intent accuracy, and sentence-level semantic frame accuracy. Notably, sentence-level semantic frame accuracy sees relative improvements of 3.79% on ATIS and 5.42% on Snips over existing state-of-the-art models. The SF-ID network's strengths are evident in its handling of the complex interactions between the SF and ID tasks, owing to the bi-directional information flow and the iteration mechanism.
The iteration mechanism specifically benefits model performance by incrementally refining predictions through repeated processing of the reinforce vectors. Analysis of different configurations shows that although the ID-First and SF-First modes each favor a particular task, the consistent gains across metrics demonstrate the efficacy of the bi-directional approach. Furthermore, the CRF layer leverages sequential dependencies between labels, markedly improving slot filling outcomes.
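The iterative refinement described above can be sketched as a fixed number of alternating SF and ID steps (shown here in SF-First order; the additive fusion and tanh are illustrative assumptions, not the paper's exact update rules):

```python
import numpy as np

def iterate_sf_id(slot_ctx, intent_ctx, n_iter=3):
    """Iteration mechanism sketch, SF-First order: each subnet's reinforce
    vector feeds the other subnet for a fixed number of rounds."""
    r_intent = intent_ctx
    for _ in range(n_iter):
        # SF step: current intent information refines the slot representation.
        r_slot = np.tanh(slot_ctx + r_intent)                 # (T, d)
        # ID step: aggregated slot information refines the intent vector.
        r_intent = np.tanh(intent_ctx + r_slot.mean(axis=0))  # (d,)
    return r_slot, r_intent

# Toy contexts standing in for encoder outputs.
T, d = 4, 6
rng = np.random.default_rng(2)
slot_ctx = rng.standard_normal((T, d))
intent_ctx = rng.standard_normal(d)

r_slot, r_intent = iterate_sf_id(slot_ctx, intent_ctx, n_iter=3)
```

In ID-First mode the loop body would simply run the ID step before the SF step; in the full model the final `r_slot` would feed the CRF layer for slot labeling and `r_intent` would feed the intent classifier.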
Implications and Future Directions
The research presented in this paper provides pivotal insights into the integration and co-learning of intent detection and slot filling. By demonstrating a model that capitalizes on task interrelation via a bi-directional mechanism, the work contributes a significant methodological advancement to natural language understanding. The implications span both theoretical and practical realms, offering a framework for future exploration of interaction mechanisms in SLU and related fields.
Future explorations could delve into optimizing iterative processes for other interconnected tasks within AI or expanding the model's adaptability to diverse SLU applications. Researchers may also investigate augmenting the current model with additional contextual features or external knowledge sources to further refine task performance and application scope.