Discovering New Intents with Deep Aligned Clustering (2012.08987v7)

Published 16 Dec 2020 in cs.CL and cs.AI

Abstract: Discovering new intents is a crucial task in dialogue systems. Most existing methods are limited in transferring the prior knowledge from known intents to new intents. They also have difficulties in providing high-quality supervised signals to learn clustering-friendly features for grouping unlabeled intents. In this work, we propose an effective method, Deep Aligned Clustering, to discover new intents with the aid of the limited known intent data. Firstly, we leverage a few labeled known intent samples as prior knowledge to pre-train the model. Then, we perform k-means to produce cluster assignments as pseudo-labels. Moreover, we propose an alignment strategy to tackle the label inconsistency problem during clustering assignments. Finally, we learn the intent representations under the supervision of the aligned pseudo-labels. With an unknown number of new intents, we predict the number of intent categories by eliminating low-confidence intent-wise clusters. Extensive experiments on two benchmark datasets show that our method is more robust and achieves substantial improvements over the state-of-the-art methods. The codes are released at https://github.com/thuiar/DeepAligned-Clustering.

PDF Abstract

Discovering New Intents with Deep Aligned Clustering

The paper "Discovering New Intents with Deep Aligned Clustering" addresses the critical task of intent discovery in dialogue systems, contending with the dual challenges of transferring prior knowledge from known intents to novel ones and producing high-quality supervised signals for effective clustering. The proposed approach leverages limited labeled known-intent data to enhance the clustering of unlabeled data and achieve effective intent discovery. This method could advancing the robustness and precision of dialogue systems in understanding emergent user needs.

Methodology Overview

The authors introduce Deep Aligned Clustering (DAC), an approach that utilizes labeled known-intent samples to pre-train a LLM, employing BERT for extracting intent features. Following this, k-means clustering is employed to generate cluster assignments that serve as pseudo-labels. Crucially, the alignment strategy is proposed to address label inconsistency during clustering assignments, ensuring stable and reliable pseudo-label creation for subsequent learning stages.

The methodology proceeds by determining the number of intent categories, a typical challenge when the number of novel classes is unknown. This is achieved by using cluster confidence measures, where low-confidence clusters are filtered out to yield the final number of clusters. This strategy contrasts with methods like CDAC+ and DEC, which struggle with new-intent mixed data scenarios and rely on less discriminative signals.

Experimental Results

The paper contrasts their approach against various unsupervised and semi-supervised clustering methods, including traditional clustering methods such as K-Means and more advanced deep clustering strategies such as DEC, DCN, and DeepCluster. Experiments conducted on benchmark datasets—CLINC and BANKING—demonstrate that DAC significantly outperforms these state-of-the-art approaches.

The numerical results indicate superior performance with more substantial improvements across NMI, ARI, and ACC metrics. For instance, DAC achieves over 20% improvement in NMI on the CLINC dataset compared to BERT-DTC and similarly substantial gains in ARI and ACC, affirming the robustness of the alignment strategy in clustering both known and novel intents.

Theoretical and Practical Implications

The implications of this research are multifaceted. Theoretically, it presents a robust alignment mechanism to enhance clustering consistency without the need for classifier reinitialization, a common bottleneck in deep clustering frameworks. The alignment across epochs anchors stateful knowledge transfer, enhancing the utility of self-supervised learning paradigms in unsupervised tasks.

Practically, the capability to discover new intents accurately lends itself to real-world deployment scenarios, where dialogue systems need to adapt rapidly to evolving user demands without extensive manual annotation effort. Furthermore, the ability to predict unknown class numbers broadens the adaptability of AI systems in scenarios characterized by dynamic interaction patterns and intent formulation.

Speculations on Future Developments in AI

Future work could explore extending this approach to multimodal intent recognition, incorporating contextual data beyond textual inputs. Additionally, further refinement in intent representation, perhaps integrating few-shot learning paradigms or other self-supervised mechanisms, could enhance generalization. We might also see continued research into evaluating clustering results more dynamically and reducing computation overheads. As dialogue systems continue to grow in complexity and application scope, methods such as Deep Aligned Clustering are likely to play a central role in developing more agile, intelligent, and human-like conversational AI.

PDF Markdown Bookmark Chat (Pro)

Authors (4)

Hanlei Zhang (13 papers)
Hua Xu (78 papers)
Ting-En Lin (28 papers)
Rui Lyu (3 papers)

Citations (108)

View on Semantic Scholar

Related Papers

Find Related Papers

GitHub

GitHub - thuiar/DeepAligned-Clustering: Discovering New Intents with Deep Aligned Clustering (AAAI 2021) (120 stars)

Tweets

https://twitter.com/Quantum_Stat/status/1422044497026949122