Discovering New Intents with Deep Aligned Clustering

Hanlei Zhang, Hua Xu, Ting-En Lin, Rui Lyu

2020-12-16 | Short Text Clustering | Open Intent Discovery | Text Clustering | Clustering

Abstract

Discovering new intents is a crucial task in dialogue systems. Most existing methods are limited in transferring the prior knowledge from known intents to new intents. They also struggle to provide high-quality supervised signals to learn clustering-friendly features for grouping unlabeled intents. In this work, we propose an effective method, Deep Aligned Clustering, to discover new intents with the aid of the limited known intent data. First, we leverage a few labeled known intent samples as prior knowledge to pre-train the model. Then, we perform k-means to produce cluster assignments as pseudo-labels. Moreover, we propose an alignment strategy to tackle the label inconsistency problem during clustering assignments. Finally, we learn the intent representations under the supervision of the aligned pseudo-labels. With an unknown number of new intents, we predict the number of intent categories by eliminating low-confidence intent-wise clusters. Extensive experiments on two benchmark datasets show that our method is more robust and achieves substantial improvements over the state-of-the-art methods. The code is released at https://github.com/thuiar/DeepAligned-Clustering.
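
The label inconsistency problem mentioned in the abstract arises because k-means numbers its clusters arbitrarily, so cluster 0 in one round may be cluster 2 in the next, and pseudo-labels from successive rounds cannot be compared directly. The abstract does not spell out the alignment procedure, so the following is only a minimal sketch of one plausible approach: it assumes that clusters are matched across rounds by solving a one-to-one assignment over centroid distances (Hungarian matching). The function name `align_pseudo_labels` and the toy centroids are hypothetical and are not taken from the released code.

```python
import numpy as np
from scipy.optimize import linear_sum_assignment


def align_pseudo_labels(prev_centroids, new_centroids, new_labels):
    """Relabel new clusters so their indices match the previous round.

    Assumption (not stated in the abstract): clusters are matched by
    minimizing the total Euclidean distance between centroids.
    """
    # cost[i, j] = distance between previous centroid i and new centroid j
    cost = np.linalg.norm(
        prev_centroids[:, None, :] - new_centroids[None, :, :], axis=2
    )
    # One-to-one matching with minimal total distance
    prev_idx, new_idx = linear_sum_assignment(cost)

    # mapping[j] = previous cluster index assigned to new cluster j
    mapping = np.empty(len(new_centroids), dtype=int)
    mapping[new_idx] = prev_idx

    aligned_labels = mapping[new_labels]
    # Reorder the new centroids to follow the previous indexing
    aligned_centroids = np.empty_like(new_centroids)
    aligned_centroids[prev_idx] = new_centroids[new_idx]
    return aligned_labels, aligned_centroids


# Toy data: the same three clusters, slightly shifted and returned in a new order
prev_centroids = np.array([[0.0, 0.0], [5.0, 5.0], [10.0, 0.0]])
new_centroids = np.array([[9.8, 0.1], [0.2, -0.1], [5.1, 4.9]])
new_labels = np.array([0, 0, 1, 2, 2, 1])

aligned_labels, aligned_centroids = align_pseudo_labels(
    prev_centroids, new_centroids, new_labels
)
print(aligned_labels)
```

In a training loop, the aligned labels would serve as the pseudo-labels for the next supervised update, so that a given class index refers to the same cluster from one round to the next.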

Results

| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Text Clustering | CLINC150 | ACC | 86.49 | DeepAligned |
| Text Clustering | CLINC150 | ARI | 0.7975 | DeepAligned |
| Text Clustering | CLINC150 | NMI | 0.9389 | DeepAligned |
| Text Clustering | BANKING77 | ACC | 64.9 | DeepAligned |
| Text Clustering | BANKING77 | ARI | 0.5364 | DeepAligned |
| Text Clustering | BANKING77 | NMI | 0.7956 | DeepAligned |
| Open Intent Discovery | CLINC150 | ACC | 86.49 | DeepAligned |
| Open Intent Discovery | CLINC150 | ARI | 0.7975 | DeepAligned |
| Open Intent Discovery | CLINC150 | NMI | 0.9389 | DeepAligned |
| Open Intent Discovery | BANKING77 | ACC | 64.9 | DeepAligned |
| Open Intent Discovery | BANKING77 | ARI | 0.5364 | DeepAligned |
| Open Intent Discovery | BANKING77 | NMI | 0.7956 | DeepAligned |
