Papers With Code 2

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


A Two-Stage Prediction-Aware Contrastive Learning Framework for Multi-Intent NLU

Guanhua Chen, Yutong Yao, Derek F. Wong, Lidia S. Chao

2024-05-05

Tags: Semantic Frame Parsing, Natural Language Understanding, Intent Detection, Data Augmentation, Slot Filling, Prediction, Contrastive Learning

Abstract

Multi-intent natural language understanding (NLU) is a formidable challenge because multiple intents within a single utterance can confuse the model. While previous works train the model contrastively to increase the margin between different multi-intent labels, they are less suited to the nuances of multi-intent NLU: they ignore the rich information between shared intents, which is beneficial for constructing a better embedding space, especially in low-data scenarios. We introduce a two-stage Prediction-Aware Contrastive Learning (PACL) framework for multi-intent NLU to harness this valuable knowledge. Our approach capitalizes on shared-intent information by integrating word-level pre-training and prediction-aware contrastive fine-tuning. We first construct a pre-training dataset using a word-level data augmentation strategy. Our framework then dynamically assigns roles to instances during contrastive fine-tuning and introduces a prediction-aware contrastive loss to maximize the impact of contrastive learning. Experimental results and empirical analysis on three widely used datasets demonstrate that our method surpasses three prominent baselines in both low-data and full-data scenarios.
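The abstract's core idea is a contrastive loss that exploits intents shared between utterances rather than treating every pair with different multi-intent labels as a pure negative. The paper's exact formulation is not reproduced on this page; the sketch below is an illustrative NumPy variant of that general idea, in which each positive pair is weighted by the Jaccard overlap of the two utterances' intent sets. The function name, the overlap weighting, and all constants are assumptions for illustration, not the authors' actual loss.

```python
import numpy as np

def overlap_weighted_contrastive_loss(embeddings, intent_sets, temperature=0.1):
    """Illustrative multi-intent contrastive loss (NOT the paper's exact loss).

    Positive pairs are weighted by the Jaccard overlap of their intent sets,
    so utterances sharing more intents are pulled together more strongly --
    a sketch of the "shared intent information" idea from the abstract.
    """
    # L2-normalize so dot products are cosine similarities.
    z = embeddings / np.linalg.norm(embeddings, axis=1, keepdims=True)
    sim = (z @ z.T) / temperature          # scaled pairwise similarities
    n = len(intent_sets)
    total = 0.0
    for i in range(n):
        # Weight each candidate positive j by its intent overlap with i.
        w = np.array([
            len(intent_sets[i] & intent_sets[j]) / len(intent_sets[i] | intent_sets[j])
            if j != i else 0.0
            for j in range(n)
        ])
        if w.sum() == 0:                   # no shared intents: no positives
            continue
        logits = np.exp(sim[i] - sim[i].max())
        logits[i] = 0.0                    # exclude self from the softmax
        log_prob = np.log(logits + 1e-12) - np.log(logits.sum())
        total += -(w / w.sum()) @ log_prob # overlap-weighted InfoNCE term
    return total / n
```

Embeddings that place intent-sharing utterances close together yield a lower loss; a full implementation would apply such a term to mini-batches of encoder outputs during fine-tuning.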

Results

Task              Dataset   Metric    Value  Model
----------------  --------  --------  -----  --------------
Slot Filling      MixSNIPS  Micro F1  96.8   SLIM (PACL)
Slot Filling      MixSNIPS  Micro F1  96.3   TFMN (PACL)
Slot Filling      MixSNIPS  Micro F1  96.2   RoBERTa (PACL)
Slot Filling      MixATIS   Micro F1  87.3   SLIM (PACL)
Slot Filling      MixATIS   Micro F1  86.7   TFMN (PACL)
Slot Filling      MixATIS   Micro F1  86     RoBERTa (PACL)
Intent Detection  MixSNIPS  Accuracy  97.4   TFMN (PACL)
Intent Detection  MixSNIPS  Accuracy  96.9   SLIM (PACL)
Intent Detection  MixSNIPS  Accuracy  96.5   RoBERTa (PACL)
Intent Detection  MixATIS   Accuracy  82.9   TFMN (PACL)
Intent Detection  MixATIS   Accuracy  81.9   SLIM (PACL)
Intent Detection  MixATIS   Accuracy  79.1   RoBERTa (PACL)

Related Papers

- Multi-Strategy Improved Snake Optimizer Accelerated CNN-LSTM-Attention-Adaboost for Trajectory Prediction (2025-07-21)
- Overview of the TalentCLEF 2025: Skill and Job Title Intelligence for Human Capital Management (2025-07-17)
- Pixel Perfect MegaMed: A Megapixel-Scale Vision-Language Foundation Model for Generating High Resolution Medical Images (2025-07-17)
- SemCSE: Semantic Contrastive Sentence Embeddings Using LLM-Generated Summaries For Scientific Abstracts (2025-07-17)
- HapticCap: A Multimodal Dataset and Task for Understanding User Experience of Vibration Haptic Signals (2025-07-17)
- SGCL: Unifying Self-Supervised and Supervised Learning for Graph Recommendation (2025-07-17)
- Similarity-Guided Diffusion for Contrastive Sequential Recommendation (2025-07-16)
- Data Augmentation in Time Series Forecasting through Inverted Framework (2025-07-15)