TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Deep Unknown Intent Detection with Margin Loss

Deep Unknown Intent Detection with Margin Loss

Ting-En Lin, Hua Xu

2019-06-02ACL 2019 7Open Intent DetectionIntent DetectionNovelty Detection
PaperPDFCode

Abstract

Identifying the unknown (novel) user intents that have never appeared in the training set is a challenging task in the dialogue system. In this paper, we present a two-stage method for detecting unknown intents. We use bidirectional long short-term memory (BiLSTM) network with the margin loss as the feature extractor. With margin loss, we can learn discriminative deep features by forcing the network to maximize inter-class variance and to minimize intra-class variance. Then, we feed the feature vectors to the density-based novelty detection algorithm, local outlier factor (LOF), to detect unknown intents. Experiments on two benchmark datasets show that our method can yield consistent improvements compared with the baseline methods.

Results

TaskDatasetMetricValueModel
Intent DetectionATIS (25% known)F10.696LMCL
Intent DetectionSNIPS (50% known)F10.841LMCL
Intent DetectionATIS (50% known)F10.396LMCL
Intent DetectionSNIPS (25% known)F10.792LMCL
Intent DetectionSNIPS (75% known)F10.788LMCL

Related Papers

Flippi: End To End GenAI Assistant for E-Commerce2025-07-08An Interdisciplinary Review of Commonsense Reasoning and Intent Detection2025-06-16Invocable APIs derived from NL2SQL datasets for LLM Tool-Calling Evaluation2025-06-12Integration of Old and New Knowledge for Generalized Intent Discovery: A Consistency-driven Prototype-Prompting Framework2025-06-10Building a Few-Shot Cross-Domain Multilingual NLU Model for Customer Care2025-06-04Harnessing Large Language Models for Scientific Novelty Detection2025-05-30SuperAD: A Training-free Anomaly Classification and Segmentation Method for CVPR 2025 VAND 3.0 Workshop Challenge Track 1: Adapt & Detect2025-05-26Exploring the Vulnerability of the Content Moderation Guardrail in Large Language Models via Intent Manipulation2025-05-24