TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/EVI: Multilingual Spoken Dialogue Tasks and Dataset for Kn...

EVI: Multilingual Spoken Dialogue Tasks and Dataset for Knowledge-Based Enrolment, Verification, and Identification

Georgios P. Spithourakis, Ivan Vulić, Michał Lis, Iñigo Casanueva, Paweł Budzianowski

2022-04-28Findings (NAACL) 2022 7Speaker IdentificationSpeaker VerificationSpoken Dialogue Systems
PaperPDFCode(official)

Abstract

Knowledge-based authentication is crucial for task-oriented spoken dialogue systems that offer personalised and privacy-focused services. Such systems should be able to enrol (E), verify (V), and identify (I) new and recurring users based on their personal information, e.g. postcode, name, and date of birth. In this work, we formalise the three authentication tasks and their evaluation protocols, and we present EVI, a challenging spoken multilingual dataset with 5,506 dialogues in English, Polish, and French. Our proposed models set the first competitive benchmarks, explore the challenges of multilingual natural language processing of spoken dialogue, and set directions for future research.

Results

TaskDatasetMetricValueModel
Speaker IdentificationEVI en-GBTop-1 (%)67.77Fuzzy Retrieval
Speaker IdentificationEVI pl-PLTop-1 (%)95.13Fuzzy Retrieval
Speaker IdentificationEVI fr-FRTop-1 (%)80.83Fuzzy Retrieval

Related Papers

SHIELD: A Secure and Highly Enhanced Integrated Learning for Robust Deepfake Detection against Adversarial Attacks2025-07-17Prompt-Guided Turn-Taking Prediction2025-06-26SSAVSV: Towards Unified Model for Self-Supervised Audio-Visual Speaker Verification2025-06-21Pushing the Performance of Synthetic Speech Detection with Kolmogorov-Arnold Networks and Self-Supervised Learning Models2025-06-17A Comparative Evaluation of Deep Learning Models for Speech Enhancement in Real-World Noisy Environments2025-06-17Mitigating Non-Target Speaker Bias in Guided Speaker Embedding2025-06-14CoLMbo: Speaker Language Model for Descriptive Profiling2025-06-11You Are What You Say: Exploiting Linguistic Content for VoicePrivacy Attacks2025-06-11