TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Timers and Such: A Practical Benchmark for Spoken Language...

Timers and Such: A Practical Benchmark for Spoken Language Understanding with Numbers

Loren Lugosch, Piyush Papreja, Mirco Ravanelli, Abdelwahab Heba, Titouan Parcollet

2021-04-04Spoken Language Understanding
PaperPDFCode(official)Code(official)

Abstract

This paper introduces Timers and Such, a new open source dataset of spoken English commands for common voice control use cases involving numbers. We describe the gap in existing spoken language understanding datasets that Timers and Such fills, the design and creation of the dataset, and experiments with a number of ASR-based and end-to-end baseline models, the code for which has been made available as part of the SpeechBrain toolkit.

Results

TaskDatasetMetricValueModel
DialogueTimers and SuchAccuracy (%)81.6Baseline
Spoken Language UnderstandingTimers and SuchAccuracy (%)81.6Baseline
Dialogue UnderstandingTimers and SuchAccuracy (%)81.6Baseline

Related Papers

MMSU: A Massive Multi-task Spoken Language Understanding and Reasoning Benchmark2025-06-05ALAS: Measuring Latent Speech-Text Alignment For Spoken Language Understanding In Multimodal LLMs2025-05-26"KAN you hear me?" Exploring Kolmogorov-Arnold Networks for Spoken Language Understanding2025-05-26Exploring the Effect of Segmentation and Vocabulary Size on Speech Tokenization for Speech Language Models2025-05-23"Alexa, can you forget me?" Machine Unlearning Benchmark in Spoken Language Understanding2025-05-21QUADS: QUAntized Distillation Framework for Efficient Speech Language Understanding2025-05-19Spoken Language Understanding on Unseen Tasks With In-Context Learning2025-05-12LiveLongBench: Tackling Long-Context Understanding for Spoken Texts from Live Streams2025-04-24