Improving End-to-End Speech-to-Intent Classification with Reptile

Yusheng Tian, Philip John Gorinski

2020-08-05

Tags: Speech Recognition, Meta-Learning, Spoken Language Understanding, Intent Classification, General Classification, Classification


Abstract

End-to-end spoken language understanding (SLU) systems have many advantages over conventional pipeline systems, but collecting in-domain speech data to train an end-to-end system is costly and time-consuming. This raises the question of how to train an end-to-end SLU system with limited amounts of data. Many researchers have explored approaches that make use of other related data resources, typically by pre-training parts of the model on high-resource speech recognition. In this paper, we suggest improving the generalization performance of SLU models with a non-standard learning algorithm, Reptile. Though Reptile was originally proposed for model-agnostic meta-learning, we argue that it can also be used to learn a target task directly and that it generalizes better than conventional gradient descent. In this work, we apply Reptile to the task of end-to-end spoken intent classification. Experiments on four datasets of different languages and domains show improved intent prediction accuracy, both when Reptile is used alone and when it is used in addition to pre-training.
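To make the idea of using Reptile to learn a single target task concrete, here is a minimal sketch of the general Reptile update: take a few ordinary SGD steps on different mini-batches starting from the current weights, then move the weights only part of the way toward the result. The toy linear-regression model, the synthetic data, and every name and hyperparameter below (`reptile_train`, `inner_steps`, `outer_lr`, and so on) are assumptions chosen for illustration. This is not the SLU model, the data, or the settings used in the paper.

```python
import random


def sgd_step(w, b, batch, lr):
    """One SGD step on mean squared error for the toy model y = w * x + b."""
    n = len(batch)
    grad_w = sum(2 * (w * x + b - y) * x for x, y in batch) / n
    grad_b = sum(2 * (w * x + b - y) for x, y in batch) / n
    return w - lr * grad_w, b - lr * grad_b


def reptile_train(data, outer_steps=200, inner_steps=4, batch_size=8,
                  inner_lr=0.05, outer_lr=0.5, seed=0):
    """Reptile-style training on a single task (hypothetical toy setup).

    Each outer step runs `inner_steps` SGD updates on fresh mini-batches,
    then interpolates the slow weights toward the adapted fast weights.
    """
    rng = random.Random(seed)
    w, b = 0.0, 0.0  # slow weights
    for _ in range(outer_steps):
        fast_w, fast_b = w, b  # start the inner loop from the slow weights
        for _ in range(inner_steps):
            batch = rng.sample(data, batch_size)
            fast_w, fast_b = sgd_step(fast_w, fast_b, batch, inner_lr)
        # Reptile update: move part of the way toward the adapted weights.
        w += outer_lr * (fast_w - w)
        b += outer_lr * (fast_b - b)
    return w, b


def make_toy_data(n=64, seed=1):
    """Synthetic points from y = 3x - 1 with a little Gaussian noise."""
    rng = random.Random(seed)
    xs = [rng.uniform(-1.0, 1.0) for _ in range(n)]
    return [(x, 3.0 * x - 1.0 + rng.gauss(0.0, 0.01)) for x in xs]


data = make_toy_data()
w, b = reptile_train(data)
print(f"learned w={w:.3f}, b={b:.3f}")
```

With `inner_steps=1` and `outer_lr=1.0` the loop reduces to plain mini-batch SGD, so any difference from conventional training comes from taking several inner steps before each interpolation.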

Results

Task	Dataset	Metric	Value	Model
Dialogue	Fluent Speech Commands	Accuracy (%)	99.2	Reptile
Spoken Language Understanding	Fluent Speech Commands	Accuracy (%)	99.2	Reptile
Dialogue Understanding	Fluent Speech Commands	Accuracy (%)	99.2	Reptile
