Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

xLSTM-SENet: xLSTM for Single-Channel Speech Enhancement

Nikolai Lund Kühne, Jan Østergaard, Jesper Jensen, Zheng-Hua Tan

Date: 2025-01-10
Task: Speech Enhancement

Abstract

While attention-based architectures, such as Conformers, excel in speech enhancement, they face challenges such as poor scalability with respect to input sequence length. In contrast, the recently proposed Extended Long Short-Term Memory (xLSTM) architecture offers linear scalability. However, xLSTM-based models remain unexplored for speech enhancement. This paper introduces xLSTM-SENet, the first xLSTM-based single-channel speech enhancement system. A comparative analysis reveals that xLSTM, and notably even LSTM, can match or outperform state-of-the-art Mamba- and Conformer-based systems across various model sizes in speech enhancement on the VoiceBank+DEMAND dataset. Through ablation studies, we identify key architectural design choices, such as exponential gating and bidirectionality, that contribute to its effectiveness. Our best xLSTM-based model, xLSTM-SENet2, outperforms state-of-the-art Mamba- and Conformer-based systems of similar complexity on the VoiceBank+DEMAND dataset.
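The abstract names linear scalability, exponential gating, and bidirectionality as key properties. The toy sketch below illustrates those three ideas and is not the authors' implementation. It assumes a scalar state, fixed gate weights (`w_i` and `w_f` are hypothetical), no output gate, the raw input used as the cell input, and averaging to merge the two directions. The function names `exp_gated_scan` and `bidirectional_scan` are made up for illustration. The log-domain stabilizer is loosely modeled on the stabilized exponential gating described for xLSTM.

```python
import math


def log_sigmoid(a):
    """Numerically stable log(sigmoid(a))."""
    if a >= 0:
        return -math.log1p(math.exp(-a))
    return a - math.log1p(math.exp(a))


def exp_gated_scan(inputs, w_i=0.5, w_f=1.0):
    """Toy scalar recurrent cell with an exponential input gate.

    The loop does one constant-cost step per element, so the total cost
    grows linearly with len(inputs). w_i and w_f are hypothetical fixed
    gate weights; a trained model would learn them.
    """
    c, n, m = 0.0, 0.0, 0.0  # cell state, normalizer state, log-scale stabilizer
    outputs = []
    for x in inputs:
        log_i = w_i * x               # log of the exponential input gate exp(w_i * x)
        log_f = log_sigmoid(w_f * x)  # log of a sigmoid forget gate
        # Stabilizer: rescale both gates so neither exponential can overflow.
        m_new = max(log_f + m, log_i)
        i_gate = math.exp(log_i - m_new)
        f_gate = math.exp(log_f + m - m_new)
        c = f_gate * c + i_gate * x   # assumption: the cell input is x itself
        n = f_gate * n + i_gate       # normalizer accumulates the gate mass
        outputs.append(c / n)         # normalized hidden value (no output gate)
        m = m_new
    return outputs


def bidirectional_scan(inputs):
    """Run the scan forward and backward, then average the two directions.

    Averaging is an assumption made for this sketch; other merge rules
    (concatenation followed by a projection, for example) are possible.
    """
    forward = exp_gated_scan(inputs)
    backward = exp_gated_scan(inputs[::-1])[::-1]
    return [0.5 * (f + b) for f, b in zip(forward, backward)]
```

In this sketch each output is a positively weighted average of the inputs seen so far, so it always stays within the range of the input values. The bidirectional variant lets every position draw on both past and future context, at roughly twice the cost of a single pass.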

Results

Task                Dataset             Metric          Value  Model
Speech Enhancement  VoiceBank + DEMAND  CBAK            3.98   xLSTM-SENet2
Speech Enhancement  VoiceBank + DEMAND  COVL            4.27   xLSTM-SENet2
Speech Enhancement  VoiceBank + DEMAND  CSIG            4.78   xLSTM-SENet2
Speech Enhancement  VoiceBank + DEMAND  PESQ (wb)       3.53   xLSTM-SENet2
Speech Enhancement  VoiceBank + DEMAND  Parameters (M)  2.27   xLSTM-SENet2
Speech Enhancement  VoiceBank + DEMAND  STOI            0.96   xLSTM-SENet2

Related Papers

Autoregressive Speech Enhancement via Acoustic Tokens (2025-07-17)
P.808 Multilingual Speech Enhancement Testing: Approach and Results of URGENT 2025 Challenge (2025-07-15)
Robust One-step Speech Enhancement via Consistency Distillation (2025-07-08)
Speech Quality Assessment Model Based on Mixture of Experts: System-Level Performance Enhancement and Utterance-Level Challenge Analysis (2025-07-08)
MambAttention: Mamba with Multi-Head Attention for Generalizable Single-Channel Speech Enhancement (2025-07-01)
Frequency-Weighted Training Losses for Phoneme-Level DNN-based Speech Enhancement (2025-06-23)
EDNet: A Distortion-Agnostic Speech Enhancement Framework with Gating Mamba Mechanism and Phase Shift-Invariant Training (2025-06-19)
A Comparative Evaluation of Deep Learning Models for Speech Enhancement in Real-World Noisy Environments (2025-06-17)