TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Stabilizing Label Assignment for Speech Separation by Self...

Stabilizing Label Assignment for Speech Separation by Self-supervised Pre-training

Sung-Feng Huang, Shun-Po Chuang, Da-Rong Liu, Yi-Chen Chen, Gene-Ping Yang, Hung-Yi Lee

2020-10-29Speech SeparationSpeech EnhancementSpeaker Separation
PaperPDFCode(official)

Abstract

Speech separation has been well developed, with the very successful permutation invariant training (PIT) approach, although the frequent label assignment switching happening during PIT training remains to be a problem when better convergence speed and achievable performance are desired. In this paper, we propose to perform self-supervised pre-training to stabilize the label assignment in training the speech separation model. Experiments over several types of self-supervised approaches, several typical speech separation models and two different datasets showed that very good improvements are achievable if a proper self-supervised approach is chosen.

Results

TaskDatasetMetricValueModel
Speech SeparationWSJ0-2mixSDRi21.5DPTNet (Libri1Mix speech enhancement pre-trained)
Speech SeparationWSJ0-2mixSI-SDRi21.3DPTNet (Libri1Mix speech enhancement pre-trained)
Speech SeparationLibri2MixSDRi14.6Conv-Tasnet (Libri1Mix speech enhancement pre-trained)
Speech SeparationLibri2MixSI-SDRi14.1Conv-Tasnet (Libri1Mix speech enhancement pre-trained)
Speech SeparationLibri2MixSDRi14.1Conv-Tasnet (Libri1Mix speech enhancement multi-task)
Speech SeparationLibri2MixSI-SDRi13.7Conv-Tasnet (Libri1Mix speech enhancement multi-task)
Speech SeparationLibri2MixSDRi13.6Conv-Tasnet
Speech SeparationLibri2MixSI-SDRi13.2Conv-Tasnet

Related Papers

Autoregressive Speech Enhancement via Acoustic Tokens2025-07-17P.808 Multilingual Speech Enhancement Testing: Approach and Results of URGENT 2025 Challenge2025-07-15Dynamic Slimmable Networks for Efficient Speech Separation2025-07-08Robust One-step Speech Enhancement via Consistency Distillation2025-07-08Speech Quality Assessment Model Based on Mixture of Experts: System-Level Performance Enhancement and Utterance-Level Challenge Analysis2025-07-08MambAttention: Mamba with Multi-Head Attention for Generalizable Single-Channel Speech Enhancement2025-07-01Frequency-Weighted Training Losses for Phoneme-Level DNN-based Speech Enhancement2025-06-23EDNet: A Distortion-Agnostic Speech Enhancement Framework with Gating Mamba Mechanism and Phase Shift-Invariant Training2025-06-19