TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Continual self-training with bootstrapped remixing for spe...

Continual self-training with bootstrapped remixing for speech enhancement

Efthymios Tzinis, Yossi Adi, Vamsi K. Ithapu, Buye Xu, Anurag Kumar

2021-10-19Unsupervised Domain AdaptationSpeech EnhancementDomain Adaptation
PaperPDFCode

Abstract

We propose RemixIT, a simple and novel self-supervised training method for speech enhancement. The proposed method is based on a continuously self-training scheme that overcomes limitations from previous studies including assumptions for the in-domain noise distribution and having access to clean target signals. Specifically, a separation teacher model is pre-trained on an out-of-domain dataset and is used to infer estimated target signals for a batch of in-domain mixtures. Next, we bootstrap the mixing process by generating artificial mixtures using permuted estimated clean and noise signals. Finally, the student model is trained using the permuted estimated sources as targets while we periodically update teacher's weights using the latest student model. Our experiments show that RemixIT outperforms several previous state-of-the-art self-supervised methods under multiple speech enhancement tasks. Additionally, RemixIT provides a seamless alternative for semi-supervised and unsupervised domain adaptation for speech enhancement tasks, while being general enough to be applied to any separation task and paired with any separation model.

Results

TaskDatasetMetricValueModel
Speech EnhancementDeep Noise Suppression (DNS) ChallengePESQ-WB2.69Sudo rm-rf (U=8)
Speech EnhancementDeep Noise Suppression (DNS) ChallengeSI-SDR-WB18.6Sudo rm-rf (U=8)
Speech EnhancementDeep Noise Suppression (DNS) ChallengePESQ-WB2.6RemixIT (w Sudo U=32)
Speech EnhancementDeep Noise Suppression (DNS) ChallengeSI-SDR-WB18RemixIT (w Sudo U=32)

Related Papers

Autoregressive Speech Enhancement via Acoustic Tokens2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17P.808 Multilingual Speech Enhancement Testing: Approach and Results of URGENT 2025 Challenge2025-07-15Domain Borders Are There to Be Crossed With Federated Few-Shot Adaptation2025-07-14An Offline Mobile Conversational Agent for Mental Health Support: Learning from Emotional Dialogues and Psychological Texts with Student-Centered Evaluation2025-07-11The Bayesian Approach to Continual Learning: An Overview2025-07-11Doodle Your Keypoints: Sketch-Based Few-Shot Keypoint Detection2025-07-10Robust One-step Speech Enhancement via Consistency Distillation2025-07-08