TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Speech/Speech Separation/WSJ0-2mix

Speech Separation on WSJ0-2mix

Metric: SI-SDRi (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕SI-SDRi▼Extra DataPaperDate↕Code
1TF-Locoformer (L) + DM25.1NoTF-Locoformer: Transformer with Local Modeling b...2024-08-06Code
2SepReformer-L25.1NoSeparate and Reconstruct: Asymmetric Encoder-Dec...2024-06-10Code
3TF-Locoformer (M) + DM24.6NoTF-Locoformer: Transformer with Local Modeling b...2024-08-06Code
4TF-Locoformer (L)24.2NoTF-Locoformer: Transformer with Local Modeling b...2024-08-06Code
5MossFormer2 (L)24.1No--Code
6SepTDA (L=12)24NoBoosting Unknown-number Speaker Separation with ...2024-01-23-
7Separate And Diffuse23.9NoSeparate And Diffuse: Using a Pretrained Diffusi...2023-01-25-
8TF-Locoformer (M)23.6NoTF-Locoformer: Transformer with Local Modeling b...2024-08-06Code
9TF-Locoformer (S) + DM22.8NoTF-Locoformer: Transformer with Local Modeling b...2024-08-06Code
10MossFormer (L) + DM22.8NoMossFormer: Pushing the Performance Limit of Mon...2023-02-23Code
11SepMamba + DM (M)22.7NoSepMamba: State-space models for speaker separat...2024-10-28Code
12SPGM + DM22.7NoSPGM: Prioritizing Local Features for enhanced s...2023-09-22Code
13MossFormer (M) + DM22.5NoMossFormer: Pushing the Performance Limit of Mon...2023-02-23Code
14SepIt22.4NoSepIt: Approaching a Single Channel Speech Separ...2022-05-24-
15SepFormer22.3NoAttention is All You Need in Speech Separation2020-10-25Code
16Wavesplit v222.2NoWavesplit: End-to-End Speech Separation by Speak...2020-02-20-
17SPGM22.1NoSPGM: Prioritizing Local Features for enhanced s...2023-09-22Code
18TF-Locoformer (S)22NoTF-Locoformer: Transformer with Local Modeling b...2024-08-06Code
19DPTNet (Libri1Mix speech enhancement pre-trained)21.3YesStabilizing Label Assignment for Speech Separati...2020-10-29Code
20SepMamba + DM (S)21.2NoSepMamba: State-space models for speaker separat...2024-10-28Code
21TD-Conformer (XL) + DM21.2NoOn Time Domain Conformer Models for Monaural Spe...2023-10-09Code
22Sandglasset21NoSandglasset: A Light Multi-Granularity Self-atte...2021-03-01Code
23GALR20.3NoEffective Low-Cost Time-Domain Audio Separation ...2021-01-13Code
24DPTNet20.2No--Code
25Gated DualPathRNN20.12NoVoice Separation with an Unknown Number of Multi...2020-02-29Code
26Sudo rm -rf (U=36)19.5NoCompute and memory efficient universal sound sou...2021-03-03Code
27Wavesplit v119NoWavesplit: End-to-End Speech Separation by Speak...2020-02-20-
28Sudo rm -rf XL18.9NoSudo rm -rf: Efficient Networks for Universal Au...2020-07-14Code
29Dual-path RNN18.8NoDual-path RNN: efficient long sequence modeling ...2019-10-14Code
30DeepCASA17.7NoDivide and Conquer: A Deep CASA Approach to Talk...2019-04-25Code
31IAC-PIT Tasnet17.5NoInterrupted and cascaded permutation invariant t...2019-10-28Code
32Deformable TCN + Dynamic Mixing17.2NoDeformable Temporal Convolutional Networks for M...2022-10-27Code
33Hybrid-Tasnet16.6NoImproved Speech Separation with Time-and-Frequen...2019-04-16Code
34Deformable TCN + Shared Weights + Dynamic Mixing16.1NoDeformable Temporal Convolutional Networks for M...2022-10-27Code
35Two-step Conv-TasNet16.1NoTwo-Step Sound Source Separation: Training on Le...2019-10-22Code
36Conv-TasNet15.3NoConv-TasNet: Surpassing Ideal Time-Frequency Mag...2018-09-20Code
37TasNet v213.2No--Code
38Chimera++11.5No--Code
39TasNet10.8NoTasNet: time-domain audio separation network for...2017-11-01Code
40Deep Clustering ++10.8NoDeep clustering: Discriminative embeddings for s...2015-08-18Code