TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Sams-Net: A Sliced Attention-based Neural Network for Musi...

Sams-Net: A Sliced Attention-based Neural Network for Music Source Separation

Tingle Li, Jia-Wei Chen, Haowen Hou, Ming Li

2019-09-12Audio Source SeparationMusic Source Separation
PaperPDFCode

Abstract

Convolutional Neural Network (CNN) or Long short-term memory (LSTM) based models with the input of spectrogram or waveforms are commonly used for deep learning based audio source separation. In this paper, we propose a Sliced Attention-based neural network (Sams-Net) in the spectrogram domain for the music source separation task. It enables spectral feature interactions with multi-head attention mechanism, achieves easier parallel computing and has a larger receptive field compared with LSTMs and CNNs respectively. Experimental results on the MUSDB18 dataset show that the proposed method, with fewer parameters, outperforms most of the state-of-the-art DNN-based methods.

Results

TaskDatasetMetricValueModel
Music Source SeparationMUSDB18SDR (avg)5.65Sams-Net
Music Source SeparationMUSDB18SDR (bass)5.25Sams-Net
Music Source SeparationMUSDB18SDR (drums)6.63Sams-Net
Music Source SeparationMUSDB18SDR (other)4.09Sams-Net
Music Source SeparationMUSDB18SDR (vocals)6.61Sams-Net
2D ClassificationMUSDB18SDR (avg)5.65Sams-Net
2D ClassificationMUSDB18SDR (bass)5.25Sams-Net
2D ClassificationMUSDB18SDR (drums)6.63Sams-Net
2D ClassificationMUSDB18SDR (other)4.09Sams-Net
2D ClassificationMUSDB18SDR (vocals)6.61Sams-Net

Related Papers

Towards Reliable Objective Evaluation Metrics for Generative Singing Voice Separation Models2025-07-15DGMO: Training-Free Audio Source Separation through Diffusion-Guided Mask Optimization2025-06-03ZeroSep: Separate Anything in Audio with Zero Training2025-05-29Text-Queried Audio Source Separation via Hierarchical Modeling2025-05-27Music Source Restoration2025-05-27Training-Free Multi-Step Audio Source Separation2025-05-26Is MixIT Really Unsuitable for Correlated Sources? Exploring MixIT for Unsupervised Pre-training in Music Source Separation2025-05-12Score Distillation Sampling for Audio: Source Separation, Synthesis, and Beyond2025-05-07