Francesc Lluís, Jordi Pons, Xavier Serra
Most currently successful source separation techniques use the magnitude spectrogram as input, and therefore by default omit part of the signal: the phase. To avoid discarding potentially useful information, we study the viability of end-to-end models for music source separation, which take into account all the information available in the raw audio signal, including the phase. Although end-to-end music source separation has long been considered almost unattainable, our results confirm that waveform-based models can perform on par with (if not better than) spectrogram-based deep learning models. Namely, the Wavenet-based model we propose and Wave-U-Net can outperform DeepConvSep, a recent spectrogram-based deep learning model.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Music Source Separation | MUSDB18 | SDR (avg) | 3.5 | Wavenet |
| Music Source Separation | MUSDB18 | SDR (bass) | 2.49 | Wavenet |
| Music Source Separation | MUSDB18 | SDR (drums) | 4.6 | Wavenet |
| Music Source Separation | MUSDB18 | SDR (other) | 0.54 | Wavenet |
| Music Source Separation | MUSDB18 | SDR (vocals) | 3.46 | Wavenet |
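The table reports Signal-to-Distortion Ratio (SDR) in dB. As a rough illustration of what this metric measures, the sketch below computes a simplified SDR: the energy ratio between the reference source and the estimation error. Note this is an assumption for illustration only; the official BSS Eval metric used on MUSDB18 additionally decomposes the error via projections onto the reference sources (as implemented in toolkits such as `mir_eval` or `museval`).

```python
import numpy as np

def sdr(reference: np.ndarray, estimate: np.ndarray) -> float:
    """Simplified SDR in dB: 10*log10(||reference||^2 / ||reference - estimate||^2).

    Omits the projection steps of full BSS Eval; illustrative only.
    """
    error = reference - estimate
    return 10.0 * np.log10(np.sum(reference ** 2) / np.sum(error ** 2))

# Example: an estimate at half the reference amplitude
ref = np.sin(np.linspace(0.0, 2.0 * np.pi, 1000))
est = 0.5 * ref
print(round(sdr(ref, est), 2))  # 6.02 dB, since the energy ratio is 4
```

Higher SDR means the estimated source is closer to the true source; the per-instrument values above are averaged over the MUSDB18 test tracks.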