TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Hybrid Spectrogram and Waveform Source Separation

Hybrid Spectrogram and Waveform Source Separation

Alexandre Défossez

2021-11-05Music Source Separation
PaperPDFCode(official)

Abstract

Source separation models either work on the spectrogram or waveform domain. In this work, we show how to perform end-to-end hybrid source separation, letting the model decide which domain is best suited for each source, and even combining both. The proposed hybrid version of the Demucs architecture won the Music Demixing Challenge 2021 organized by Sony. This architecture also comes with additional improvements, such as compressed residual branches, local attention or singular value regularization. Overall, a 1.4 dB improvement of the Signal-To-Distortion (SDR) was observed across all sources as measured on the MusDB HQ dataset, an improvement confirmed by human subjective evaluation, with an overall quality rated at 2.83 out of 5 (2.36 for the non hybrid Demucs), and absence of contamination at 3.04 (against 2.37 for the non hybrid Demucs and 2.44 for the second ranking model submitted at the competition).

Results

TaskDatasetMetricValueModel
Music Source SeparationMUSDB18SDR (avg)7.72Hybrid Demucs
Music Source SeparationMUSDB18SDR (bass)8.67Hybrid Demucs
Music Source SeparationMUSDB18SDR (drums)8.58Hybrid Demucs
Music Source SeparationMUSDB18SDR (other)5.59Hybrid Demucs
Music Source SeparationMUSDB18SDR (vocals)8.04Hybrid Demucs
Music Source SeparationMUSDB18-HQSDR (avg)7.68Hybrid Demucs
Music Source SeparationMUSDB18-HQSDR (bass)8.76Hybrid Demucs
Music Source SeparationMUSDB18-HQSDR (drums)8.24Hybrid Demucs
Music Source SeparationMUSDB18-HQSDR (others)5.59Hybrid Demucs
Music Source SeparationMUSDB18-HQSDR (vocals)8.13Hybrid Demucs
2D ClassificationMUSDB18SDR (avg)7.72Hybrid Demucs
2D ClassificationMUSDB18SDR (bass)8.67Hybrid Demucs
2D ClassificationMUSDB18SDR (drums)8.58Hybrid Demucs
2D ClassificationMUSDB18SDR (other)5.59Hybrid Demucs
2D ClassificationMUSDB18SDR (vocals)8.04Hybrid Demucs
2D ClassificationMUSDB18-HQSDR (avg)7.68Hybrid Demucs
2D ClassificationMUSDB18-HQSDR (bass)8.76Hybrid Demucs
2D ClassificationMUSDB18-HQSDR (drums)8.24Hybrid Demucs
2D ClassificationMUSDB18-HQSDR (others)5.59Hybrid Demucs
2D ClassificationMUSDB18-HQSDR (vocals)8.13Hybrid Demucs

Related Papers

Music Source Restoration2025-05-27Training-Free Multi-Step Audio Source Separation2025-05-26Is MixIT Really Unsuitable for Correlated Sources? Exploring MixIT for Unsupervised Pre-training in Music Source Separation2025-05-12Solving Copyright Infringement on Short Video Platforms: Novel Datasets and an Audio Restoration Deep Learning Pipeline2025-04-30Score-informed Music Source Separation: Improving Synthetic-to-real Generalization in Classical Music2025-03-10Separate This, and All of these Things Around It: Music Source Separation via Hyperellipsoidal Queries2025-01-27Sanidha: A Studio Quality Multi-Modal Dataset for Carnatic Music2025-01-12MAJL: A Model-Agnostic Joint Learning Framework for Music Source Separation and Pitch Estimation2025-01-07