TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/All for One and One for All: Improving Music Separation by...

All for One and One for All: Improving Music Separation by Bridging Networks

Ryosuke Sawata, Stefan Uhlich, Shusuke Takahashi, Yuki Mitsufuji

2020-10-08AllMusic Source Separation
PaperPDFCodeCode(official)Code(official)CodeCode

Abstract

This paper proposes several improvements for music separation with deep neural networks (DNNs), namely a multi-domain loss (MDL) and two combination schemes. First, by using MDL we take advantage of the frequency and time domain representation of audio signals. Next, we utilize the relationship among instruments by jointly considering them. We do this on the one hand by modifying the network architecture and introducing a CrossNet structure. On the other hand, we consider combinations of instrument estimates by using a new combination loss (CL). MDL and CL can easily be applied to many existing DNN-based separation methods as they are merely loss functions which are only used during training and which do not affect the inference step. Experimental results show that the performance of Open-Unmix (UMX), a well-known and state-of-the-art open source library for music separation, can be improved by utilizing our above schemes. Our modifications of UMX are open-sourced together with this paper.

Results

TaskDatasetMetricValueModel
Music Source SeparationMUSDB18SDR (avg)5.79X-UMX
Music Source SeparationMUSDB18SDR (bass)5.43X-UMX
Music Source SeparationMUSDB18SDR (drums)6.47X-UMX
Music Source SeparationMUSDB18SDR (other)4.64X-UMX
Music Source SeparationMUSDB18SDR (vocals)6.61X-UMX
2D ClassificationMUSDB18SDR (avg)5.79X-UMX
2D ClassificationMUSDB18SDR (bass)5.43X-UMX
2D ClassificationMUSDB18SDR (drums)6.47X-UMX
2D ClassificationMUSDB18SDR (other)4.64X-UMX
2D ClassificationMUSDB18SDR (vocals)6.61X-UMX

Related Papers

Modeling Code: Is Text All You Need?2025-07-15All Eyes, no IMU: Learning Flight Attitude from Vision Alone2025-07-15Is Diversity All You Need for Scalable Robotic Manipulation?2025-07-08DESIGN AND IMPLEMENTATION OF ONLINE CLEARANCE REPORT.2025-07-07Is Reasoning All You Need? Probing Bias in the Age of Reasoning Language Models2025-07-03Prompt2SegCXR:Prompt to Segment All Organs and Diseases in Chest X-rays2025-07-01State and Memory is All You Need for Robust and Reliable AI Agents2025-06-30EAMamba: Efficient All-Around Vision State Space Model for Image Restoration2025-06-27