TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/LaSAFT: Latent Source Attentive Frequency Transformation f...

LaSAFT: Latent Source Attentive Frequency Transformation for Conditioned Source Separation

Woosung Choi, Minseok Kim, Jaehwa Chung, Soonyoung Jung

2020-10-22Music Source Separation
PaperPDFCode(official)

Abstract

Recent deep-learning approaches have shown that Frequency Transformation (FT) blocks can significantly improve spectrogram-based single-source separation models by capturing frequency patterns. The goal of this paper is to extend the FT block to fit the multi-source task. We propose the Latent Source Attentive Frequency Transformation (LaSAFT) block to capture source-dependent frequency patterns. We also propose the Gated Point-wise Convolutional Modulation (GPoCM), an extension of Feature-wise Linear Modulation (FiLM), to modulate internal features. By employing these two novel methods, we extend the Conditioned-U-Net (CUNet) for multi-source separation, and the experimental results indicate that our LaSAFT and GPoCM can improve the CUNet's performance, achieving state-of-the-art SDR performance on several MUSDB18 source separation tasks.

Results

TaskDatasetMetricValueModel
Music Source SeparationMUSDB18SDR (avg)5.88LaSAFT+GPoCM
Music Source SeparationMUSDB18SDR (bass)5.63LaSAFT+GPoCM
Music Source SeparationMUSDB18SDR (drums)5.68LaSAFT+GPoCM
Music Source SeparationMUSDB18SDR (other)4.87LaSAFT+GPoCM
Music Source SeparationMUSDB18SDR (vocals)7.33LaSAFT+GPoCM
2D ClassificationMUSDB18SDR (avg)5.88LaSAFT+GPoCM
2D ClassificationMUSDB18SDR (bass)5.63LaSAFT+GPoCM
2D ClassificationMUSDB18SDR (drums)5.68LaSAFT+GPoCM
2D ClassificationMUSDB18SDR (other)4.87LaSAFT+GPoCM
2D ClassificationMUSDB18SDR (vocals)7.33LaSAFT+GPoCM

Related Papers

Music Source Restoration2025-05-27Training-Free Multi-Step Audio Source Separation2025-05-26Is MixIT Really Unsuitable for Correlated Sources? Exploring MixIT for Unsupervised Pre-training in Music Source Separation2025-05-12Solving Copyright Infringement on Short Video Platforms: Novel Datasets and an Audio Restoration Deep Learning Pipeline2025-04-30Score-informed Music Source Separation: Improving Synthetic-to-real Generalization in Classical Music2025-03-10Separate This, and All of these Things Around It: Music Source Separation via Hyperellipsoidal Queries2025-01-27Sanidha: A Studio Quality Multi-Modal Dataset for Carnatic Music2025-01-12MAJL: A Model-Agnostic Joint Learning Framework for Music Source Separation and Pitch Estimation2025-01-07