Junyu Chen, Susmitha Vekkot, Pancham Shukla
Music source separation (MSS) aims to extract the 'vocals', 'drums', 'bass', and 'other' stems from a mixed music recording. While deep learning methods have shown impressive results, state-of-the-art models have trended toward ever-larger sizes. In this paper, we introduce DTTNet, a novel lightweight architecture based on the Dual-Path Module and the Time-Frequency Convolutions Time-Distributed Fully-connected UNet (TFC-TDF UNet). DTTNet achieves 10.12 dB cSDR on 'vocals', compared to the 10.01 dB reported for Bandsplit RNN (BSRNN), with 86.7% fewer parameters. We also assess pattern-specific performance and model generalization on intricate audio patterns.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Music Source Separation | MUSDB18-HQ | SDR (avg) | 8.15 | Dual-Path TFC-TDF UNet (DTTNet) |
| Music Source Separation | MUSDB18-HQ | SDR (bass) | 7.55 | Dual-Path TFC-TDF UNet (DTTNet) |
| Music Source Separation | MUSDB18-HQ | SDR (drums) | 7.82 | Dual-Path TFC-TDF UNet (DTTNet) |
| Music Source Separation | MUSDB18-HQ | SDR (other) | 7.02 | Dual-Path TFC-TDF UNet (DTTNet) |
| Music Source Separation | MUSDB18-HQ | SDR (vocals) | 10.21 | Dual-Path TFC-TDF UNet (DTTNet) |
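The SDR values above measure how closely a separated stem matches the ground-truth stem. As a minimal sketch, the basic signal-to-distortion ratio is 10·log10 of the ratio between reference energy and residual-error energy; note that the paper's cSDR follows the SiSEC/museval evaluation protocol (median over song chunks), which this simplified version does not reproduce:

```python
import math

def sdr_db(reference, estimate):
    """Basic Signal-to-Distortion Ratio in dB:
    10 * log10(||s||^2 / ||s - s_hat||^2),
    where s is the reference stem and s_hat the separated estimate."""
    signal_power = sum(s * s for s in reference)
    error_power = sum((s - e) ** 2 for s, e in zip(reference, estimate))
    return 10 * math.log10(signal_power / error_power)

# Toy example: reference [3, 4] has energy 25; an estimate off by 1 in one
# sample leaves error energy 1, so SDR = 10 * log10(25) ~= 13.98 dB.
print(round(sdr_db([3.0, 4.0], [3.0, 5.0]), 2))  # -> 13.98
```

A perfect estimate drives the error energy to zero (infinite SDR), so higher values, like the 10.21 dB reported for 'vocals', indicate cleaner separation.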