Minseok Kim, Jun Hyung Lee, Soonyoung Jung
In this report, we present our award-winning solutions for the Music Demixing Track of Sound Demixing Challenge 2023. First, we propose TFC-TDF-UNet v3, a time-efficient music source separation model that achieves state-of-the-art results on the MUSDB benchmark. We then give full details regarding our solutions for each Leaderboard, including a loss masking approach for noise-robust training. Code for reproducing model training and final submissions is available at github.com/kuielab/sdx23.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Music Source Separation | MUSDB18 | SDR (avg) | 8.34 | TFC-TDF-UNet (v3) |
| Music Source Separation | MUSDB18 | SDR (bass) | 8.45 | TFC-TDF-UNet (v3) |
| Music Source Separation | MUSDB18 | SDR (drums) | 8.44 | TFC-TDF-UNet (v3) |
| Music Source Separation | MUSDB18 | SDR (other) | 6.86 | TFC-TDF-UNet (v3) |
| Music Source Separation | MUSDB18 | SDR (vocals) | 9.59 | TFC-TDF-UNet (v3) |
| Music Source Separation | MUSDB18-HQ | SDR (avg) | 8.34 | TFC-TDF-UNet (v3) |
| Music Source Separation | MUSDB18-HQ | SDR (bass) | 8.45 | TFC-TDF-UNet (v3) |
| Music Source Separation | MUSDB18-HQ | SDR (drums) | 8.44 | TFC-TDF-UNet (v3) |
| Music Source Separation | MUSDB18-HQ | SDR (others) | 6.86 | TFC-TDF-UNet (v3) |
| Music Source Separation | MUSDB18-HQ | SDR (vocals) | 9.59 | TFC-TDF-UNet (v3) |
| 2D Classification | MUSDB18 | SDR (avg) | 8.34 | TFC-TDF-UNet (v3) |
| 2D Classification | MUSDB18 | SDR (bass) | 8.45 | TFC-TDF-UNet (v3) |
| 2D Classification | MUSDB18 | SDR (drums) | 8.44 | TFC-TDF-UNet (v3) |
| 2D Classification | MUSDB18 | SDR (other) | 6.86 | TFC-TDF-UNet (v3) |
| 2D Classification | MUSDB18 | SDR (vocals) | 9.59 | TFC-TDF-UNet (v3) |
| 2D Classification | MUSDB18-HQ | SDR (avg) | 8.34 | TFC-TDF-UNet (v3) |
| 2D Classification | MUSDB18-HQ | SDR (bass) | 8.45 | TFC-TDF-UNet (v3) |
| 2D Classification | MUSDB18-HQ | SDR (drums) | 8.44 | TFC-TDF-UNet (v3) |
| 2D Classification | MUSDB18-HQ | SDR (others) | 6.86 | TFC-TDF-UNet (v3) |
| 2D Classification | MUSDB18-HQ | SDR (vocals) | 9.59 | TFC-TDF-UNet (v3) |