Audio Embeddings as Teachers for Music Classification

Yiwei Ding, Alexander Lerch

2023-06-30Transfer Learning Music Auto-Tagging Information Retrieval Retrieval Classification Knowledge Distillation Music Classification Music Information Retrieval Instrument Recognition

Paper PDF Code(official)

Abstract

Music classification has been one of the most popular tasks in the field of music information retrieval. With the development of deep learning models, the last decade has seen impressive improvements in a wide range of classification tasks. However, the increasing model complexity makes both training and inference computationally expensive. In this paper, we integrate the ideas of transfer learning and feature-based knowledge distillation and systematically investigate using pre-trained audio embeddings as teachers to guide the training of low-complexity student networks. By regularizing the feature space of the student networks with the pre-trained embeddings, the knowledge in the teacher embeddings can be transferred to the students. We use various pre-trained audio embeddings and test the effectiveness of the method on the tasks of musical instrument classification and music auto-tagging. Results show that our method significantly improves the results in comparison to the identical model trained without the teacher's knowledge. This technique can also be combined with classical knowledge distillation approaches to further improve the model's performance.

Results

Task	Dataset	Metric	Value	Model
Music Auto-Tagging	MagnaTagATune (clean)	PR-AUC	46.1	EAsT-KD + PaSST
Music Auto-Tagging	MagnaTagATune (clean)	ROC-AUC	91.5	EAsT-KD + PaSST
Music Auto-Tagging	MagnaTagATune (clean)	PR-AUC	45.9	EAsT-Final + PaSST
Music Auto-Tagging	MagnaTagATune (clean)	ROC-AUC	91.2	EAsT-Final + PaSST
Instrument Recognition	OpenMIC-2018	mean average precision	0.852	EAsT-KD + PaSST
Instrument Recognition	OpenMIC-2018	mean average precision	0.847	EAsT-Final + PaSST

Audio Embeddings as Teachers for Music Classification

Abstract

Results

Related Papers

Audio Embeddings as Teachers for Music Classification

Abstract

Results

Related Papers