WildDESED: An LLM-Powered Dataset for Wild Domestic Environment Sound Event Detection System

Yang Xiao, Rohan Kumar Das

2024-07-04Sound Event Detection Event Detection Large Language Model Language Modelling

Abstract

This work aims to advance sound event detection (SED) research by presenting a new large language model (LLM)-powered dataset namely wild domestic environment sound event detection (WildDESED). It is crafted as an extension to the original DESED dataset to reflect diverse acoustic variability and complex noises in home settings. We leveraged LLMs to generate eight different domestic scenarios based on target sound categories of the DESED dataset. Then we enriched the scenarios with a carefully tailored mixture of noises selected from AudioSet and ensured no overlap with target sound. We consider widely popular convolutional neural recurrent network to study WildDESED dataset, which depicts its challenging nature. We then apply curriculum learning by gradually increasing noise complexity to enhance the model's generalization capabilities across various noise levels. Our results with this approach show improvements within the noisy environment, validating the effectiveness on the WildDESED dataset promoting noise-robust SED advancements.

Results

Task	Dataset	Metric	Value	Model
Sound Event Detection	WildDESED	PSDS1 (-5dB)	0.049	CRNN (WildDESED + Curriculrm learning)
Sound Event Detection	WildDESED	PSDS1 (0dB)	0.114	CRNN (WildDESED + Curriculrm learning)
Sound Event Detection	WildDESED	PSDS1 (10dB)	0.212	CRNN (WildDESED + Curriculrm learning)
Sound Event Detection	WildDESED	PSDS1 (5dB)	0.175	CRNN (WildDESED + Curriculrm learning)
Sound Event Detection	WildDESED	PSDS1 (Clean)	0.265	CRNN (WildDESED + Curriculrm learning)
Sound Event Detection	WildDESED	PSDS1 (-5dB)	0.048	CRNN (WildDESED)
Sound Event Detection	WildDESED	PSDS1 (0dB)	0.087	CRNN (WildDESED)
Sound Event Detection	WildDESED	PSDS1 (10dB)	0.175	CRNN (WildDESED)
Sound Event Detection	WildDESED	PSDS1 (5dB)	0.135	CRNN (WildDESED)
Sound Event Detection	WildDESED	PSDS1 (Clean)	0.2	CRNN (WildDESED)
Sound Event Detection	WildDESED	PSDS1 (-5dB)	0.017	CRNN
Sound Event Detection	WildDESED	PSDS1 (0dB)	0.064	CRNN
Sound Event Detection	WildDESED	PSDS1 (10dB)	0.222	CRNN
Sound Event Detection	WildDESED	PSDS1 (5dB)	0.148	CRNN
Sound Event Detection	WildDESED	PSDS1 (Clean)	0.348	CRNN

WildDESED: An LLM-Powered Dataset for Wild Domestic Environment Sound Event Detection System

Abstract

Results

Related Papers

WildDESED: An LLM-Powered Dataset for Wild Domestic Environment Sound Event Detection System

Abstract

Results

Related Papers