Papers With Code 2 | ML Benchmarks, SotA Results & Code

BIOSED-ACPD: BIOacoustic Sound Event Detection - Adaptive Change Point Detection dataset

Description. This dataset contains the generated audio mixtures with strong annotations, and pre-computed BirdNET embeddings used in the paper "From Weak to Strong Sound Event Labels using Adaptive Change-Point Detection and Active Learning". The results of the paper can be reproduced at three different levels using this dataset:

results_eusipco_2024.zip : contains the experiment results used to produce all tables and figures in the paper. data.zip : contains the pre-computed BirdNET embeddings, labels and the JAMS files from Scaper. audio_data.tar.gz : contains the pre-generated audio mixtures for the datasets. At level (1) the figures and tables are produced using the exact results from the experiments presented in the paper, at level (2) the pre-computed embeddings and annotations are used to run the experiments and reproduce the results, and at level (3) the embeddings can be computed using BirdNET on the audio mixtures. For the audio mixtures download all parts and run "cat audio_data.tar.gz.* > audio_data.tar.gz".

source code : https://github.com/johnmartinsson/adaptive-change-point-detection/blob/main/README.md paper link : https://arxiv.org/abs/2403.08525

Cite as.

@misc{martinsson2024weak, title={From Weak to Strong Sound Event Labels using Adaptive Change-Point Detection and Active Learning}, author={John Martinsson and Olof Mogren and Maria Sandsten and Tuomas Virtanen}, year={2024}, eprint={2403.08525}, archivePrefix={arXiv}, primaryClass={cs.SD} }

Attribution. The audio mixtures are derived from three openly available audio datasets:

Nolasco, I., Singh, S., Strandburg-Peshkin, A., Gill, L., Pamula, H., Grout, E., Morford, J., Emmerson, M., Jensens, F., Whitehead, H., Kiskin, I., Vidana-Vila, E., Lostanlen, V., & Stowell, D. (2022). DCASE 2022 Task 5: Few-shot Bioacoustic Event Detection Evaluation Set [Data set]. Zenodo. https://doi.org/10.5281/zenodo.6517414, available under the Creative Commons Attribution 4.0 International License Trowitzsch, I., Taghia, J., Kashef, Y., & Obermayer, K. (2019). NIGENS general sound events data