Deeply vocal characterizer
AudioSpeechIntroduced 2021-01-27
Deeply vocal characterizer is a human nonverbal vocalization dataset. This sample dataset consists of about 0.6 hours(56.7 hours in the full set) of audio(16 kHz, 16-bit, mono) across 16 human nonverbal vocalization classes, including throat-clearing, coughing, laughing, panting, and etc. The audio contents are crowdsourced by the general public of South Korea.
The dataset is a subset(approximately 1%) of a much bigger dataset which were recorded under the same circumstances as these open-source datasets. Please contact us(contact@deeplyinc.com) for the full set with the research/commercial license.