TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/A Hierarchical Regression Chain Framework for Affective Vo...

A Hierarchical Regression Chain Framework for Affective Vocal Burst Recognition

Jinchao Li, Xixin Wu, Kaitao Song, Dongsheng Li, Xunying Liu, Helen Meng

2023-03-14regressionVocal Bursts Intensity PredictionSelf-Supervised LearningA-VB HighCultural Vocal Bursts Intensity PredictionA-VB TwoA-VB CultureVocal Bursts Valence Prediction
PaperPDFCode(official)

Abstract

As a common way of emotion signaling via non-linguistic vocalizations, vocal burst (VB) plays an important role in daily social interaction. Understanding and modeling human vocal bursts are indispensable for developing robust and general artificial intelligence. Exploring computational approaches for understanding vocal bursts is attracting increasing research attention. In this work, we propose a hierarchical framework, based on chain regression models, for affective recognition from VBs, that explicitly considers multiple relationships: (i) between emotional states and diverse cultures; (ii) between low-dimensional (arousal & valence) and high-dimensional (10 emotion classes) emotion spaces; and (iii) between various emotion classes within the high-dimensional space. To address the challenge of data sparsity, we also use self-supervised learning (SSL) representations with layer-wise and temporal aggregation modules. The proposed systems participated in the ACII Affective Vocal Burst (A-VB) Challenge 2022 and ranked first in the "TWO'' and "CULTURE'' tasks. Experimental results based on the ACII Challenge 2022 dataset demonstrate the superior performance of the proposed system and the effectiveness of considering multiple relationships using hierarchical regression chain models.

Results

TaskDatasetMetricValueModel
Emotion RecognitionHUME-VBConcordance correlation coefficient (CCC)0.7237w2v2-mtl-chain
Emotion RecognitionHUME-VBConcordance correlation coefficient (CCC)0.6854w2v2-mtl-chain
Emotion RecognitionHUME-VBConcordance correlation coefficient (CCC)0.6017w2v2-mtl-chain
Emotion RecognitionHUME-VBConcordance correlation coefficient (CCC)0.7237w2v2-mtl-chain
Emotion RecognitionHUME-VBConcordance correlation coefficient (CCC)0.6854w2v2-mtl-chain
Emotion RecognitionHUME-VBConcordance correlation coefficient (CCC)0.6017w2v2-mtl-chain
Cultural Vocal Bursts Intensity PredictionHUME-VBConcordance correlation coefficient (CCC)0.6017w2v2-mtl-chain
Speech Emotion RecognitionHUME-VBConcordance correlation coefficient (CCC)0.7237w2v2-mtl-chain
Speech Emotion RecognitionHUME-VBConcordance correlation coefficient (CCC)0.6854w2v2-mtl-chain
Speech Emotion RecognitionHUME-VBConcordance correlation coefficient (CCC)0.6017w2v2-mtl-chain

Related Papers

Language Integration in Fine-Tuning Multimodal Large Language Models for Image-Based Regression2025-07-20A Semi-Supervised Learning Method for the Identification of Bad Exposures in Large Imaging Surveys2025-07-17Neural Network-Guided Symbolic Regression for Interpretable Descriptor Discovery in Perovskite Catalysts2025-07-16Imbalanced Regression Pipeline Recommendation2025-07-16Second-Order Bounds for [0,1]-Valued Regression via Betting Loss2025-07-16Sparse Regression Codes exploit Multi-User Diversity without CSI2025-07-15Self-supervised Learning on Camera Trap Footage Yields a Strong Universal Face Embedder2025-07-14Bradley-Terry and Multi-Objective Reward Modeling Are Complementary2025-07-10