Label Uncertainty Modeling and Prediction for Speech Emotion Recognition using t-Distributions

07/25/2022
by   Navin Raj Prabhu, et al.
0

As different people perceive others' emotional expressions differently, their annotation in terms of arousal and valence are per se subjective. To address this, these emotion annotations are typically collected by multiple annotators and averaged across annotators in order to obtain labels for arousal and valence. However, besides the average, also the uncertainty of a label is of interest, and should also be modeled and predicted for automatic emotion recognition. In the literature, for simplicity, label uncertainty modeling is commonly approached with a Gaussian assumption on the collected annotations. However, as the number of annotators is typically rather small due to resource constraints, we argue that the Gaussian approach is a rather crude assumption. In contrast, in this work we propose to model the label distribution using a Student's t-distribution which allows us to account for the number of annotations available. With this model, we derive the corresponding Kullback-Leibler divergence based loss function and use it to train an estimator for the distribution of emotion labels, from which the mean and uncertainty can be inferred. Through qualitative and quantitative analysis, we show the benefits of the t-distribution over a Gaussian distribution. We validate our proposed method on the AVEC'16 dataset. Results reveal that our t-distribution based approach improves over the Gaussian approach with state-of-the-art uncertainty modeling results in speech-based emotion recognition, along with an optimal and even faster convergence.

READ FULL TEXT
research
09/30/2022

End-to-End Label Uncertainty Modeling in Speech Emotion Recognition using Bayesian Neural Networks and Label Distribution Learning

To train machine learning algorithms to predict emotional expressions in...
research
10/07/2021

End-to-end label uncertainty modeling for speech emotion recognition using Bayesian neural networks

Emotions are subjective constructs. Recent end-to-end speech emotion rec...
research
11/09/2022

Distribution-based Emotion Recognition in Conversation

Automatic emotion recognition in conversation (ERC) is crucial for emoti...
research
03/27/2019

MuSE-ing on the Impact of Utterance Ordering On Crowdsourced Emotion Annotations

Emotion recognition algorithms rely on data annotated with high quality ...
research
06/11/2023

Estimating the Uncertainty in Emotion Attributes using Deep Evidential Regression

In automatic emotion recognition (AER), labels assigned by different hum...
research
11/03/2022

Speech-based emotion recognition with self-supervised models using attentive channel-wise correlations and label smoothing

When recognizing emotions from speech, we encounter two common problems:...
research
06/14/2023

Continuous Learning Based Novelty Aware Emotion Recognition System

Current works in human emotion recognition follow the traditional closed...

Please sign up or login with your details

Forgot password? Click here to reset