Detecting Emotion Primitives from Speech and their use in discerning Categorical Emotions

01/31/2020
by Vasudha Kowtha, et al.

Emotion plays an essential role in human-to-human communication, enabling us to convey feelings such as happiness, frustration, and sincerity. While modern speech technologies rely heavily on speech recognition and natural language understanding to interpret spoken content, the study of vocal expression is gaining increasing attention. A key consideration in building robust emotion models is characterizing and improving how well a model, given its training data distribution, generalizes to unseen data conditions. This work investigated a long short-term memory (LSTM) network and a time-convolution LSTM (TC-LSTM) network for detecting primitive emotion attributes such as valence, arousal, and dominance from speech. Training with multiple datasets and using robust features improved the concordance correlation coefficient (CCC) for valence by 30% with respect to the baseline system. Additionally, this work investigated how emotion primitives can be used to distinguish categorical emotions such as happiness, disgust, contempt, anger, and surprise from neutral speech. The results indicated that arousal, followed by dominance, was the better detector of such emotions.
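The abstract reports results as a concordance correlation coefficient (CCC) but does not spell out the metric. The sketch below follows the commonly used definition, CCC = 2·cov(x, y) / (var(x) + var(y) + (mean(x) − mean(y))²). It is an illustration only, not the authors' evaluation code: the function name `concordance_cc` is hypothetical, and the use of population (divide-by-n) statistics is an assumption.

```python
from statistics import fmean


def concordance_cc(predicted, reference):
    """Concordance correlation coefficient between two equal-length sequences.

    Hypothetical helper for illustration; uses population (divide-by-n)
    variance and covariance, which is an assumption, not a detail from the paper.
    """
    if len(predicted) != len(reference) or len(predicted) < 2:
        raise ValueError("inputs must have the same length (at least 2)")

    n = len(predicted)
    mean_p = fmean(predicted)
    mean_r = fmean(reference)

    # Population variances and covariance.
    var_p = sum((p - mean_p) ** 2 for p in predicted) / n
    var_r = sum((r - mean_r) ** 2 for r in reference) / n
    cov = sum((p - mean_p) * (r - mean_r)
              for p, r in zip(predicted, reference)) / n

    # The squared mean difference penalizes systematic bias,
    # which plain Pearson correlation would ignore.
    denom = var_p + var_r + (mean_p - mean_r) ** 2
    if denom == 0:
        raise ValueError("CCC is undefined when both inputs are the same constant")
    return 2 * cov / denom
```

Unlike Pearson correlation, CCC penalizes offset and scale mismatch: predictions `[1, 2, 3]` against references `[2, 3, 4]` are perfectly correlated, yet the constant offset of 1 lowers the CCC to 4/7 (about 0.571).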


