Integrating Recurrence Dynamics for Speech Emotion Recognition

11/09/2018
by   Efthymios Tzinis, et al.
0

We investigate the performance of features that can capture nonlinear recurrence dynamics embedded in the speech signal for the task of Speech Emotion Recognition (SER). Reconstruction of the phase space of each speech frame and the computation of its respective Recurrence Plot (RP) reveals complex structures which can be measured by performing Recurrence Quantification Analysis (RQA). These measures are aggregated by using statistical functionals over segment and utterance periods. We report SER results for the proposed feature set on three databases using different classification methods. When fusing the proposed features with traditional feature sets, we show an improvement in unweighted accuracy of up to 5.7 10.7 respectively, over the baseline. Following a segment-based approach we demonstrate state-of-the-art performance on IEMOCAP using a Bidirectional Recurrent Neural Network.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/03/2020

The Effect of Silence Feature in Dimensional Speech Emotion Recognition

Silence is a part of human-to-human communication, which can be a clue f...
research
04/14/2021

Unsupervised low-rank representations for speech emotion recognition

We examine the use of linear and non-linear dimensionality reduction alg...
research
06/02/2023

Learning Local to Global Feature Aggregation for Speech Emotion Recognition

Transformer has emerged in speech emotion recognition (SER) at present. ...
research
09/21/2023

The Broad Impact of Feature Imitation: Neural Enhancements Across Financial, Speech, and Physiological Domains

Initialization of neural network weights plays a pivotal role in determi...
research
04/25/2022

Real-time Speech Emotion Recognition Based on Syllable-Level Feature Extraction

Speech emotion recognition systems have high prediction latency because ...
research
10/08/2021

Affective Burst Detection from Speech using Kernel-fusion Dilated Convolutional Neural Networks

As speech-interfaces are getting richer and widespread, speech emotion r...
research
03/08/2022

SpeechFormer: A Hierarchical Efficient Framework Incorporating the Characteristics of Speech

Transformer has obtained promising results on cognitive speech signal pr...

Please sign up or login with your details

Forgot password? Click here to reset