DNN-HMM based Speaker Adaptive Emotion Recognition using Proposed Epoch and MFCC Features

06/04/2018
by   Md. Shah Fahad, et al.
0

Speech is produced when time varying vocal tract system is excited with time varying excitation source. Therefore, the information present in a speech such as message, emotion, language, speaker is due to the combined effect of both excitation source and vocal tract system. However, there is very less utilization of excitation source features to recognize emotion. In our earlier work, we have proposed a novel method to extract glottal closure instants (GCIs) known as epochs. In this paper, we have explored epoch features namely instantaneous pitch, phase and strength of epochs for discriminating emotions. We have combined the excitation source features and the well known Male-frequency cepstral coefficient (MFCC) features to develop an emotion recognition system with improved performance. DNN-HMM speaker adaptive models have been developed using MFCC, epoch and combined features. IEMOCAP emotional database has been used to evaluate the models. The average accuracy for emotion recognition system when using MFCC and epoch features separately is 59.25 54.52 and epoch features are combined.

READ FULL TEXT
research
02/12/2020

x-vectors meet emotions: A study on dependencies between emotion and speaker recognition

In this work, we explore the dependencies between speaker recognition an...
research
04/15/2021

Speaker Attentive Speech Emotion Recognition

Speech Emotion Recognition (SER) task has known significant improvements...
research
01/07/2022

A New Amharic Speech Emotion Dataset and Classification Benchmark

In this paper we present the Amharic Speech Emotion Dataset (ASED), whic...
research
12/22/2019

Emotion Recognition Using Wearables: A Systematic Literature Review Work in progress

Wearables like smartwatches or wrist bandsequipped with pervasive sensor...
research
10/26/2022

Effect of different splitting criteria on the performance of speech emotion recognition

Traditional speech emotion recognition (SER) evaluations have been perfo...
research
11/15/2021

Biologically inspired speech emotion recognition

Conventional feature-based classification methods do not apply well to a...
research
05/07/2023

Learning Robust Self-attention Features for Speech Emotion Recognition with Label-adaptive Mixup

Speech Emotion Recognition (SER) is to recognize human emotions in a nat...

Please sign up or login with your details

Forgot password? Click here to reset