Deep Residual Local Feature Learning for Speech Emotion Recognition

11/19/2020
by   Sattaya Singkul, et al.
0

Speech Emotion Recognition (SER) is becoming a key role in global business today to improve service efficiency, like call center services. Recent SERs were based on a deep learning approach. However, the efficiency of deep learning depends on the number of layers, i.e., the deeper layers, the higher efficiency. On the other hand, the deeper layers are causes of a vanishing gradient problem, a low learning rate, and high time-consuming. Therefore, this paper proposed a redesign of existing local feature learning block (LFLB). The new design is called a deep residual local feature learning block (DeepResLFLB). DeepResLFLB consists of three cascade blocks: LFLB, residual local feature learning block (ResLFLB), and multilayer perceptron (MLP). LFLB is built for learning local correlations along with extracting hierarchical correlations; DeepResLFLB can take advantage of repeatedly learning to explain more detail in deeper layers using residual learning for solving vanishing gradient and reducing overfitting; and MLP is adopted to find the relationship of learning and discover probability for predicted speech emotions and gender types. Based on two available published datasets: EMODB and RAVDESS, the proposed DeepResLFLB can significantly improve performance when evaluated by standard metrics: accuracy, precision, recall, and F1-score.

READ FULL TEXT
research
04/28/2022

Emotion Recognition In Persian Speech Using Deep Neural Networks

Speech Emotion Recognition (SER) is of great importance in Human-Compute...
research
01/23/2020

Fast Dense Residual Network: Enhancing Global Dense Feature Flow for Text Recognition

Deep Convolutional Neural Networks (CNNs), such as Dense Convolutional N...
research
10/31/2018

Deep Net Features for Complex Emotion Recognition

This paper investigates the influence of different acoustic features, au...
research
07/23/2022

Multimodal Emotion Recognition with Modality-Pairwise Unsupervised Contrastive Loss

Emotion recognition is involved in several real-world applications. With...
research
04/12/2022

ADFF: Attention Based Deep Feature Fusion Approach for Music Emotion Recognition

Music emotion recognition (MER), a sub-task of music information retriev...
research
06/03/2023

LDEB – Label Digitization with Emotion Binarization and Machine Learning for Emotion Recognition in Conversational Dialogues

Emotion recognition in conversations (ERC) is vital to the advancements ...
research
05/26/2020

Is deeper better? It depends on locality of relevant features

It has been recognized that a heavily overparameterized artificial neura...

Please sign up or login with your details

Forgot password? Click here to reset