Optimizing Speech Emotion Recognition using Manta-Ray Based Feature Selection

09/18/2020
by   Soham Chattopadhyay, et al.
0

Emotion recognition from audio signals has been regarded as a challenging task in signal processing as it can be considered as a collection of static and dynamic classification tasks. Recognition of emotions from speech data has been heavily relied upon end-to-end feature extraction and classification using machine learning models, though the absence of feature selection and optimization have restrained the performance of these methods. Recent studies have shown that Mel Frequency Cepstral Coefficients (MFCC) have been emerged as one of the most relied feature extraction methods, though it circumscribes the accuracy of classification with a very small feature dimension. In this paper, we propose that the concatenation of features, extracted by using different existing feature extraction methods can not only boost the classification accuracy but also expands the possibility of efficient feature selection. We have used Linear Predictive Coding (LPC) apart from the MFCC feature extraction method, before feature merging. Besides, we have performed a novel application of Manta Ray optimization in speech emotion recognition tasks that resulted in a state-of-the-art result in this field. We have evaluated the performance of our model using SAVEE and Emo-DB, two publicly available datasets. Our proposed method outperformed all the existing methods in speech emotion analysis and resulted in a decent result in these two datasets with a classification accuracy of 97.06

READ FULL TEXT
research
08/19/2022

Feature Selection Enhancement and Feature Space Visualization for Speech-Based Emotion Recognition

Robust speech emotion recognition relies on the quality of the speech fe...
research
08/28/2019

Emotion Recognition in Low-Resource Settings: An Evaluation of Automatic Feature Selection Methods

Research in automatic emotion recognition has seldom addressed the issue...
research
11/11/2020

WaDeNet: Wavelet Decomposition based CNN for Speech Processing

Existing speech processing systems consist of different modules, individ...
research
02/02/2020

Effect of Analysis Window and Feature Selection on Classification of Hand Movements Using EMG Signal

Electromyography (EMG) signals have been successfully employed for drivi...
research
05/23/2019

forgeNet: A graph deep neural network model using tree-based ensemble classifiers for feature extraction

A unique challenge in predictive model building for omics data has been ...
research
01/18/2022

Selecting and combining complementary feature representations and classifiers for hate speech detection

Hate speech is a major issue in social networks due to the high volume o...

Please sign up or login with your details

Forgot password? Click here to reset