Log In Sign Up

Emotion Recognition In Persian Speech Using Deep Neural Networks

by   Ali Yazdani, et al.

Speech Emotion Recognition (SER) is of great importance in Human-Computer Interaction (HCI), as it provides a deeper understanding of the situation and results in better interaction. In recent years, various machine learning and deep learning algorithms have been developed to improve SER techniques. Recognition of emotions depends on the type of expression that varies between different languages. In this article, to further study this important factor in Farsi, we examine various deep learning techniques on the SheEMO dataset. Using signal features in low- and high-level descriptions and different deep networks and machine learning techniques, Unweighted Average Recall (UAR) of 65.20 is achieved with an accuracy of 78.29.


page 1

page 2

page 3

page 4


Emotion Recognition in Audio and Video Using Deep Neural Networks

Humans are able to comprehend information from multiple domains for e.g....

A breakthrough in Speech emotion recognition using Deep Retinal Convolution Neural Networks

Speech emotion recognition (SER) is to study the formation and change of...

Effect of Deep Learning Feature Inference Techniques on Respiratory Sounds

Analysis of respiratory sounds increases its importance every day. Many ...

Deep Residual Local Feature Learning for Speech Emotion Recognition

Speech Emotion Recognition (SER) is becoming a key role in global busine...

Towards Empathetic Human-Robot Interactions

Since the late 1990s when speech companies began providing their custome...

Speech Emotion Recognition using Semantic Information

Speech emotion recognition is a crucial problem manifesting in a multitu...