SEMOUR: A Scripted Emotional Speech Repository for Urdu

05/19/2021
by   Nimra Zaheer, et al.
0

Designing reliable Speech Emotion Recognition systems is a complex task that inevitably requires sufficient data for training purposes. Such extensive datasets are currently available in only a few languages, including English, German, and Italian. In this paper, we present SEMOUR, the first scripted database of emotion-tagged speech in the Urdu language, to design an Urdu Speech Recognition System. Our gender-balanced dataset contains 15,040 unique instances recorded by eight professional actors eliciting a syntactically complex script. The dataset is phonetically balanced, and reliably exhibits a varied set of emotions as marked by the high agreement scores among human raters in experiments. We also provide various baseline speech emotion prediction scores on the database, which could be used for various applications like personalized robot assistants, diagnosis of psychological disorders, and getting feedback from a low-tech-enabled population, etc. On a random test sample, our model correctly predicts an emotion with a state-of-the-art 92 accuracy.

READ FULL TEXT

page 5

page 9

research
08/19/2022

Feature Selection Enhancement and Feature Space Visualization for Speech-Based Emotion Recognition

Robust speech emotion recognition relies on the quality of the speech fe...
research
01/20/2018

Gender-dependent emotion recognition based on HMMs and SPHMMs

It is well known that emotion recognition performance is not ideal. The ...
research
01/09/2021

Spanish expressive voices: Corpus for emotion research in spanish

A new emotional multimedia database has been recorded and aligned. The d...
research
03/27/2022

A Dataset for Speech Emotion Recognition in Greek Theatrical Plays

Machine learning methodologies can be adopted in cultural applications a...
research
01/07/2022

A New Amharic Speech Emotion Dataset and Classification Benchmark

In this paper we present the Amharic Speech Emotion Dataset (ASED), whic...
research
06/04/2019

ShEMO -- A Large-Scale Validated Database for Persian Speech Emotion Detection

This paper introduces a large-scale, validated database for Persian call...
research
08/16/2020

Computer-Generated Music for Tabletop Role-Playing Games

In this paper we present Bardo Composer, a system to generate background...

Please sign up or login with your details

Forgot password? Click here to reset