Designing and Evaluating Speech Emotion Recognition Systems: A reality check case study with IEMOCAP

04/03/2023
by Nikolaos Antoniou, et al.

There is a pressing need for guidelines and standard test sets to allow direct and fair comparisons of speech emotion recognition (SER) systems. While resources such as the Interactive Emotional Dyadic Motion Capture (IEMOCAP) database have emerged as widely adopted reference corpora for developing and testing SER models, published work reveals a wide range of assumptions and considerable variety in how the database is used, which challenges reproducibility and generalization. Based on a critical review of the latest advances in SER, with IEMOCAP as the use case, our work makes two contributions. First, from an analysis of the recent literature, including the assumptions made and the metrics used therein, we provide a set of SER evaluation guidelines. Second, using recent publications with open-sourced implementations, we assess reproducibility in SER.
