LSSED: a large-scale dataset and benchmark for speech emotion recognition

01/30/2021
by Weiquan Fan, et al.

Speech emotion recognition is a vital contributor to the next generation of human-computer interaction (HCI). However, existing small-scale databases have limited the development of related research. In this paper, we present LSSED, a challenging large-scale English speech emotion dataset with data collected from 820 subjects to simulate real-world distribution. In addition, we release some pre-trained models based on LSSED, which can not only promote the development of speech emotion recognition but can also be transferred to related downstream tasks such as mental health analysis, where data is extremely difficult to collect. Finally, our experiments show the necessity of large-scale datasets and the effectiveness of pre-trained models. The dataset will be released at https://github.com/tobefans/LSSED.

research
11/29/2019

Bimodal Speech Emotion Recognition Using Pre-Trained Language Models

Speech emotion recognition is a challenging task and an important step t...
research
09/20/2023

Ensembling Multilingual Pre-Trained Models for Predicting Multi-Label Regression Emotion Share from Speech

Speech emotion recognition has evolved from research to practical applic...
research
02/26/2023

Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations

Fueled by recent advances of self-supervised models, pre-trained speech ...
research
05/18/2023

TrustSER: On the Trustworthiness of Fine-tuning Pre-trained Speech Embeddings For Speech Emotion Recognition

Recent studies have explored the use of pre-trained embeddings for speec...
research
05/22/2023

Learning Emotion Representations from Verbal and Nonverbal Communication

Emotion understanding is an essential but highly challenging component o...
research
08/07/2018

Contemplating Visual Emotions: Understanding and Overcoming Dataset Bias

While machine learning approaches to visual emotion recognition offer gr...
research
04/05/2022

Learning Speech Emotion Representations in the Quaternion Domain

The modeling of human emotion expression in speech signals is an importa...
