Accounting for Variations in Speech Emotion Recognition with Nonparametric Hierarchical Neural Network

09/09/2021
by   Lance Ying, et al.
0

In recent years, deep-learning-based speech emotion recognition models have outperformed classical machine learning models. Previously, neural network designs, such as Multitask Learning, have accounted for variations in emotional expressions due to demographic and contextual factors. However, existing models face a few constraints: 1) they rely on a clear definition of domains (e.g. gender, noise condition, etc.) and the availability of domain labels; 2) they often attempt to learn domain-invariant features while emotion expressions can be domain-specific. In the present study, we propose the Nonparametric Hierarchical Neural Network (NHNN), a lightweight hierarchical neural network model based on Bayesian nonparametric clustering. In comparison to Multitask Learning approaches, the proposed model does not require domain/task labels. In our experiments, the NHNN models generally outperform the models with similar levels of complexity and state-of-the-art models in within-corpus and cross-corpus tests. Through clustering analysis, we show that the NHNN models are able to learn group-specific features and bridge the performance gap between groups.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/23/2019

A Personalized Affective Memory Neural Model for Improving Emotion Recognition

Recent models of emotion recognition strongly rely on supervised deep le...
research
12/23/2019

Learning Transferable Features for Speech Emotion Recognition

Emotion recognition from speech is one of the key steps towards emotiona...
research
04/12/2019

Multimodal Speech Emotion Recognition and Ambiguity Resolution

Identifying emotion from speech is a non-trivial task pertaining to the ...
research
02/17/2023

Deep Implicit Distribution Alignment Networks for Cross-Corpus Speech Emotion Recognition

In this paper, we propose a novel deep transfer learning method called d...
research
06/27/2022

SpeechEQ: Speech Emotion Recognition based on Multi-scale Unified Datasets and Multitask Learning

Speech emotion recognition (SER) has many challenges, but one of the mai...
research
04/20/2018

Domain Adversarial for Acoustic Emotion Recognition

The performance of speech emotion recognition is affected by the differe...
research
12/17/2021

Linguistic and Gender Variation in Speech Emotion Recognition using Spectral Features

This work explores the effect of gender and linguistic-based vocal varia...

Please sign up or login with your details

Forgot password? Click here to reset