Learning Transferable Features for Speech Emotion Recognition

12/23/2019
by Alison Marczewski, et al.

Emotion recognition from speech is one of the key steps towards emotional intelligence in advanced human-machine interaction. Identifying emotions in human speech requires learning features that are robust and discriminative across diverse domains that differ in terms of language, spontaneity of speech, recording conditions, and types of emotions. This corresponds to a learning scenario in which the joint distributions of features and labels may change substantially across domains. In this paper, we propose a deep architecture that jointly exploits a convolutional network for extracting domain-shared features and a long short-term memory network for classifying emotions using domain-specific features. We use transferable features to enable model adaptation from multiple source domains, given the sparseness of speech emotion data and the fact that target domains are short of labeled data. A comprehensive cross-corpora experiment with diverse speech emotion domains reveals that transferable features provide gains starting at 4.3% in speech emotion recognition. We evaluate several domain adaptation approaches, and we perform an ablation study to understand which source domains add the most to the overall recognition effectiveness for a given target domain.
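
To make the architecture in the abstract more concrete, the following is a minimal NumPy sketch of a forward pass with one shared convolutional feature extractor and one LSTM classifier per domain. It is not the authors' implementation: reading "domain-specific" as one LSTM head per corpus is an interpretation of the abstract, and the weights are random and untrained. The input shape (50 frames of 40 spectral bands), the layer sizes, the four emotion classes, and the names `source_corpus` and `target_corpus` are all assumptions chosen for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

def conv1d_relu(x, kernels, bias):
    """Shared extractor. x: (T, F) frames; kernels: (K, F, C).
    Returns a (T - K + 1, C) feature sequence after ReLU."""
    k = kernels.shape[0]
    steps = x.shape[0] - k + 1
    out = np.stack([
        np.tensordot(x[t:t + k], kernels, axes=([0, 1], [0, 1])) + bias
        for t in range(steps)
    ])
    return np.maximum(out, 0.0)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_last_state(seq, W, U, b):
    """Standard LSTM over seq (T, C); returns the final hidden state (H,)."""
    hidden = U.shape[0]
    h = np.zeros(hidden)
    c = np.zeros(hidden)
    for x_t in seq:
        i, f, g, o = np.split(x_t @ W + h @ U + b, 4)
        c = sigmoid(f) * c + sigmoid(i) * np.tanh(g)
        h = sigmoid(o) * np.tanh(c)
    return h

def init_head(channels, hidden, n_classes):
    """One LSTM + output layer per domain (assumed design)."""
    return {
        "W": rng.normal(0.0, 0.1, (channels, 4 * hidden)),
        "U": rng.normal(0.0, 0.1, (hidden, 4 * hidden)),
        "b": np.zeros(4 * hidden),
        "V": rng.normal(0.0, 0.1, (hidden, n_classes)),
        "d": np.zeros(n_classes),
    }

def predict(x, shared, head):
    """Shared conv features -> domain-specific LSTM -> class probabilities."""
    feats = conv1d_relu(x, shared["kernels"], shared["bias"])
    h = lstm_last_state(feats, head["W"], head["U"], head["b"])
    logits = h @ head["V"] + head["d"]
    exp = np.exp(logits - logits.max())
    return exp / exp.sum()

# Hypothetical sizes: 40 spectral bands, kernel width 5, 8 channels,
# 16 LSTM units, 4 emotion classes.
shared = {"kernels": rng.normal(0.0, 0.1, (5, 40, 8)), "bias": np.zeros(8)}
heads = {name: init_head(8, 16, 4) for name in ("source_corpus", "target_corpus")}

utterance = rng.normal(size=(50, 40))  # stand-in for real acoustic features
features = conv1d_relu(utterance, shared["kernels"], shared["bias"])
probs = predict(utterance, shared, heads["target_corpus"])
```

In this sketch the `shared` parameters would be trained on all corpora, while each entry of `heads` would be fitted only on its own corpus; the training loop and any domain adaptation loss are deliberately left out.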


