Augmenting Generative Adversarial Networks for Speech Emotion Recognition

05/18/2020
by   Siddique Latif, et al.
0

Generative adversarial networks (GANs) have shown potential in learning emotional attributes and generating new data samples. However, their performance is usually hindered by the unavailability of larger speech emotion recognition (SER) data. In this work, we propose a framework that utilises the mixup data augmentation scheme to augment the GAN in feature learning and generation. To show the effectiveness of the proposed framework, we present results for SER on (i) synthetic feature vectors, (ii) augmentation of the training data with synthetic features, (iii) encoded features in compressed representation. Our results show that the proposed framework can effectively learn compressed emotional representations as well as it can generate synthetic samples that help improve performance in within-corpus and cross-corpus evaluation.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/19/2023

A Preliminary Study on Augmenting Speech Emotion Recognition using a Diffusion Model

In this paper, we propose to utilise diffusion models for data augmentat...
research
10/31/2019

Modeling Feature Representations for Affective Speech using Generative Adversarial Networks

Emotion recognition is a classic field of research with a typical setup ...
research
06/18/2018

On Enhancing Speech Emotion Recognition using Generative Adversarial Networks

Generative Adversarial Networks (GANs) have gained a lot of attention fr...
research
06/06/2018

Adversarial Auto-encoders for Speech Based Emotion Recognition

Recently, generative adversarial networks and adversarial autoencoders h...
research
11/16/2022

Data Augmentation with Unsupervised Speaking Style Transfer for Speech Emotion Recognition

Currently, the performance of Speech Emotion Recognition (SER) systems i...
research
09/18/2021

Hybrid Data Augmentation and Deep Attention-based Dilated Convolutional-Recurrent Neural Networks for Speech Emotion Recognition

Speech emotion recognition (SER) has been one of the significant tasks i...

Please sign up or login with your details

Forgot password? Click here to reset