Towards Transferable Speech Emotion Representation: On loss functions for cross-lingual latent representations

03/28/2022
by   Sneha Das, et al.
0

In recent years, speech emotion recognition (SER) has been used in wide ranging applications, from healthcare to the commercial sector. In addition to signal processing approaches, methods for SER now also use deep learning techniques which provide transfer learning possibilities. However, generalizing over languages, corpora and recording conditions is still an open challenge. In this work we address this gap by exploring loss functions that aid in transferability, specifically to non-tonal languages. We propose a variational autoencoder (VAE) with KL annealing and a semi-supervised VAE to obtain more consistent latent embedding distributions across data sets. To ensure transferability, the distribution of the latent embedding should be similar across non-tonal languages (data sets). We start by presenting a low-complexity SER based on a denoising-autoencoder, which achieves an unweighted classification accuracy of over 52.09 This performance is comparable to that of similar baseline methods. Following this, we employ a VAE, the semi-supervised VAE and the VAE with KL annealing to obtain a more regularized latent space. We show that while the DAE has the highest classification accuracy among the methods, the semi-supervised VAE has a comparable classification accuracy and a more consistent latent embedding distribution over data sets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/05/2021

Towards Interpretable and Transferable Speech Emotion Recognition: Latent Representation Based Analysis of Features, Methods and Corpora

In recent years, speech emotion recognition (SER) has been used in wide ...
research
03/28/2022

Continuous Metric Learning For Transferable Speech Emotion Recognition and Embedding Across Low-resource Languages

Speech emotion recognition (SER) refers to the technique of inferring th...
research
11/13/2020

On the Transferability of VAE Embeddings using Relational Knowledge with Semi-Supervision

We propose a new model for relational VAE semi-supervision capable of ba...
research
11/17/2020

Semi-supervised Learning of Galaxy Morphology using Equivariant Transformer Variational Autoencoders

The growth in the number of galaxy images is much faster than the speed ...
research
01/23/2020

Semi-supervised Grasp Detection by Representation Learning in a Vector Quantized Latent Space

Determining quality grasps from an image is an important area of researc...
research
12/27/2022

Semi-supervised multiscale dual-encoding method for faulty traffic data detection

Inspired by the recent success of deep learning in multiscale informatio...
research
07/28/2020

Novel Potential Inhibitors Against SARS-CoV-2 Using Artificial Intelligence

Abstract Since known approved drugs like liponavir and ritonavir failed ...

Please sign up or login with your details

Forgot password? Click here to reset