Fixed-MAML for Few Shot Classification in Multilingual Speech Emotion Recognition

01/05/2021
by   Anugunj Naman, et al.
0

In this paper, we analyze the feasibility of applying few-shot learning to speech emotion recognition task (SER). The current speech emotion recognition models work exceptionally well but fail when then input is multilingual. Moreover, when training such models, the models' performance is suitable only when the training corpus is vast. This availability of a big training corpus is a significant problem when choosing a language that is not much popular or obscure. We attempt to solve this challenge of multilingualism and lack of available data by turning this problem into a few-shot learning problem. We suggest relaxing the assumption that all N classes in an N-way K-shot problem be new and define an N+F way problem where N and F are the number of emotion classes and predefined fixed classes, respectively. We propose this modification to the Model-Agnostic MetaLearning (MAML) algorithm to solve the problem and call this new model F-MAML. This modification performs better than the original MAML and outperforms on EmoFilm dataset.

READ FULL TEXT
research
03/18/2020

Cross Lingual Cross Corpus Speech Emotion Recognition

The majority of existing speech emotion recognition models are trained a...
research
05/17/2018

Convolutional Attention Networks for Multimodal Emotion Recognition from Speech and Text Data

Emotion recognition has become a popular topic of interest, especially i...
research
08/02/2021

The Role of Phonetic Units in Speech Emotion Recognition

We propose a method for emotion recognition through emotiondependent spe...
research
09/20/2023

Ensembling Multilingual Pre-Trained Models for Predicting Multi-Label Regression Emotion Share from Speech

Speech emotion recognition has evolved from research to practical applic...
research
08/21/2023

Refashioning Emotion Recognition Modelling: The Advent of Generalised Large Models

After the inception of emotion recognition or affective computing, it ha...
research
09/07/2021

Few-shot Learning in Emotion Recognition of Spontaneous Speech Using a Siamese Neural Network with Adaptive Sample Pair Formation

Speech-based machine learning (ML) has been heralded as a promising solu...
research
11/14/2021

Speech Emotion Recognition System by Quaternion Nonlinear Echo State Network

The echo state network (ESN) is a powerful and efficient tool for displa...

Please sign up or login with your details

Forgot password? Click here to reset