Personalized Adaptation with Pre-trained Speech Encoders for Continuous Emotion Recognition

09/05/2023
by Minh Tran, et al.

There are individual differences in expressive behaviors driven by cultural norms and personality. This between-person variation can result in reduced emotion recognition performance. Therefore, personalization is an important step in improving the generalization and robustness of speech emotion recognition. In this paper, to achieve unsupervised personalized emotion recognition, we first pre-train an encoder with learnable speaker embeddings in a self-supervised manner to learn robust speech representations conditioned on speakers. Second, we propose an unsupervised method to compensate for the label distribution shifts by finding similar speakers and leveraging their label distributions from the training set. Extensive experimental results on the MSP-Podcast corpus indicate that our method consistently outperforms strong personalization baselines and achieves state-of-the-art performance for valence estimation.
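The second step described above (finding similar training speakers and borrowing their label distributions) can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not say how similarity is measured or how the borrowed distributions are applied. The sketch assumes cosine similarity over speaker embeddings, averages the per-speaker label mean and standard deviation of the retrieved neighbors, and rescales the test speaker's predictions to match. The function names `retrieve_similar_speakers` and `shift_predictions` and the parameter `k` are hypothetical.

```python
import numpy as np


def retrieve_similar_speakers(test_emb, train_embs, k=3):
    """Return indices of the k training speakers closest to test_emb.

    Assumption: closeness is cosine similarity between speaker embeddings.
    """
    test_unit = test_emb / np.linalg.norm(test_emb)
    train_unit = train_embs / np.linalg.norm(train_embs, axis=1, keepdims=True)
    sims = train_unit @ test_unit          # one similarity per training speaker
    return np.argsort(-sims)[:k]           # indices sorted by descending similarity


def shift_predictions(preds, train_means, train_stds, neighbor_idx, eps=1e-8):
    """Rescale one speaker's predictions toward the neighbors' label statistics.

    Assumption: the label distribution is summarized by a mean and a standard
    deviation per training speaker, and neighbor statistics are averaged.
    """
    target_mean = train_means[neighbor_idx].mean()
    target_std = train_stds[neighbor_idx].mean()
    # Standardize the raw predictions, then map them onto the target statistics.
    z = (preds - preds.mean()) / (preds.std() + eps)
    return z * target_std + target_mean


if __name__ == "__main__":
    # Toy data: four training speakers with 2-D embeddings (illustrative only).
    train_embs = np.array([[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [-1.0, 0.0]])
    train_means = np.array([4.0, 5.0, 2.0, 3.0])
    train_stds = np.array([1.0, 1.0, 0.5, 0.5])

    test_emb = np.array([1.0, 0.05])
    raw_preds = np.array([3.0, 3.5, 4.0])

    idx = retrieve_similar_speakers(test_emb, train_embs, k=2)
    print(idx, shift_predictions(raw_preds, train_means, train_stds, idx))
```

No test-speaker labels are used anywhere in the sketch, which is what makes this kind of adjustment unsupervised; only training-set statistics and embeddings are consulted.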

