Comparing supervised and self-supervised embedding for ExVo Multi-Task learning track

06/23/2022
by   Tilak Purohit, et al.
0

The ICML Expressive Vocalizations (ExVo) Multi-task challenge 2022, focuses on understanding the emotional facets of the non-linguistic vocalizations (vocal bursts (VB)). The objective of this challenge is to predict emotional intensities for VB, being a multi-task challenge it also requires to predict speakers' age and native-country. For this challenge we study and compare two distinct embedding spaces namely, self-supervised learning (SSL) based embeddings and task-specific supervised learning based embeddings. Towards that, we investigate feature representations obtained from several pre-trained SSL neural networks and task-specific supervised classification neural networks. Our studies show that the best performance is obtained with a hybrid approach, where predictions derived via both SSL and task-specific supervised learning are used. Our best system on test-set surpasses the ComPARE baseline (harmonic mean of all sub-task scores i.e., S_MTL) by a relative 13% margin.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/20/2022

End-to-End and Self-Supervised Learning for ComParE 2022 Stuttering Sub-Challenge

In this paper, we present end-to-end and speech embedding based systems ...
research
03/30/2020

Improving out-of-distribution generalization via multi-task self-supervised pretraining

Self-supervised feature representations have been shown to be useful for...
research
06/24/2022

Burst2Vec: An Adversarial Multi-Task Approach for Predicting Emotion, Age, and Origin from Vocal Bursts

We present Burst2Vec, our multi-task learning approach to predict emotio...
research
11/10/2020

UmBERTo-MTSA @ AcCompl-It: Improving Complexity and Acceptability Prediction with Multi-task Learning on Self-Supervised Annotations

This work describes a self-supervised data augmentation approach used to...
research
09/10/2023

Continual Robot Learning using Self-Supervised Task Inference

Endowing robots with the human ability to learn a growing set of skills ...
research
08/06/2020

Aalto's End-to-End DNN systems for the INTERSPEECH 2020 Computational Paralinguistics Challenge

End-to-end neural network models (E2E) have shown significant performanc...
research
06/25/2022

Self-supervision and Learnable STRFs for Age, Emotion, and Country Prediction

This work presents a multitask approach to the simultaneous estimation o...

Please sign up or login with your details

Forgot password? Click here to reset