Generating Diverse Vocal Bursts with StyleGAN2 and MEL-Spectrograms

06/25/2022
by   Marco Jiralerspong, et al.
0

We describe our approach for the generative emotional vocal burst task (ExVo Generate) of the ICML Expressive Vocalizations Competition. We train a conditional StyleGAN2 architecture on mel-spectrograms of preprocessed versions of the audio samples. The mel-spectrograms generated by the model are then inverted back to the audio domain. As a result, our generated samples substantially improve upon the baseline provided by the competition from a qualitative and quantitative perspective for all emotions. More precisely, even for our worst-performing emotion (awe), we obtain an FAD of 1.76 compared to the baseline of 4.81 (as a reference, the FAD between the train/validation sets for awe is 0.776).

READ FULL TEXT

page 2

page 3

research
05/03/2022

The ICML 2022 Expressive Vocalizations Workshop and Competition: Recognizing, Generating, and Personalizing Vocal Bursts

The ICML Expressive Vocalization (ExVo) Competition is focused on unders...
research
07/14/2022

Proceedings of the ICML 2022 Expressive Vocalizations Workshop and Competition: Recognizing, Generating, and Personalizing Vocal Bursts

This is the Proceedings of the ICML Expressive Vocalization (ExVo) Compe...
research
09/06/2022

Read it to me: An emotionally aware Speech Narration Application

In this work we try to perform emotional style transfer on audios. In pa...
research
11/11/2017

MojiTalk: Generating Emotional Responses at Scale

Generating emotional language is a key step towards building empathetic ...
research
04/05/2020

Emotional Video to Audio Transformation Using Deep Recurrent Neural Networks and a Neuro-Fuzzy System

Generating music with emotion similar to that of an input video is a ver...
research
03/01/2023

READ Avatars: Realistic Emotion-controllable Audio Driven Avatars

We present READ Avatars, a 3D-based approach for generating 2D avatars t...
research
11/11/2021

Improvements to short-term weather prediction with recurrent-convolutional networks

The Weather4cast 2021 competition gave the participants a task of predic...

Please sign up or login with your details

Forgot password? Click here to reset