Read it to me: An emotionally aware Speech Narration Application

09/06/2022
by   Rishibha Bansal, et al.
0

In this work we try to perform emotional style transfer on audios. In particular, MelGAN-VC architecture is explored for various emotion-pair transfers. The generated audio is then classified using an LSTM-based emotion classifier for audio. We find that "sad" audio is generated well as compared to "happy" or "anger" as people have similar expressions of sadness.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset