Read it to me: An emotionally aware Speech Narration Application

09/06/2022

In this work we attempt emotional style transfer on speech audio. In particular, we explore the MelGAN-VC architecture for several emotion-pair transfers. The generated audio is then classified with an LSTM-based audio emotion classifier. We find that "sad" audio is generated better than "happy" or "anger" audio, since people express sadness in similar ways.
