Speech Synthesis with Mixed Emotions

08/11/2022
by   Kun Zhou, et al.
0

Emotional speech synthesis aims to synthesize human voices with various emotional effects. The current studies are mostly focused on imitating an averaged style belonging to a specific emotion type. In this paper, we seek to generate speech with a mixture of emotions at run-time. We propose a novel formulation that measures the relative difference between the speech samples of different emotions. We then incorporate our formulation into a sequence-to-sequence emotional text-to-speech framework. During the training, the framework does not only explicitly characterize emotion styles, but also explores the ordinal nature of emotions by quantifying the differences with other emotions. At run-time, we control the model to produce the desired emotion mixture by manually defining an emotion attribute vector. The objective and subjective evaluations have validated the effectiveness of the proposed framework. To our best knowledge, this research is the first study on modelling, synthesizing and evaluating mixed emotions in speech.

READ FULL TEXT

page 2

page 14

research
10/25/2022

Mixed Emotion Modelling for Emotional Voice Conversion

Emotional voice conversion (EVC) aims to convert the emotional state of ...
research
06/01/2023

EmoMix: Emotion Mixing via Diffusion Models for Emotional Speech Synthesis

There has been significant progress in emotional Text-To-Speech (TTS) sy...
research
11/26/2022

Contextual Expressive Text-to-Speech

The goal of expressive Text-to-speech (TTS) is to synthesize natural spe...
research
08/10/2021

A Novel Markovian Framework for Integrating Absolute and Relative Ordinal Emotion Information

There is growing interest in affective computing for the representation ...
research
09/27/2021

Emotional Speech Synthesis for Companion Robot to Imitate Professional Caregiver Speech

When people try to influence others to do something, they subconsciously...
research
10/24/2019

Detecting gender differences in perception of emotion in crowdsourced data

Do men and women perceive emotions differently? Popular convictions plac...
research
10/19/2022

Free energy model of emotional valence in dual-process perceptions

An appropriate level of arousal induces positive emotions, and a high ar...

Please sign up or login with your details

Forgot password? Click here to reset