A Methodology for Controlling the Emotional Expressiveness in Synthetic Speech – a Deep Learning approach

07/05/2019
by Noé Tits, et al.

In this project, we aim to build a Text-to-Speech (TTS) system able to produce speech with controllable emotional expressiveness. We propose a methodology for solving this problem in three main steps. The first is the collection of emotional speech data; we discuss the various formats of existing datasets and their usability for speech generation. The second is the development of a system that automatically annotates data with emotion/expressiveness features. We compare several techniques that use transfer learning to extract such a representation through other tasks, and we propose a method to visualize and interpret the correlation between vocal and emotional features. The third is the development of a deep learning-based system that takes text and emotion/expressiveness as input and produces speech as output. We study the impact of fine-tuning from a neutral TTS towards an emotional TTS in terms of intelligibility and perception of the emotion.


research
09/25/2017

Towards Indonesian Speech-Emotion Automatic Recognition (I-SpEAR)

Even though speech-emotion recognition (SER) has been receiving much att...
research
05/23/2023

ZET-Speech: Zero-shot adaptive Emotion-controllable Text-to-Speech Synthesis with Diffusion and Style-based Models

Emotional Text-To-Speech (TTS) is an important task in the development o...
research
01/14/2019

Exploring Transfer Learning for Low Resource Emotional TTS

During the last few years, spoken language technologies have known a big...
research
11/12/2009

Emotion : modèle d'appraisal-coping pour le problème des Cascades

Modeling emotion has become a challenge nowadays. Therefore, several mod...
research
02/24/2020

Emosaic: Visualizing Affective Content of Text at Varying Granularity

This paper presents Emosaic, a tool for visualizing the emotional tone o...
research
11/12/2009

Emotion: Appraisal-coping model for the "Cascades" problem

Modelling emotion has become a challenge nowadays. Therefore, several mo...
research
11/26/2019

A Time Series Analysis of Emotional Loading in Central Bank Statements

We examine the affective content of central bank press statements using ...
