Digital Einstein Experience: Fast Text-to-Speech for Conversational AI

07/21/2021
by   Joanna Rownicka, et al.
10

We describe our approach to create and deliver a custom voice for a conversational AI use-case. More specifically, we provide a voice for a Digital Einstein character, to enable human-computer interaction within the digital conversation experience. To create the voice which fits the context well, we first design a voice character and we produce the recordings which correspond to the desired speech attributes. We then model the voice. Our solution utilizes Fastspeech 2 for log-scaled mel-spectrogram prediction from phonemes and Parallel WaveGAN to generate the waveforms. The system supports a character input and gives a speech waveform at the output. We use a custom dictionary for selected words to ensure their proper pronunciation. Our proposed cloud architecture enables for fast voice delivery, making it possible to talk to the digital version of Albert Einstein in real-time.

READ FULL TEXT
research
08/04/2023

Let's Give a Voice to Conversational Agents in Virtual Reality

The dialogue experience with conversational agents can be greatly enhanc...
research
04/29/2020

FitChat: Conversational Artificial Intelligence Interventions for Encouraging Physical Activity in Older Adults

Delivery of digital behaviour change interventions which encourage physi...
research
05/02/2022

How does a spontaneously speaking conversational agent affect user behavior?

This study investigated the effect of synthetic voice of conversational ...
research
11/02/2021

Dehumanizing Voice Technology: Phonetic Experiential Consequences of Restricted Human-Machine Interaction

The use of natural language and voice-based interfaces gradu-ally transf...
research
10/27/2017

Detection and Analysis of Human Emotions through Voice and Speech Pattern Processing

The ability to modulate vocal sounds and generate speech is one of the f...
research
08/24/2023

Real-time Detection of AI-Generated Speech for DeepFake Voice Conversion

There are growing implications surrounding generative AI in the speech d...
research
04/19/2023

Affective social anthropomorphic intelligent system

Human conversational styles are measured by the sense of humor, personal...

Please sign up or login with your details

Forgot password? Click here to reset