Arabic Text-To-Speech (TTS) Data Preparation

04/07/2022
by   Hala Al Masri, et al.
0

People may be puzzled by the fact that voice over recordings data sets exist in addition to Text-to-Speech (TTS), Synthesis system advancements, albeit this is not the case. The goal of this study is to explain the relevance of TTS as well as the data preparation procedures. TTS relies heavily on recorded data since it can have a substantial influence on the outcomes of TTS modules. Furthermore, whether the domain is specialized or general, appropriate data should be developed to address all predicted language variants and domains. Different recording methodologies, taking into account quality and behavior, may also be advantageous in the development of the module. In light of the lack of Arabic language in present synthesizing systems, numerous variables that impact the flow of recorded utterances are being considered in order to manipulate an Arabic TTS module. In this study, two viewpoints will be discussed: linguistics and the creation of high-quality recordings for TTS. The purpose of this work is to offer light on how ground-truth utterances may influence the evolution of speech systems in terms of naturalness, intelligibility, and understanding. Well provide voice actor specs as well as data specs that will assist both voice actors and voice coaches in the studio as well as the annotators who will be evaluating the audios.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/25/2019

The Zero Resource Speech Challenge 2019: TTS without T

We present the Zero Resource Speech Challenge 2019, which proposes to bu...
research
07/22/2020

A Transfer Learning End-to-End ArabicText-To-Speech (TTS) Deep Architecture

Speech synthesis is the artificial production of human speech. A typical...
research
02/28/2023

ClArTTS: An Open-Source Classical Arabic Text-to-Speech Corpus

At present, Text-to-speech (TTS) systems that are trained with high-qual...
research
05/24/2022

TDASS: Target Domain Adaptation Speech Synthesis Framework for Multi-speaker Low-Resource TTS

Recently, synthesizing personalized speech by text-to-speech (TTS) appli...
research
07/29/2023

ÌròyìnSpeech: A multi-purpose Yorùbá Speech Corpus

We introduce the ÌròyìnSpeech corpus – a new dataset influenced by a des...
research
09/14/2023

SingFake: Singing Voice Deepfake Detection

The rise of singing voice synthesis presents critical challenges to arti...
research
10/08/2016

A Semantic Analyzer for the Comprehension of the Spontaneous Arabic Speech

This work is part of a large research project entitled "Oréodule" aimed ...

Please sign up or login with your details

Forgot password? Click here to reset