Building African Voices

07/01/2022
by   Perez Ogayo, et al.
10

Modern speech synthesis techniques can produce natural-sounding speech given sufficient high-quality data and compute resources. However, such data is not readily available for many languages. This paper focuses on speech synthesis for low-resourced African languages, from corpus creation to sharing and deploying the Text-to-Speech (TTS) systems. We first create a set of general-purpose instructions on building speech synthesis systems with minimum technological resources and subject-matter expertise. Next, we create new datasets and curate datasets from "found" data (existing recordings) through a participatory approach while considering accessibility, quality, and breadth. We demonstrate that we can develop synthesizers that generate intelligible speech with 25 minutes of created speech, even when recorded in suboptimal environments. Finally, we release the speech data, code, and trained voices for 12 African languages to support researchers and developers.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/15/2018

Tools and resources for Romanian text-to-speech and speech-to-text applications

In this paper we introduce a set of resources and tools aimed at providi...
research
05/18/2022

Macedonian Speech Synthesis for Assistive Technology Applications

Speech technology is becoming ever more ubiquitous with the advance of s...
research
02/08/2016

LSTM Deep Neural Networks Postfiltering for Improving the Quality of Synthetic Voices

Recent developments in speech synthesis have produced systems capable of...
research
07/07/2022

BibleTTS: a large, high-fidelity, multilingual, and uniquely African speech corpus

BibleTTS is a large, high-quality, open speech dataset for ten languages...
research
05/31/2022

Preparing an Endangered Language for the Digital Age: The Case of Judeo-Spanish

We develop machine translation and speech synthesis systems to complemen...
research
12/13/2017

Creating New Language and Voice Components for the Updated MaryTTS Text-to-Speech Synthesis Platform

We present a new workflow to create components for the MaryTTS text-to-s...
research
10/28/2020

Speech Synthesis and Control Using Differentiable DSP

Modern text-to-speech systems are able to produce natural and high-quali...

Please sign up or login with your details

Forgot password? Click here to reset