Macedonian Speech Synthesis for Assistive Technology Applications

05/18/2022
by   Bojan Sofronievski, et al.
0

Speech technology is becoming ever more ubiquitous with the advance of speech enabled devices and services. The use of speech synthesis in Augmentative and Alternative Communication tools, has facilitated inclusion of individuals with speech impediments allowing them to communicate with their surroundings using speech. Although there are numerous speech synthesis systems for the most spoken world languages, there is still a limited offer for smaller languages. We propose and compare three models built using parametric and deep learning techniques for Macedonian trained on a newly recorded corpus. We target low-resource edge deployment for Augmentative and Alternative Communication and assistive technologies, such as communication boards and screen readers. The listening test results show that parametric speech synthesis is as performant compared to the more advanced deep learning models. Since it also requires less resources, and offers full speech rate and pitch control, it is the preferred choice for building a Macedonian TTS system for this application scenario.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/01/2022

Building African Voices

Modern speech synthesis techniques can produce natural-sounding speech g...
research
12/23/2020

Speech Synthesis as Augmentation for Low-Resource ASR

Speech synthesis might hold the key to low-resource speech recognition. ...
research
11/17/2018

Representation Mixing for TTS Synthesis

Recent character and phoneme-based parametric TTS systems using deep lea...
research
04/20/2021

Review of end-to-end speech synthesis technology based on deep learning

As an indispensable part of modern human-computer interaction system, sp...
research
04/27/2021

Using Radio Archives for Low-Resource Speech Recognition: Towards an Intelligent Virtual Assistant for Illiterate Users

For many of the 700 million illiterate people around the world, speech r...
research
10/28/2017

JSUT corpus: free large-scale Japanese speech corpus for end-to-end speech synthesis

Thanks to improvements in machine learning techniques including deep lea...
research
01/22/2016

Speech vocoding for laboratory phonology

Using phonological speech vocoding, we propose a platform for exploring ...

Please sign up or login with your details

Forgot password? Click here to reset