Applying Phonological Features in Multilingual Text-To-Speech

10/07/2021
by   Cong Zhang, et al.
0

This study investigates whether phonological features can be applied in text-to-speech systems to generate native and non-native speech in English and Mandarin. We present a mapping of ARPABET/pinyin to SAMPA/SAMPA-SC and then to phonological features. We tested whether this mapping could lead to the successful generation of native, non-native, and code-switched speech in the two languages. We ran two experiments, one with a small dataset and one with a larger dataset. The results proved that phonological features could be used as a feasible input system, although further investigation is needed to improve model performance. The accented output generated by the TTS models also helps with understanding human second language acquisition processes.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/14/2022

Applying Feature Underspecified Lexicon Phonological Features in Multilingual Text-to-Speech

This study investigates whether the phonological features derived from t...
research
04/16/2018

The Relevance of Text and Speech Features in Automatic Non-native English Accent Identification

This paper describes our experiments with automatically identifying nati...
research
01/31/2019

Rhythm Zone Theory: Speech Rhythms are Physical after all

Speech rhythms have been dealt with in three main ways: from the introsp...
research
05/08/2011

Taking the redpill: Artificial Evolution in native x86 systems

In analogon to successful artificial evolution simulations as Tierra or ...
research
09/02/2023

Bridge Diffusion Model: bridge non-English language-native text-to-image diffusion model with English communities

Text-to-Image generation (TTI) technologies are advancing rapidly, espec...
research
10/19/2022

A Data-Driven Investigation of Noise-Adaptive Utterance Generation with Linguistic Modification

In noisy environments, speech can be hard to understand for humans. Spok...
research
04/20/2019

Self-imitating Feedback Generation Using GAN for Computer-Assisted Pronunciation Training

Self-imitating feedback is an effective and learner-friendly method for ...

Please sign up or login with your details

Forgot password? Click here to reset