Multilingual Speech Evaluation: Case Studies on English, Malay and Tamil

07/08/2021
by   Huayun Zhang, et al.

Speech evaluation is an essential component of computer-assisted language learning (CALL). While speech evaluation for English has been widely studied, automatic speech scoring for low-resource languages remains challenging. Prior work in this area has focused on monolingual, language-specific designs and on handcrafted features derived from resource-rich languages such as English. Such approaches are often difficult to generalize to other languages, especially when suprasegmental qualities such as rhythm must also be considered. In this work, we examine three languages with distinct rhythm patterns: English (stress-timed), Malay (syllable-timed), and Tamil (mora-timed). We exploit robust feature representations inspired by music processing and vector representation learning. Empirical validation shows consistent gains for all three languages when predicting pronunciation, rhythm, and intonation performance.

research
10/11/2022

Multilingual BERT has an accent: Evaluating English influences on fluency in multilingual models

While multilingual language models can improve NLP performance on low-re...
research
08/04/2017

Massively Multilingual Neural Grapheme-to-Phoneme Conversion

Grapheme-to-phoneme conversion (g2p) is necessary for text-to-speech and...
research
01/29/2020

Learning Robust and Multilingual Speech Representations

Unsupervised speech representation learning has shown remarkable success...
research
09/27/2022

Multilingual analysis of intelligibility classification using English, Korean, and Tamil dysarthric speech datasets

This paper analyzes dysarthric speech datasets from three languages with...
research
08/23/2018

Role of Intonation in Scoring Spoken English

In this paper, we have introduced and evaluated intonation based feature...
research
06/12/2016

External Lexical Information for Multilingual Part-of-Speech Tagging

Morphosyntactic lexicons and word vector representations have both prove...
research
02/23/2018

The JHU Speech LOREHLT 2017 System: Cross-Language Transfer for Situation-Frame Detection

We describe the system our team used during NIST's LoReHLT (Low Resource...
