
Transparent pronunciation scoring using articulatorily weighted phoneme edit distance

05/07/2019
by Reima Karhila, et al.

To study the effects of gamification on children's foreign language learning in the "Say It Again, Kid!" project, we developed a feedback paradigm that can drive gameplay in pronunciation learning games. We describe our scoring system, which is based on the difference between a reference phone sequence and the output of a multilingual CTC phoneme recogniser. We present a white-box scoring model: a mapped, weighted Levenshtein edit distance between the reference and the recognised sequence, with error weights for articulatory differences computed from a training set of scored utterances. The system can produce a human-readable list showing how much each detected mispronunciation contributes to the utterance score. We compare our scoring method to established black-box methods.
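
The general idea of an articulatorily weighted phoneme edit distance can be illustrated with a minimal Python sketch. This is not the authors' implementation: the feature table `FEATURES`, the weights in `FEATURE_WEIGHTS`, the constant `INDEL_COST`, and the function names are all hypothetical, and the weights are fixed by hand here, whereas the abstract states that they are computed from a training set of scored utterances. The sketch aligns a reference phoneme sequence with a recognised one and returns both the total distance and a readable list of each error's contribution.

```python
from typing import Dict, List, Tuple

# Hypothetical articulatory features per phoneme: (voicing, place, manner).
# Illustrative only; the paper's actual feature mapping is not given here.
FEATURES: Dict[str, Tuple[str, str, str]] = {
    "p": ("voiceless", "bilabial", "plosive"),
    "b": ("voiced", "bilabial", "plosive"),
    "t": ("voiceless", "alveolar", "plosive"),
    "d": ("voiced", "alveolar", "plosive"),
    "s": ("voiceless", "alveolar", "fricative"),
    "z": ("voiced", "alveolar", "fricative"),
}
# Hypothetical hand-set weights for a mismatch in voicing, place, manner.
FEATURE_WEIGHTS = (0.3, 0.5, 0.7)
INDEL_COST = 1.0  # assumed cost of inserting or deleting one phoneme
EPS = 1e-9


def sub_cost(ref: str, rec: str) -> float:
    """Substitution cost: sum of weights of the features that differ."""
    if ref == rec:
        return 0.0
    f_ref, f_rec = FEATURES.get(ref), FEATURES.get(rec)
    if f_ref is None or f_rec is None:
        return INDEL_COST  # phoneme without features: fall back to a flat cost
    return sum(w for w, a, b in zip(FEATURE_WEIGHTS, f_ref, f_rec) if a != b)


def score(reference: List[str], recognised: List[str]) -> Tuple[float, List[Tuple[str, float]]]:
    """Weighted Levenshtein distance plus a per-error cost breakdown."""
    n, m = len(reference), len(recognised)
    dist = [[0.0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        dist[i][0] = i * INDEL_COST
    for j in range(1, m + 1):
        dist[0][j] = j * INDEL_COST
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            dist[i][j] = min(
                dist[i - 1][j - 1] + sub_cost(reference[i - 1], recognised[j - 1]),
                dist[i - 1][j] + INDEL_COST,  # reference phoneme missing
                dist[i][j - 1] + INDEL_COST,  # extra phoneme recognised
            )

    # Backtrace one optimal alignment and record each non-zero edit.
    errors: List[Tuple[str, float]] = []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0:
            c = sub_cost(reference[i - 1], recognised[j - 1])
            if abs(dist[i][j] - (dist[i - 1][j - 1] + c)) < EPS:
                if c > 0:
                    errors.append((f"{reference[i - 1]} -> {recognised[j - 1]}", c))
                i, j = i - 1, j - 1
                continue
        if i > 0 and abs(dist[i][j] - (dist[i - 1][j] + INDEL_COST)) < EPS:
            errors.append((f"deleted {reference[i - 1]}", INDEL_COST))
            i -= 1
            continue
        errors.append((f"inserted {recognised[j - 1]}", INDEL_COST))
        j -= 1
    errors.reverse()
    return dist[n][m], errors


total, errors = score(["b", "a", "t"], ["p", "a", "t"])
for description, cost in errors:
    print(f"{description}: {cost:.2f}")
print(f"total: {total:.2f}")
```

Because every substitution cost is a sum of named feature mismatches, the returned list can be shown directly to a learner or a game engine, which is one way to read the "white-box" property described above. How the distance is mapped to a final utterance score is not specified in the abstract and is left out of this sketch.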

