Speech Intelligibility Assessment of Dysarthric Speech by using Goodness of Pronunciation with Uncertainty Quantification

05/28/2023
by Eun Jung Yeo, et al.

This paper proposes an improved Goodness of Pronunciation (GoP) that utilizes Uncertainty Quantification (UQ) for automatic speech intelligibility assessment of dysarthric speech. Current GoP methods rely heavily on overconfident predictions from neural networks, which makes them unsuitable for assessing dysarthric speech, given its significant acoustic differences from healthy speech. To alleviate this problem, UQ techniques are applied to GoP by 1) normalizing the phoneme prediction (entropy, margin, maxlogit, logit-margin) and 2) modifying the scoring function (scaling, prior normalization). As a result, prior-normalized maxlogit GoP achieves the best performance, with a relative increase of 5.66 English, Korean, and Tamil, respectively. Furthermore, a phoneme-level analysis is conducted to identify which phoneme scores correlate significantly with intelligibility scores in each language.
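The abstract names the four phoneme-prediction scores but does not give their formulas, so the following is only a minimal sketch under stated assumptions, not the authors' implementation. It assumes each frame yields a vector of phoneme logits, that `target` is the index of the canonical phoneme, that "maxlogit" is read as the raw logit of that phoneme, and that prior normalization subtracts the log of a phoneme prior from the segment score. The names `frame_scores` and `segment_gop` are hypothetical.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def frame_scores(logits, target):
    """Per-frame confidence scores for the canonical phoneme `target`.

    All four definitions are assumptions made for illustration.
    """
    probs = softmax(logits)
    best_other_p = max(p for i, p in enumerate(probs) if i != target)
    best_other_l = max(x for i, x in enumerate(logits) if i != target)
    entropy = -sum(p * math.log(p) for p in probs if p > 0.0)
    return {
        # Higher is more confident, so entropy is negated.
        "neg_entropy": -entropy,
        # Posterior of the target minus the best competing posterior.
        "margin": probs[target] - best_other_p,
        # Raw logit of the target, which avoids softmax saturation.
        "maxlogit": logits[target],
        # Logit of the target minus the best competing logit.
        "logit_margin": logits[target] - best_other_l,
    }


def segment_gop(frames, target, key, prior=None):
    """Average a frame-level score over a phoneme segment.

    If `prior` (an assumed phoneme prior probability) is given,
    its log is subtracted as one possible form of prior normalization.
    """
    scores = [frame_scores(f, target)[key] for f in frames]
    mean = sum(scores) / len(scores)
    if prior is not None:
        mean -= math.log(prior)
    return mean


# Two frames of made-up logits over three phonemes; the target is phoneme 0.
frames = [[2.0, 0.5, -1.0], [1.0, 1.5, 0.0]]
plain = segment_gop(frames, 0, "maxlogit")
normalized = segment_gop(frames, 0, "maxlogit", prior=0.5)
print(plain, normalized)
```

Working on logits rather than posteriors is one way to sidestep overconfident softmax outputs, which is the motivation the abstract gives for moving beyond the baseline GoP.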

