Deep segmental phonetic posterior-grams based discovery of non-categories in L2 English speech

02/01/2020
by   Xu Li, et al.
0

Second language (L2) speech is often labeled with the native, phone categories. However, in many cases, it is difficult to decide on a categorical phone that an L2 segment belongs to. These segments are regarded as non-categories. Most existing approaches for Mispronunciation Detection and Diagnosis (MDD) are only concerned with categorical errors, i.e. a phone category is inserted, deleted or substituted by another. However, non-categorical errors are not considered. To model these non-categorical errors, this work aims at exploring non-categorical patterns to extend the categorical phone set. We apply a phonetic segment classifier to generate segmental phonetic posterior-grams (SPPGs) to represent phone segment-level information. And then we explore the non-categories by looking for the SPPGs with more than one peak. Compared with the baseline system, this approach explores more non-categorical patterns, and also perceptual experimental results show that the explored non-categories are more accurate with increased confusion degree by 7.3 preliminarily analyze the reason behind those non-categories.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/25/2020

An End-to-End Mispronunciation Detection System for L2 English Speech Leveraging Novel Anti-Phone Modeling

Mispronunciation detection and diagnosis (MDD) is a core component of co...
research
07/30/2023

Mispronunciation detection using self-supervised speech representations

In recent years, self-supervised learning (SSL) models have produced pro...
research
03/29/2022

Automatic Detection of Speech Sound Disorder in Child Speech Using Posterior-based Speaker Representations

This paper presents a macroscopic approach to automatic detection of spe...
research
10/19/2021

On Clustering Categories of Categorical Predictors in Generalized Linear Models

We propose a method to reduce the complexity of Generalized Linear Model...
research
05/31/2022

Easy Variational Inference for Categorical Models via an Independent Binary Approximation

We pursue tractable Bayesian analysis of generalized linear models (GLMs...
research
08/06/2020

Evaluating computational models of infant phonetic learning across languages

In the first year of life, infants' speech perception becomes attuned to...
research
02/14/2017

Gaussian-Dirichlet Posterior Dominance in Sequential Learning

We consider the problem of sequential learning from categorical observat...

Please sign up or login with your details

Forgot password? Click here to reset