Deep Learning-based automated classification of Chinese Speech Sound Disorders

05/24/2022
by   Yao-Ming Kuo, et al.
0

This article describes a system for analyzing acoustic data to assist in the diagnosis and classification of children's speech sound disorders (SSDs) using a computer. The analysis concentrated on identifying and categorizing four distinct types of Chinese SSDs. The study collected and generated a speech corpus containing 2540 stopping, backing, final consonant deletion process (FCDP), and affrication samples from 90 children aged 3–6 years with normal or pathological articulatory features. Each recording was accompanied by a detailed diagnostic annotation by two speech-language pathologists (SLPs). Classification of the speech samples was accomplished using three well-established neural network models for image classification. The feature maps were created using three sets of Mel-frequency cepstral coefficients (MFCC) parameters extracted from speech sounds and aggregated into a three-dimensional data structure as model input. We employed six techniques for data augmentation to augment the available dataset while avoiding overfitting. The experiments examine the usability of four different categories of Chinese phrases and characters. Experiments with different data subsets demonstrate the system's ability to accurately detect the analyzed pronunciation disorders. The best multi-class classification using a single Chinese phrase achieves an accuracy of 74.4 percent.

READ FULL TEXT

page 3

page 4

page 5

page 6

research
11/09/2020

Data Augmentation For Children's Speech Recognition – The "Ethiopian" System For The SLT 2021 Children Speech Recognition Challenge

This paper presents the "Ethiopian" system for the SLT 2021 Children Spe...
research
10/09/2020

Learning to Pronounce Chinese Without a Pronunciation Dictionary

We demonstrate a program that learns to pronounce Chinese text in Mandar...
research
07/01/2019

UltraSuite: A Repository of Ultrasound and Acoustic Data from Child Speech Therapy Sessions

We introduce UltraSuite, a curated repository of ultrasound and acoustic...
research
05/31/2021

Parkinsonian Chinese Speech Analysis towards Automatic Classification of Parkinson's Disease

Speech disorders often occur at the early stage of Parkinson's disease (...
research
05/02/2022

A Novel Speech-Driven Lip-Sync Model with CNN and LSTM

Generating synchronized and natural lip movement with speech is one of t...

Please sign up or login with your details

Forgot password? Click here to reset