Convolutional Neural Network-Based Age Estimation Using B-Mode Ultrasound Tongue Image

01/27/2021
by   Kele Xu, et al.
0

Ultrasound tongue imaging is widely used for speech production research, and it has attracted increasing attention as its potential applications seem to be evident in many different fields, such as the visual biofeedback tool for second language acquisition and silent speech interface. Unlike previous studies, here we explore the feasibility of age estimation using the ultrasound tongue image of the speakers. Motivated by the success of deep learning, this paper leverages deep learning on this task. We train a deep convolutional neural network model on the UltraSuite dataset. The deep model achieves mean absolute error (MAE) of 2.03 for the data from typically developing children, while MAE is 4.87 for the data from the children with speech sound disorders, which suggest that age estimation using ultrasound is more challenging for the children with speech sound disorder. The developed method can be used a tool to evaluate the performance of speech therapy sessions. It is also worthwhile to notice that, although we leverage the ultrasound tongue imaging for our study, the proposed methods may also be extended to other imaging modalities (e.g. MRI) to assist the studies on speech production.

READ FULL TEXT

page 2

page 4

research
07/01/2019

UltraSuite: A Repository of Ultrasound and Acoustic Data from Child Speech Therapy Sessions

We introduce UltraSuite, a curated repository of ultrasound and acoustic...
research
05/22/2023

Towards Ultrasound Tongue Image prediction from EEG during speech production

Previous initial research has already been carried out to propose speech...
research
02/27/2021

Exploiting ultrasound tongue imaging for the automatic detection of speech articulation errors

Speech sound disorders are a common communication impairment in childhoo...
research
08/06/2020

Quantification of Transducer Misalignment in Ultrasound Tongue Imaging

In speech production research, different imaging modalities have been em...
research
06/20/2021

Improving Ultrasound Tongue Image Reconstruction from Lip Images Using Self-supervised Learning and Attention Mechanism

Speech production is a dynamic procedure, which involved multi human org...
research
08/09/2019

Synthetic Elastography using B-mode Ultrasound through a Deep Fully-Convolutional Neural Network

Shear-wave elastography (SWE) permits local estimation of tissue elastic...
research
07/02/2019

Estimation of Absolute States of Human Skeletal Muscle via Standard B-Mode Ultrasound Imaging and Deep Convolutional Neural Networks

Objective: To test automated in vivo estimation of active and passive sk...

Please sign up or login with your details

Forgot password? Click here to reset