Phone Duration Modeling for Speaker Age Estimation in Children

Automatic inference of important paralinguistic information such as age from speech is an important area of research with numerous spoken language technology based applications. Speaker age estimation has applications in enabling personalization and age-appropriate curation of information and content. However, research in speaker age estimation in children is especially challenging due to paucity of relevant speech data representing the developmental spectrum, and the high signal variability especially intra age variability that complicates modeling. Most approaches in children speaker age estimation adopt methods directly from research on adult speech processing. In this paper, we propose features specific to children and focus on speaker's phone duration as an important biomarker of children's age. We propose phone duration modeling for predicting age from child's speech. To enable that, children speech is first forced aligned with the corresponding transcription to derive phone duration distributions. Statistical functionals are computed from phone duration distributions for each phoneme which are in turn used to train regression models to predict speaker age. Two children speech datasets are employed to demonstrate the robustness of phone duration features. We perform age regression experiments on age categories ranging from children studying in kindergarten to grade 10. Experimental results suggest phone durations contain important development-related information of children. Phonemes contributing most to estimation of children speaker age are analyzed and presented.

READ FULL TEXT

page 1

page 5

research
05/25/2016

Design and development a children's speech database

The report presents the process of planning, designing and the developme...
research
10/19/2022

Speaker- and Age-Invariant Training for Child Acoustic Modeling Using Adversarial Multi-Task Learning

One of the major challenges in acoustic modelling of child speech is the...
research
09/27/2022

Automated Sex Classification of Children's Voices and Changes in Differentiating Factors with Age

Sex classification of children's voices allows for an investigation of t...
research
03/29/2022

Automatic Detection of Speech Sound Disorder in Child Speech Using Posterior-based Speaker Representations

This paper presents a macroscopic approach to automatic detection of spe...
research
08/05/2018

Kid on The Phone! Toward Automatic Detection of Children on Mobile Devices

Studies have shown that children can be exposed to smart devices at a ve...
research
02/02/2021

Child-Computer Interaction: Recent Works, New Dataset, and Age Detection

We overview recent research in Child-Computer Interaction and describe o...
research
10/08/2021

Inferring age-specific differences in susceptibility to and infectiousness upon SARS-CoV-2 infection based on Belgian social contact data

Several important aspects related to SARS-CoV-2 transmission are not wel...

Please sign up or login with your details

Forgot password? Click here to reset