Low-dimensional representation of infant and adult vocalization acoustics

04/25/2022
by   Silvia Pagliarini, et al.
0

During the first years of life, infant vocalizations change considerably, as infants develop the vocalization skills that enable them to produce speech sounds. Characterizations based on specific acoustic features, protophone categories, or phonetic transcription are able to provide a representation of the sounds infants make at different ages and in different contexts but do not fully describe how sounds are perceived by listeners, can be inefficient to obtain at large scales, and are difficult to visualize in two dimensions without additional statistical processing. Machine-learning-based approaches provide the opportunity to complement these characterizations with purely data-driven representations of infant sounds. Here, we use spectral features extraction and unsupervised machine learning, specifically Uniform Manifold Approximation (UMAP), to obtain a novel 2-dimensional spatial representation of infant and caregiver vocalizations extracted from day-long home recordings. UMAP yields a continuous and well-distributed space conducive to certain analyses of infant vocal development. For instance, we found that the dispersion of infant vocalization acoustics within the 2-D space over a day increased from 3 to 9 months, and then decreased from 9 to 18 months. The method also permits analysis of similarity between infant and adult vocalizations, which also shows changes with infant age.

READ FULL TEXT
research
11/08/2016

Inferring low-dimensional microstructure representations using convolutional neural networks

We apply recent advances in machine learning and computer vision to a ce...
research
09/13/2022

Data-Driven Spectral Submanifold Reduction for Nonlinear Optimal Control of High-Dimensional Robots

Modeling and control of high-dimensional, nonlinear robotic systems rema...
research
09/05/2022

Advancing Reacting Flow Simulations with Data-Driven Models

The use of machine learning algorithms to predict behaviors of complex s...
research
11/26/2019

Robust Estimation of Hypernasality in Dysarthria

Hypernasality is a common symptom across many motor-speech disorders. Fo...
research
11/26/2019

Robust Estimation of Hypernasality in Dysarthria with Acoustic Model Likelihood Features

Hypernasality is a common characteristic symptom across many motor-speec...
research
11/16/2016

The Life of Lazarillo de Tormes and of His Machine Learning Adversities

Summit work of the Spanish Golden Age and forefather of the so-called pi...
research
10/02/2017

Learning event representation: As sparse as possible, but not sparser

Selecting an optimal event representation is essential for event classif...

Please sign up or login with your details

Forgot password? Click here to reset