Classifying Autism from Crowdsourced Semi-Structured Speech Recordings: A Machine Learning Approach

by Nathan A. Chi, et al.

Autism spectrum disorder (ASD) is a neurodevelopmental disorder that results in altered behavior, social development, and communication patterns. In past years, autism prevalence has tripled, with 1 in 54 children now affected. Given that traditional diagnosis is a lengthy, labor-intensive process, significant attention has been given to developing systems that automatically screen for autism. Prosody abnormalities are among the clearest signs of autism, with affected children displaying speech idiosyncrasies including echolalia, monotonous intonation, atypical pitch, and irregular linguistic stress patterns. In this work, we present a suite of machine learning approaches to detect autism in self-recorded speech audio captured from autistic and neurotypical (NT) children in home environments. We consider three methods to detect autism in child speech: first, Random Forests trained on extracted audio features (including Mel-frequency cepstral coefficients); second, convolutional neural networks (CNNs) trained on spectrograms; and third, fine-tuned wav2vec 2.0, a state-of-the-art Transformer-based ASR model. We train our classifiers on our novel dataset of cellphone-recorded child speech audio curated from Stanford's Guess What? mobile game, an app designed to crowdsource videos of autistic and neurotypical children in a natural home environment. The Random Forest classifier achieves 70% accuracy, and another of our models achieves 77%, when classifying children's audio as either ASD or NT. Our models were able to predict autism status when trained on a varied selection of home audio clips with inconsistent recording quality, a setting that may generalize better to real-world conditions than controlled recordings. These results demonstrate that machine learning methods offer promise in detecting autism automatically from speech without specialized equipment.
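
To make the first of the three methods more concrete, the following is a minimal sketch of how Mel-frequency cepstral coefficients might be computed from a clip and summarized into a fixed-length feature vector. It is not the authors' pipeline: the function names, the frame size, hop length, filter count, and the mean/standard-deviation summary are all assumptions chosen for illustration, and only NumPy is used. A vector of this kind is what a Random Forest (for example, scikit-learn's `RandomForestClassifier`) would take as one training row.

```python
import numpy as np


def hz_to_mel(hz):
    return 2595.0 * np.log10(1.0 + hz / 700.0)


def mel_to_hz(mel):
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def mel_filterbank(n_filters, n_fft, sample_rate):
    """Triangular filters spaced evenly on the mel scale."""
    mel_points = np.linspace(hz_to_mel(0.0), hz_to_mel(sample_rate / 2.0), n_filters + 2)
    bins = np.floor((n_fft + 1) * mel_to_hz(mel_points) / sample_rate).astype(int)
    bank = np.zeros((n_filters, n_fft // 2 + 1))
    for i in range(1, n_filters + 1):
        left, center, right = bins[i - 1], bins[i], bins[i + 1]
        for k in range(left, center):      # rising edge of the triangle
            bank[i - 1, k] = (k - left) / (center - left)
        for k in range(center, right):     # falling edge of the triangle
            bank[i - 1, k] = (right - k) / (right - center)
    return bank


def mfcc(signal, sample_rate, n_mfcc=13, n_filters=26, n_fft=512, hop=160):
    """Return an (n_frames, n_mfcc) array; the signal must be at least n_fft samples long.

    All default parameter values are illustrative assumptions.
    """
    # Slice the signal into overlapping frames and apply a Hamming window.
    n_frames = 1 + (len(signal) - n_fft) // hop
    idx = np.arange(n_fft)[None, :] + hop * np.arange(n_frames)[:, None]
    frames = signal[idx] * np.hamming(n_fft)
    # Power spectrum of each frame.
    power = np.abs(np.fft.rfft(frames, n=n_fft, axis=1)) ** 2 / n_fft
    # Log mel-band energies (small constant avoids log(0)).
    log_mel = np.log(power @ mel_filterbank(n_filters, n_fft, sample_rate).T + 1e-10)
    # Unnormalized DCT-II across the filter axis, keeping the first n_mfcc coefficients.
    n = np.arange(n_filters)
    basis = np.cos(np.pi * np.arange(n_mfcc)[:, None] * (2 * n + 1) / (2 * n_filters))
    return log_mel @ basis.T


def clip_features(signal, sample_rate):
    """Summarize a variable-length clip as per-coefficient mean and standard deviation."""
    coeffs = mfcc(signal, sample_rate)
    return np.concatenate([coeffs.mean(axis=0), coeffs.std(axis=0)])


# Usage on a synthetic one-second tone standing in for a real recording.
tone = np.sin(2 * np.pi * 440.0 * np.arange(16000) / 16000.0)
features = clip_features(tone, 16000)
print(features.shape)  # (26,)
```

Summarizing over frames discards timing information but gives every clip the same feature length regardless of duration, which is convenient for tree-based classifiers; the spectrogram and wav2vec 2.0 approaches mentioned in the abstract instead operate on the time-resolved signal.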




Enhancing Child Vocalization Classification in Multi-Channel Child-Adult Conversations Through Wav2vec2 Children ASR Features

Autism Spectrum Disorder (ASD) is a neurodevelopmental disorder that oft...

Detecting Autism Spectrum Disorders with Machine Learning Models Using Speech Transcripts

Autism spectrum disorder (ASD) can be defined as a neurodevelopmental di...

Can Machine Learning Be Used to Recognize and Diagnose Coughs?

5G is bringing new use cases to the forefront, one of the most prominent...

Generacion de voces artificiales infantiles en castellano con acento costarricense

This article evaluates a first experience of generating artificial child...

Comparing Machine Learning-Centered Approaches for Forecasting Language Patterns During Frustration in Early Childhood

When faced with self-regulation challenges, children have been known the...

Automatic Detection of Expressed Emotion from Five-Minute Speech Samples: Challenges and Opportunities

We present a novel feasibility study on the automatic recognition of Exp...
