Aphasic Speech Recognition using a Mixture of Speech Intelligibility Experts

08/25/2020
by Matthew Perez, et al.

Robust speech recognition is a key prerequisite for semantic feature extraction in automatic aphasic speech analysis. However, standard one-size-fits-all automatic speech recognition models perform poorly when applied to aphasic speech. One reason is that speech intelligibility varies widely with the severity of aphasia (i.e., higher severity tends to produce less intelligible speech). To address this, we propose a novel acoustic model based on a mixture of experts (MoE), which handles the varying levels of intelligibility present in aphasic speech by explicitly defining severity-based experts. At test time, the contribution of each expert is determined by a speech intelligibility detector (SID), which estimates the intelligibility of the input speech. We show that the proposed approach significantly reduces phone error rates across all severity levels in aphasic speech compared to a baseline that does not incorporate severity information into the modeling process.
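The mixing step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the architecture, so the number of experts, the utterance-level SID scores, the softmax weighting, and all names (`mix_experts`, `sid_logits`, `expert_logits`) are assumptions made for illustration. Each severity-based expert produces per-frame phone posteriors, and the SID output decides how much each expert contributes.

```python
import numpy as np

def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - np.max(x, axis=axis, keepdims=True)
    e = np.exp(x)
    return e / np.sum(e, axis=axis, keepdims=True)

def mix_experts(expert_logits, sid_logits):
    """Combine severity-based experts using SID-derived weights.

    expert_logits: array (n_experts, n_frames, n_phones), hypothetical
                   per-frame phone scores from each expert.
    sid_logits:    array (n_experts,), hypothetical utterance-level scores
                   from a speech intelligibility detector.
    Returns the mixed phone posteriors (n_frames, n_phones) and the weights.
    """
    weights = softmax(sid_logits)                  # expert contributions, sum to 1
    expert_post = softmax(expert_logits, axis=-1)  # per-expert phone posteriors
    # Weighted sum over the expert axis.
    mixed = np.tensordot(weights, expert_post, axes=(0, 0))
    return mixed, weights

# Toy example: 3 assumed experts, 5 frames, 40 phone classes.
rng = np.random.default_rng(0)
expert_logits = rng.normal(size=(3, 5, 40))
sid_logits = np.array([2.0, 0.5, -1.0])  # SID favours the first expert
mixed, weights = mix_experts(expert_logits, sid_logits)
predicted_phones = mixed.argmax(axis=-1)  # one phone index per frame
```

Because the weights sum to one and each expert outputs a valid distribution, the mixture is itself a valid distribution over phones for every frame. A hard selection of the single most likely expert would be an alternative design; the soft weighting here is one plausible reading of "the contribution of each expert".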
