Use of Speech Impairment Severity for Dysarthric Speech Recognition

05/18/2023
by Mengzhe Geng, et al.

A key challenge in dysarthric speech recognition is speaker-level diversity, which is attributed both to speaker-identity factors such as gender and to speech impairment severity. Most prior research addressing this issue has used speaker identity only. This paper therefore proposes a novel set of techniques that use both severity and speaker identity in dysarthric speech recognition: a) multitask training incorporating severity prediction error; b) speaker-severity aware auxiliary feature adaptation; and c) structured LHUC transforms separately conditioned on speaker identity and severity. Experiments conducted on UASpeech suggest that incorporating speech impairment severity into state-of-the-art hybrid DNN, E2E Conformer and pre-trained Wav2vec 2.0 ASR systems produced statistically significant WER reductions of up to 4.78%, with a published WER of 17.82% on UASpeech.
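To illustrate technique c), the following is a minimal sketch of LHUC-style scaling with two separately learned vectors, one per speaker and one per severity level. It is not the paper's implementation. It assumes the common LHUC parameterisation, in which each hidden unit is scaled by 2 * sigmoid(r), and it assumes the two scalings are applied one after the other. The names `lhuc_scale` and `structured_lhuc` are hypothetical.

```python
import math

def lhuc_scale(r):
    """Common LHUC amplitude function: 2 * sigmoid(r), bounded in (0, 2)."""
    return 2.0 / (1.0 + math.exp(-r))

def structured_lhuc(hidden, r_speaker, r_severity):
    """Scale each hidden unit by a speaker factor and a severity factor.

    Assumption: the two transforms are composed multiplicatively.
    The paper may combine them differently.
    """
    return [
        h * lhuc_scale(a) * lhuc_scale(b)
        for h, a, b in zip(hidden, r_speaker, r_severity)
    ]

hidden = [1.0, -2.0, 0.5]

# Zero-valued parameters give a scale of exactly 1, so the layer is unchanged.
identity = structured_lhuc(hidden, [0.0] * 3, [0.0] * 3)

# Non-zero parameters amplify or attenuate individual hidden units.
adapted = structured_lhuc(hidden, [0.0, 1.0, -1.0], [0.5, 0.0, 0.0])
```

In this sketch, only the two parameter vectors would be learned per speaker and per severity group, while the rest of the network stays shared.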


