Automatic Severity Assessment of Dysarthric speech by using Self-supervised Model with Multi-task Learning

10/27/2022
by   Eun Jung Yeo, et al.
0

Automatic assessment of dysarthric speech is essential for sustained treatments and rehabilitation. However, obtaining atypical speech is challenging, often leading to data scarcity issues. To tackle the problem, we propose a novel automatic severity assessment method for dysarthric speech, using the self-supervised model in conjunction with multi-task learning. Wav2vec 2.0 XLS-R is jointly trained for two different tasks: severity level classification and an auxilary automatic speech recognition (ASR). For the baseline experiments, we employ hand-crafted features such as eGeMaps and linguistic features, and SVM, MLP, and XGBoost classifiers. Explored on the Korean dysarthric speech QoLT database, our model outperforms the traditional baseline methods, with a relative percentage increase of 4.79 classification accuracy. In addition, the proposed model surpasses the model trained without ASR head, achieving 10.09 Furthermore, we present how multi-task learning affects the severity classification performance by analyzing the latent representations and regularization effect.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/02/2022

ASR-Aware End-to-end Neural Diarization

We present a Conformer-based end-to-end neural diarization (EEND) model ...
research
11/23/2020

Multi-task Language Modeling for Improving Speech Recognition of Rare Words

End-to-end automatic speech recognition (ASR) systems are increasingly p...
research
06/16/2020

Towards Automated Assessment of Stuttering and Stuttering Therapy

Stuttering is a complex speech disorder that can be identified by repeti...
research
08/09/2017

DeepFaceLIFT: Interpretable Personalized Models for Automatic Estimation of Self-Reported Pain

Previous research on automatic pain estimation from facial expressions h...
research
08/24/2023

MultiPA: a multi-task speech pronunciation assessment system for a closed and open response scenario

The design of automatic speech pronunciation assessment can be categoriz...
research
11/15/2020

Automatic and perceptual discrimination between dysarthria, apraxia of speech, and neurotypical speech

Automatic techniques in the context of motor speech disorders (MSDs) are...
research
07/07/2023

STG-MTL: Scalable Task Grouping for Multi-Task Learning Using Data Map

Multi-Task Learning (MTL) is a powerful technique that has gained popula...

Please sign up or login with your details

Forgot password? Click here to reset