Multi-class Detection of Pathological Speech with Latent Features: How does it perform on unseen data?

10/27/2022
by   Dominik Wagner, et al.
0

The detection of pathologies from speech features is usually defined as a binary classification task with one class representing a specific pathology and the other class representing healthy speech. In this work, we train neural networks, large margin classifiers, and tree boosting machines to distinguish between four different pathologies: Parkinson's disease, laryngeal cancer, cleft lip and palate, and oral squamous cell carcinoma. We demonstrate that latent representations extracted at different layers of a pre-trained wav2vec 2.0 system can be effectively used to classify these types of pathological voices. We evaluate the robustness of our classifiers by adding room impulse responses to the test data and by applying them to unseen speech corpora. Our approach achieves unweighted average F1-Scores between 74.1 depending on the model and the noise conditions used. The systems generalize and perform well on unseen data of healthy speakers sampled from a variety of different sources.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/28/2022

The Importance of Speech Stimuli for Pathologic Speech Classification

Current findings show that pre-trained wav2vec 2.0 models can be success...
research
07/18/2023

Detecting Throat Cancer from Speech Signals Using Machine Learning: A Reproducible Literature Review

In this work we perform a scoping review of the current literature on th...
research
03/21/2022

Multi-class versus One-class classifier in spontaneous speech analysis oriented to Alzheimer Disease diagnosis

Most of medical developments require the ability to identify samples tha...
research
09/21/2022

MulBot: Unsupervised Bot Detection Based on Multivariate Time Series

Online social networks are actively involved in the removal of malicious...
research
03/18/2022

Identification of Hypokinetic Dysarthria Using Acoustic Analysis of Poem Recitation

Up to 90 dysarthria (HD). In this work, we analysed the power of conven...
research
11/19/2019

Towards non-toxic landscapes: Automatic toxic comment detection using DNN

The spectacular expansion of the Internet led to the development of a ne...
research
08/16/2023

Classifying Dementia in the Presence of Depression: A Cross-Corpus Study

Automated dementia screening enables early detection and intervention, r...

Please sign up or login with your details

Forgot password? Click here to reset