Careful Whisper – leveraging advances in automatic speech recognition for robust and interpretable aphasia subtype classification

08/02/2023
by   Laurin Wagner, et al.
0

This paper presents a fully automated approach for identifying speech anomalies from voice recordings to aid in the assessment of speech impairments. By combining Connectionist Temporal Classification (CTC) and encoder-decoder-based automatic speech recognition models, we generate rich acoustic and clean transcripts. We then apply several natural language processing methods to extract features from these transcripts to produce prototypes of healthy speech. Basic distance measures from these prototypes serve as input features for standard machine learning classifiers, yielding human-level accuracy for the distinction between recordings of people with aphasia and a healthy control group. Furthermore, the most frequently occurring aphasia types can be distinguished with 90 applicable to other diseases and languages, showing promise for robustly extracting diagnostic speech biomarkers.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/02/2023

Improved Training for End-to-End Streaming Automatic Speech Recognition Model with Punctuation

Punctuated text prediction is crucial for automatic speech recognition a...
research
09/04/2020

Silent Speech Interfaces for Speech Restoration: A Review

This review summarises the status of silent speech interface (SSI) resea...
research
04/14/2022

Lombard Effect for Bilingual Speakers in Cantonese and English: importance of spectro-temporal features

For a better understanding of the mechanisms underlying speech perceptio...
research
04/26/2022

Parkinson's disease diagnostics using AI and natural language knowledge transfer

In this work, the issue of Parkinson's disease (PD) diagnostics using no...
research
05/05/2021

Accent Recognition with Hybrid Phonetic Features

The performance of voice-controlled systems is usually influenced by acc...
research
01/13/2023

Automated speech- and text-based classification of neuropsychiatric conditions in a multidiagnostic setting

Speech patterns have been identified as potential diagnostic markers for...
research
11/15/2020

Automatic and perceptual discrimination between dysarthria, apraxia of speech, and neurotypical speech

Automatic techniques in the context of motor speech disorders (MSDs) are...

Please sign up or login with your details

Forgot password? Click here to reset