Latent Phrase Matching for Dysarthric Speech

06/08/2023
by   Colin Lea, et al.
0

Many consumer speech recognition systems are not tuned for people with speech disabilities, resulting in poor recognition and user experience, especially for severe speech differences. Recent studies have emphasized interest in personalized speech models from people with atypical speech patterns. We propose a query-by-example-based personalized phrase recognition system that is trained using small amounts of speech, is language agnostic, does not assume a traditional pronunciation lexicon, and generalizes well across speech difference severities. On an internal dataset collected from 32 people with dysarthria, this approach works regardless of severity and shows a 60 improvement in recall relative to a commercial speech recognition system. On the public EasyCall dataset of dysarthric speech, our approach improves accuracy by 30.5 consistently outperforms ASR systems when trained with 50 unique phrases.

READ FULL TEXT
research
02/17/2023

From User Perceptions to Technical Improvement: Enabling People Who Stutter to Better Use Speech Recognition

Consumer speech recognition systems do not work as well for many people ...
research
07/31/2019

Personalizing ASR for Dysarthric and Accented Speech with Limited Data

Automatic speech recognition (ASR) systems have dramatically improved ov...
research
10/09/2021

Personalized Automatic Speech Recognition Trained on Small Disordered Speech Datasets

This study investigates the performance of personalized automatic speech...
research
02/15/2022

Nonverbal Sound Detection for Disordered Speech

Voice assistants have become an essential tool for people with various d...
research
10/05/2021

Fast Contextual Adaptation with Neural Associative Memory for On-Device Personalized Speech Recognition

Fast contextual adaptation has shown to be effective in improving Automa...
research
02/24/2021

SEP-28k: A Dataset for Stuttering Event Detection From Podcasts With People Who Stutter

The ability to automatically detect stuttering events in speech could he...
research
11/17/2021

The People's Speech: A Large-Scale Diverse English Speech Recognition Dataset for Commercial Usage

The People's Speech is a free-to-download 30,000-hour and growing superv...

Please sign up or login with your details

Forgot password? Click here to reset