On-Device Personalization of Automatic Speech Recognition Models for Disordered Speech

06/18/2021
by Katrin Tomanek, et al.

While current state-of-the-art Automatic Speech Recognition (ASR) systems achieve high accuracy on typical speech, they suffer from significant performance degradation on disordered speech and other atypical speech patterns. Personalization of ASR models, a commonly applied solution to this problem, is usually performed in a server-based training environment, which poses problems around data privacy, delayed model-update times, and the communication cost of copying data and models between the mobile device and server infrastructure. In this paper, we present an approach to on-device ASR personalization with very small amounts of speaker-specific data. We test our approach on a diverse set of 100 speakers with disordered speech and find a median relative word error rate improvement of 71% [...] utterances required per speaker. When tested on a voice-controlled home automation platform, on-device personalized models show a median task success rate of 81%.

research
10/09/2021

Personalized Automatic Speech Recognition Trained on Small Disordered Speech Datasets

This study investigates the performance of personalized automatic speech...
research
09/14/2019

An Investigation Into On-device Personalization of End-to-end Automatic Speech Recognition Models

Speaker-independent speech recognition systems trained with data from ma...
research
06/18/2020

Automatic Speech Recognition Benchmark for Air-Traffic Communications

Advances in Automatic Speech Recognition (ASR) over the last decade open...
research
03/24/2021

Voice Privacy with Smart Digital Assistants in Educational Settings

The emergence of voice-assistant devices ushers in delightful user exper...
research
08/06/2019

Practical Speech Recognition with HTK

The practical aspects of developing an Automatic Speech Recognition Syst...
research
06/15/2023

MobileASR: A resource-aware on-device personalisation framework for automatic speech recognition in mobile phones

We describe a comprehensive methodology for developing user-voice person...
research
12/19/2018

Streaming Voice Query Recognition using Causal Convolutional Recurrent Neural Networks

Voice-enabled commercial products are ubiquitous, typically enabled by l...
