Personalization of End-to-end Speech Recognition On Mobile Devices For Named Entities

12/14/2019
by Khe Chai Sim, et al.

We study the effectiveness of several techniques to personalize end-to-end speech models and improve the recognition of proper names relevant to the user. These techniques differ in the amount of user effort required to provide supervision, and are evaluated on how they impact speech recognition performance. We propose using keyword-dependent precision and recall metrics to measure vocabulary acquisition performance. We evaluate the algorithms on a dataset that we designed to contain names of persons that are difficult to recognize. Therefore, the baseline recall rate for proper names in this dataset is very low: 2.4%. [...] with no need for speech input from the user. With speech input, if the user corrects only the names, the name recall rate improves to 64.4%. If the user corrects all the recognition errors, we achieve the best recall of 73.5%. To eliminate the need to upload user data and store personalized models on a server, we focus on performing the entire personalization workflow on a mobile device.
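
The abstract names keyword-dependent precision and recall but does not define them here. The following is a minimal sketch of one plausible reading, not the authors' definition: it assumes single-word keywords, case-insensitive whitespace tokenization, and per-utterance count matching (a keyword counts as correctly recognized up to the number of times it appears in the reference). The function names `count_keywords` and `keyword_precision_recall` and the sample transcripts are hypothetical.

```python
from collections import Counter


def count_keywords(words, keywords):
    """Count how often each keyword occurs in a list of words."""
    return Counter(w for w in words if w in keywords)


def keyword_precision_recall(pairs, keywords):
    """Compute precision and recall restricted to a keyword set.

    pairs: iterable of (reference, hypothesis) transcript strings.
    keywords: single-word keywords, e.g. proper names (an assumption).
    """
    keywords = {k.lower() for k in keywords}
    tp = fp = fn = 0
    for ref, hyp in pairs:
        ref_counts = count_keywords(ref.lower().split(), keywords)
        hyp_counts = count_keywords(hyp.lower().split(), keywords)
        for k in keywords:
            r, h = ref_counts[k], hyp_counts[k]
            tp += min(r, h)      # keyword spoken and recognized
            fp += max(h - r, 0)  # keyword recognized but not spoken
            fn += max(r - h, 0)  # keyword spoken but missed
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall


# Hypothetical transcripts: one missed name, one hit, one false insertion.
pairs = [
    ("call khe chai", "call kay chai"),
    ("text siobhan now", "text siobhan now"),
    ("hello there", "hello siobhan"),
]
precision, recall = keyword_precision_recall(pairs, {"khe", "siobhan"})
```

Restricting the counts to the keyword set means common words that the recognizer already handles well do not mask poor recognition of names, which is the effect a name recall figure such as those quoted in the abstract is meant to expose.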


research
09/14/2019

An Investigation Into On-device Personalization of End-to-end Automatic Speech Recognition Models

Speaker-independent speech recognition systems trained with data from ma...
research
06/21/2019

Phoneme-Based Contextualization for Cross-Lingual Speech Recognition in End-to-End Models

Contextual automatic speech recognition, i.e., biasing recognition towar...
research
03/30/2023

PROCTER: PROnunciation-aware ConTextual adaptER for personalized speech recognition in neural transducers

End-to-End (E2E) automatic speech recognition (ASR) systems used in voic...
research
09/18/2023

CB-Whisper: Contextual Biasing Whisper using TTS-based Keyword Spotting

End-to-end automatic speech recognition (ASR) systems often struggle to ...
research
07/02/2022

UserLibri: A Dataset for ASR Personalization Using Only Text

Personalization of speech models on mobile devices (on-device personaliz...
research
07/12/2022

End-to-end speech recognition modeling from de-identified data

De-identification of data used for automatic speech recognition modeling...
research
02/15/2021

Personalization Strategies for End-to-End Speech Recognition Systems

The recognition of personalized content, such as contact names, remains ...
