Personalization for BERT-based Discriminative Speech Recognition Rescoring

07/13/2023
by   Jari Kolehmainen, et al.
0

Recognition of personalized content remains a challenge in end-to-end speech recognition. We explore three novel approaches that use personalized content in a neural rescoring step to improve recognition: gazetteers, prompting, and a cross-attention based encoder-decoder model. We use internal de-identified en-US data from interactions with a virtual voice assistant supplemented with personalized named entities to compare these approaches. On a test set with personalized named entities, we show that each of these approaches improves word error rate by over 10 that on this test set, natural language prompts can improve word error rate by 7 gazetteers were found to perform the best with a 10 rate (WER), while also improving WER on a general test set by 1

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/22/2017

Attention-Based End-to-End Speech Recognition on Voice Search

Recently, there has been an increasing interest in end-to-end speech rec...
research
02/15/2021

Personalization Strategies for End-to-End Speech Recognition Systems

The recognition of personalized content, such as contact names, remains ...
research
05/01/2020

Multi-head Monotonic Chunkwise Attention For Online Speech Recognition

The attention mechanism of the Listen, Attend and Spell (LAS) model requ...
research
04/08/2022

Adding Connectionist Temporal Summarization into Conformer to Improve Its Decoder Efficiency For Speech Recognition

The Conformer model is an excellent architecture for speech recognition ...
research
06/21/2021

A Discriminative Entity-Aware Language Model for Virtual Assistants

High-quality automatic speech recognition (ASR) is essential for virtual...
research
03/31/2016

Learning Multiscale Features Directly From Waveforms

Deep learning has dramatically improved the performance of speech recogn...
research
07/02/2022

UserLibri: A Dataset for ASR Personalization Using Only Text

Personalization of speech models on mobile devices (on-device personaliz...

Please sign up or login with your details

Forgot password? Click here to reset