N-gram Boosting: Improving Contextual Biasing with Normalized N-gram Targets

08/04/2023
by Wang Yau Li, et al.

Accurate transcription of proper names and technical terms is particularly important in speech-to-text applications for business conversations. These words, which are essential to understanding the conversation, are often rare and therefore likely to be under-represented in text and audio training data, creating a significant challenge in this domain. We present a two-step keyword boosting mechanism that works on normalized unigrams and n-grams rather than just single tokens, which eliminates the missed-hit issues that arise when boosting raw targets. In addition, we show how adjusting the boosting weight logic avoids over-boosting multi-token keywords. This improves our keyword recognition rate by 26% relative on our proprietary in-domain dataset and 2% on LibriSpeech. This method is particularly useful on targets that involve non-alphabetic characters or have non-standard pronunciations.
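
The abstract does not spell out the algorithm, so the following is only an illustrative sketch of the general idea, not the authors' implementation. It assumes a hypothetical normalizer (lowercasing and replacing non-alphanumeric characters with spaces), a hypothetical boost table keyed by normalized n-grams, and a simple rule that splits one fixed weight evenly across the tokens of a keyword so that a multi-token target receives the same total boost as a single-token one. All function names and the default weight are assumptions made for illustration.

```python
import re


def normalize(text):
    """Assumed normalization: lowercase, turn anything other than
    letters, digits, apostrophes, and spaces into a space, then split."""
    text = text.lower()
    text = re.sub(r"[^a-z0-9' ]+", " ", text)
    return tuple(text.split())


def build_boost_table(keywords, weight=2.0):
    """Step 1 (assumed): map each normalized n-gram target to a per-token
    boost. Dividing by the token count keeps the total boost of an
    n-gram equal to `weight`, whatever its length."""
    table = {}
    for keyword in keywords:
        tokens = normalize(keyword)
        if tokens:
            table[tokens] = weight / len(tokens)
    return table


def boost_score(hypothesis, table):
    """Step 2 (assumed): add the boost for every occurrence of a
    normalized target n-gram in the normalized hypothesis."""
    tokens = normalize(hypothesis)
    bonus = 0.0
    for ngram, per_token in table.items():
        n = len(ngram)
        for i in range(len(tokens) - n + 1):
            if tokens[i:i + n] == ngram:
                bonus += per_token * n
    return bonus


table = build_boost_table(["Wi-Fi 6", "Kubernetes"])
multi = boost_score("we upgraded to wi fi 6 today", table)
single = boost_score("the cluster runs on kubernetes", table)
```

Under these assumptions the raw target "Wi-Fi 6" is matched against the hypothesis tokens "wi fi 6" even though the hyphen never appears in the transcript, and `multi` and `single` receive the same total boost, so the three-token keyword is not favored over the one-token keyword.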

research
10/06/2021

Spell my name: keyword boosted speech recognition

Recognition of uncommon words such as names and technical terminology is...
research
01/12/2019

Prototypical Metric Transfer Learning for Continuous Speech Keyword Spotting With Limited Training Data

Continuous Speech Keyword Spotting (CSKS) is the problem of spotting key...
research
06/25/2018

Fast ASR-free and almost zero-resource keyword spotting using DTW and CNNs for humanitarian monitoring

We use dynamic time warping (DTW) as supervision for training a convolut...
research
07/11/2018

Efficient keyword spotting using time delay neural networks

This paper describes a novel method of live keyword spotting using a two...
research
11/16/2022

PBSM: Backdoor attack against Keyword spotting based on pitch boosting and sound masking

Keyword spotting (KWS) has been widely used in various speech control sc...
research
05/24/2022

Boosting Tail Neural Network for Realtime Custom Keyword Spotting

In this paper, we propose a Boosting Tail Neural Network (BTNN) for impr...
