Techniques for Vocabulary Expansion in Hybrid Speech Recognition Systems

03/19/2020
by   Nikolay Malkovsky, et al.
0

The problem of out of vocabulary words (OOV) is typical for any speech recognition system, hybrid systems are usually constructed to recognize a fixed set of words and rarely can include all the words that will be encountered during exploitation of the system. One of the popular approach to cover OOVs is to use subword units rather then words. Such system can potentially recognize any previously unseen word if the word can be constructed from present subword units, but also non-existing words can be recognized. The other popular approach is to modify HMM part of the system so that it can be easily and effectively expanded with custom set of words we want to add to the system. In this paper we explore different existing methods of this solution on both graph construction and search method levels. We also present a novel vocabulary expansion techniques which solve some common internal subroutine problems regarding recognition graph processing.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/13/2017

Automatic Speech Recognition with Very Large Conversational Finnish and Estonian Vocabularies

Today, the vocabulary size for language models in large vocabulary speec...
research
07/16/2021

A Comparison of Methods for OOV-word Recognition on a New Public Dataset

A common problem for automatic speech recognition systems is how to reco...
research
05/22/2023

The neural dynamics of auditory word recognition and integration

Listeners recognize and integrate words in rapid and noisy everyday spee...
research
06/25/2015

How to improve robustness in Kohonen maps and display additional information in Factorial Analysis: application to text mining

This article is an extended version of a paper presented in the WSOM'201...
research
03/29/2022

Short-Term Word-Learning in a Dynamically Changing Environment

Neural sequence-to-sequence automatic speech recognition (ASR) systems a...
research
06/16/2015

Recognize Foreign Low-Frequency Words with Similar Pairs

Low-frequency words place a major challenge for automatic speech recogni...
research
07/13/2018

Large-Scale Visual Speech Recognition

This work presents a scalable solution to open-vocabulary visual speech ...

Please sign up or login with your details

Forgot password? Click here to reset