A Comparison of Methods for OOV-word Recognition on a New Public Dataset

07/16/2021
by   Rudolf A. Braun, et al.
0

A common problem for automatic speech recognition systems is how to recognize words that they did not see during training. Currently there is no established method of evaluating different techniques for tackling this problem. We propose using the CommonVoice dataset to create test sets for multiple languages which have a high out-of-vocabulary (OOV) ratio relative to a training set and release a new tool for calculating relevant performance metrics. We then evaluate, within the context of a hybrid ASR system, how much better subword models are at recognizing OOVs, and how much benefit one can get from incorporating OOV-word information into an existing system by modifying WFSTs. Additionally, we propose a new method for modifying a subword-based language model so as to better recognize OOV-words. We showcase very large improvements in OOV-word recognition and make both the data and code available.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/17/2023

Syllable Subword Tokens for Open Vocabulary Speech Recognition in Malayalam

In a hybrid automatic speech recognition (ASR) system, a pronunciation l...
research
07/05/2021

Instant One-Shot Word-Learning for Context-Specific Neural Sequence-to-Sequence Speech Recognition

Neural sequence-to-sequence systems deliver state-of-the-art performance...
research
03/19/2020

Techniques for Vocabulary Expansion in Hybrid Speech Recognition Systems

The problem of out of vocabulary words (OOV) is typical for any speech r...
research
08/10/2019

Unsupervised Stemming based Language Model for Telugu Broadcast News Transcription

In Indian Languages , native speakers are able to understand new words f...
research
06/16/2015

Recognize Foreign Low-Frequency Words with Similar Pairs

Low-frequency words place a major challenge for automatic speech recogni...
research
12/12/2002

Exploiting Context When Learning to Classify

This paper addresses the problem of classifying observations when featur...
research
09/11/2022

Applying wav2vec2 for Speech Recognition on Bengali Common Voices Dataset

Speech is inherently continuous, where discrete words, phonemes and othe...

Please sign up or login with your details

Forgot password? Click here to reset