Adaptive Multi-Corpora Language Model Training for Speech Recognition

11/09/2022
by   Yingyi Ma, et al.

Neural network language models (NNLMs) play an essential role in automatic speech recognition (ASR) systems, especially in adaptation tasks where only text data is available. In practice, an NNLM is typically trained on a combination of data sampled from multiple corpora, so the data sampling strategy is important to adaptation performance. Most existing work focuses on designing static sampling strategies. However, each corpus may have a different impact at different stages of NNLM training. In this paper, we introduce a novel adaptive multi-corpora training algorithm that dynamically learns and adjusts the sampling probability of each corpus over the course of training. The algorithm is robust to corpora sizes and domain relevance. Compared with static sampling strategy baselines, the proposed approach yields remarkable improvement by achieving up to relative 7 in-domain and out-of-domain adaptation tasks, respectively.
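To make the contrast between static and adaptive sampling concrete, here is a minimal Python sketch. It is not the algorithm from the paper, whose update rule the abstract does not describe. The function names, the exponentiated-weight update, the `step_size` and `floor` parameters, and the per-corpus `rewards` (for instance, how much a step on each corpus lowered an in-domain dev loss) are all illustrative assumptions.

```python
import math
import random


def sample_batch(corpora, probs, batch_size, rng):
    """Draw a mixed batch: choose a corpus for each example according to probs."""
    names = list(corpora)
    weights = [probs[name] for name in names]
    picks = rng.choices(names, weights=weights, k=batch_size)
    return [(name, rng.choice(corpora[name])) for name in picks]


def update_probs(probs, rewards, step_size=0.5, floor=0.01):
    """Shift sampling mass toward corpora with higher reward.

    Assumption: a multiplicative (exponentiated) update, used here only
    for illustration. It is NOT the paper's learning rule.
    """
    scaled = {n: probs[n] * math.exp(step_size * rewards[n]) for n in probs}
    total = sum(scaled.values())
    # Approximate floor so no corpus is starved; renormalize afterwards.
    floored = {n: max(v / total, floor) for n, v in scaled.items()}
    norm = sum(floored.values())
    return {n: v / norm for n, v in floored.items()}


# Toy corpora (hypothetical sentences).
corpora = {
    "in_domain": ["play some jazz", "call my sister"],
    "out_of_domain": ["the market closed higher", "rain is expected"],
}
rng = random.Random(0)

# A static strategy keeps these probabilities fixed for all of training.
probs = {"in_domain": 0.5, "out_of_domain": 0.5}
batch = sample_batch(corpora, probs, batch_size=4, rng=rng)

# An adaptive strategy revises them as training proceeds.
hypothetical_rewards = {"in_domain": 1.0, "out_of_domain": 0.0}
probs = update_probs(probs, hypothetical_rewards)
```

With a static strategy, `probs` would stay at its initial value. Here each call to `update_probs` moves probability toward whichever corpus is currently reported as more useful, which is the general behavior the abstract attributes to adaptive training.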


research
09/27/2021

Factorized Neural Transducer for Efficient Language Model Adaptation

In recent years, end-to-end (E2E) based automatic speech recognition (AS...
research
09/18/2023

Corpus Synthesis for Zero-shot ASR domain Adaptation using Large Language Models

While Automatic Speech Recognition (ASR) systems are widely used in many...
research
08/02/2019

Multilingual Speech Recognition with Corpus Relatedness Sampling

Multilingual acoustic models have been successfully applied to low-resou...
research
09/07/2017

Cynical Selection of Language Model Training Data

The Moore-Lewis method of "intelligent selection of language model train...
research
09/28/2021

Private Language Model Adaptation for Speech Recognition

Speech model adaptation is crucial to handle the discrepancy between ser...
research
04/09/2021

Language model fusion for streaming end to end speech recognition

Streaming processing of speech audio is required for many contemporary p...
research
05/04/2020

Fast and Robust Unsupervised Contextual Biasing for Speech Recognition

Automatic speech recognition (ASR) system is becoming a ubiquitous techn...
