Damage Control During Domain Adaptation for Transducer Based Automatic Speech Recognition

10/06/2022
by Somshubra Majumdar, et al.

Automatic speech recognition models are often adapted to improve their accuracy in a new domain. A potential drawback of adapting a model to a new domain is catastrophic forgetting, where the Word Error Rate on the original domain degrades significantly. This paper addresses the setting in which we want to adapt an automatic speech recognition model to a new domain while limiting the degradation of accuracy on the original domain, without access to the original training dataset. We propose several techniques, such as a limited training strategy and regularized adapter modules for the Transducer encoder, prediction, and joiner networks. We apply these methods to the Google Speech Commands dataset and to the UK and Ireland English Dialect speech dataset, and obtain strong results on the new target domain while limiting the degradation on the original domain.
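
The abstract measures forgetting through Word Error Rate on the original domain. As a minimal sketch of how that quantity could be tracked, the snippet below computes WER as word-level edit distance divided by reference length, plus a relative degradation figure. The function names `word_error_rate` and `relative_wer_degradation` are hypothetical helpers introduced here for illustration, and the relative-degradation formula is an assumption, not necessarily the metric reported in the paper.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """WER = (substitutions + deletions + insertions) / reference word count."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution_cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,                       # deletion
                curr[j - 1] + 1,                   # insertion
                prev[j - 1] + substitution_cost,   # match or substitution
            )
        prev = curr
    return prev[-1] / len(ref)


def relative_wer_degradation(wer_before: float, wer_after: float) -> float:
    """Hypothetical helper: relative WER change on the original domain
    after adaptation (positive means the adapted model got worse)."""
    if wer_before <= 0:
        raise ValueError("wer_before must be positive")
    return (wer_after - wer_before) / wer_before
```

For instance, `word_error_rate("turn the lights on", "turn the light on")` counts one substitution over four reference words. Comparing the original-domain WER before and after adaptation with `relative_wer_degradation` gives one rough way to quantify how much forgetting an adaptation run caused.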

research
08/04/2021

Unsupervised Domain Adaptation in Speech Recognition using Phonetic Features

Automatic speech recognition is a difficult problem in pattern recogniti...
research
10/01/2019

Domain Expansion in DNN-based Acoustic Models for Robust Speech Recognition

Training acoustic models with sequentially incoming data – while both le...
research
01/02/2020

Attention based on-device streaming speech recognition with large speech corpus

In this paper, we present a new on-device automatic speech recognition (...
research
02/23/2023

Evaluating Automatic Speech Recognition in an Incremental Setting

The increasing reliability of automatic speech recognition has prolifera...
research
04/17/2019

A Multi-Task Learning Framework for Overcoming the Catastrophic Forgetting in Automatic Speech Recognition

Recently, data-driven based Automatic Speech Recognition (ASR) systems h...
research
07/02/2019

Latent Dirichlet Allocation Based Acoustic Data Selection for Automatic Speech Recognition

Selecting in-domain data from a large pool of diverse and out-of-domain ...
research
07/12/2022

End-to-end speech recognition modeling from de-identified data

De-identification of data used for automatic speech recognition modeling...
