Continual Learning for End-to-End ASR by Averaging Domain Experts

05/12/2023
by   Peter Plantinga, et al.
0

Continual learning for end-to-end automatic speech recognition has to contend with a number of difficulties. Fine-tuning strategies tend to lose performance on data already seen, a process known as catastrophic forgetting. On the other hand, strategies that freeze parameters and append tunable parameters must maintain multiple models. We suggest a strategy that maintains only a single model for inference and avoids catastrophic forgetting. Our experiments show that a simple linear interpolation of several models' parameters, each fine-tuned from the same generalist model, results in a single model that performs well on all tested data. For our experiments we selected two open-source end-to-end speech recognition models pre-trained on large datasets and fine-tuned them on 3 separate datasets: SGPISpeech, CORAAL, and DiPCo. The proposed average of domain experts model performs well on all tested data, and has almost no loss in performance on data from the domain of original training.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/17/2021

Continual Learning for Monolingual End-to-End Automatic Speech Recognition

Adapting Automatic Speech Recognition (ASR) models to new domains leads ...
research
06/26/2023

Continual Learning for Out-of-Distribution Pedestrian Detection

A continual learning solution is proposed to address the out-of-distribu...
research
07/01/2021

Improving Human Motion Prediction Through Continual Learning

Human motion prediction is an essential component for enabling closer hu...
research
07/11/2022

Online Continual Learning of End-to-End Speech Recognition Models

Continual Learning, also known as Lifelong Learning, aims to continually...
research
10/13/2021

Continual learning using lattice-free MMI for speech recognition

Continual learning (CL), or domain expansion, recently became a popular ...
research
08/07/2023

WIKITIDE: A Wikipedia-Based Timestamped Definition Pairs Dataset

A fundamental challenge in the current NLP context, dominated by languag...
research
04/04/2021

Towards Lifelong Learning of End-to-end ASR

Automatic speech recognition (ASR) technologies today are primarily opti...

Please sign up or login with your details

Forgot password? Click here to reset