Improving Automatic Speech Recognition for Non-Native English with Transfer Learning and Language Model Decoding

02/10/2022
by   Peter Sullivan, et al.
0

ASR systems designed for native English (L1) usually underperform on non-native English (L2). To address this performance gap, (i) we extend our previous work to investigate fine-tuning of a pre-trained wav2vec 2.0 model <cit.> under a rich set of L1 and L2 training conditions. We further (ii) incorporate language model decoding in the ASR system, along with the fine-tuning method. Quantifying gains acquired from each of these two approaches separately and an error analysis allows us to identify different sources of improvement within our models. We find that while the large self-trained wav2vec 2.0 may be internalizing sufficient decoding knowledge for clean L1 speech <cit.>, this does not hold for L2 speech and accounts for the utility of employing language model decoding on L2 data.

READ FULL TEXT

page 2

page 9

page 13

research
10/01/2021

Speech Technology for Everyone: Automatic Speech Recognition for Non-Native English with Transfer Learning

To address the performance gap of English ASR models on L2 English speak...
research
05/25/2023

INTapt: Information-Theoretic Adversarial Prompt Tuning for Enhanced Non-Native Speech Recognition

Automatic Speech Recognition (ASR) systems have attained unprecedented p...
research
06/05/2023

Incorporating L2 Phonemes Using Articulatory Features for Robust Speech Recognition

The limited availability of non-native speech datasets presents a major ...
research
03/01/2023

Synthetic Cross-accent Data Augmentation for Automatic Speech Recognition

The awareness for biased ASR datasets or models has increased notably in...
research
07/04/2018

Investigating the role of L1 in automatic pronunciation evaluation of L2 speech

Automatic pronunciation evaluation plays an important role in pronunciat...
research
11/29/2022

Better Transcription of UK Supreme Court Hearings

Transcription of legal proceedings is very important to enable access to...
research
07/13/2023

Adapting an ASR Foundation Model for Spoken Language Assessment

A crucial part of an accurate and reliable spoken language assessment sy...

Please sign up or login with your details

Forgot password? Click here to reset