Speech Technology for Everyone: Automatic Speech Recognition for Non-Native English with Transfer Learning

10/01/2021
by   Toshiko Shibano, et al.
0

To address the performance gap of English ASR models on L2 English speakers, we evaluate fine-tuning of pretrained wav2vec 2.0 models (Baevski et al., 2020; Xu et al., 2021) on L2-ARCTIC, a non-native English speech corpus (Zhao et al., 2018) under different training settings. We compare (a) models trained with a combination of diverse accents to ones trained with only specific accents and (b) results from different single-accent models. Our experiments demonstrate the promise of developing ASR models for non-native English speakers, even with small amounts of L2 training data and even without a language model. Our models also excel in the zero-shot setting where we train on multiple L2 datasets and test on a blind L2 test set.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/16/2020

AccentDB: A Database of Non-Native English Accents to Assist Neural Speech Recognition

Modern Automatic Speech Recognition (ASR) technology has evolved to iden...
research
02/10/2022

Improving Automatic Speech Recognition for Non-Native English with Transfer Learning and Language Model Decoding

ASR systems designed for native English (L1) usually underperform on non...
research
11/10/2021

Scaling ASR Improves Zero and Few Shot Learning

With 4.5 million hours of English speech from 10 different sources acros...
research
12/12/2021

Learning Nigerian accent embeddings from speech: preliminary results based on SautiDB-Naija corpus

This paper describes foundational efforts with SautiDB-Naija, a novel co...
research
11/13/2018

Corpus Phonetics Tutorial

Corpus phonetics has become an increasingly popular method of research i...
research
09/18/2019

Do We Need Neural Models to Explain Human Judgments of Acceptability?

Native speakers can judge whether a sentence is an acceptable instance o...
research
09/20/2021

Model Bias in NLP – Application to Hate Speech Classification

This document sums up our results forthe NLP lecture at ETH in the sprin...

Please sign up or login with your details

Forgot password? Click here to reset