Unsupervised ASR via Cross-Lingual Pseudo-Labeling

05/19/2023
by   Tatiana Likhomanenko, et al.
0

Recent work has shown that it is possible to train an unsupervised automatic speech recognition (ASR) system using only unpaired audio and text. Existing unsupervised ASR methods assume that no labeled data can be used for training. We argue that even if one does not have any labeled audio for a given language, there is always labeled data available for other languages. We show that it is possible to use character-level acoustic models (AMs) from other languages to bootstrap an unsupervised AM in a new language. Here, "unsupervised" means no labeled audio is available for the target language. Our approach is based on two key ingredients: (i) generating pseudo-labels (PLs) of the target language using some other language AM and (ii) constraining these PLs with a target language model. Our approach is effective on Common Voice: e.g. transfer of English AM to Swahili achieves 18 character-based wav2vec-U 2.0 by 15 labeled German data instead of 60k hours of unlabeled English data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/03/2019

Deep Contextualized Acoustic Representations For Semi-Supervised Speech Recognition

We propose a novel approach to semi-supervised automatic speech recognit...
research
10/24/2022

10 hours data is all you need

We propose a novel procedure to generate pseudo mandarin speech data nam...
research
08/04/2018

Language Model Supervision for Handwriting Recognition Model Adaptation

Training state-of-the-art offline handwriting recognition (HWR) models r...
research
04/03/2022

Automatic Dialect Density Estimation for African American English

In this paper, we explore automatic prediction of dialect density of the...
research
03/31/2022

Effectiveness of text to speech pseudo labels for forced alignment and cross lingual pretrained models for low resource speech recognition

In the recent years end to end (E2E) automatic speech recognition (ASR) ...
research
08/19/2022

Pseudo-Labels Are All You Need

Automatically estimating the complexity of texts for readers has a varie...
research
10/17/2022

Continuous Pseudo-Labeling from the Start

Self-training (ST), or pseudo-labeling has sparked significant interest ...

Please sign up or login with your details

Forgot password? Click here to reset