Data Augmentation for End-to-end Code-switching Speech Recognition

11/04/2020
by   Chenpeng Du, et al.
11

Training a code-switching end-to-end automatic speech recognition (ASR) model normally requires a large amount of data, while code-switching data is often limited. In this paper, three novel approaches are proposed for code-switching data augmentation. Specifically, they are audio splicing with the existing code-switching data, and TTS with new code-switching texts generated by word translation or word insertion. Our experiments on 200 hours Mandarin-English code-switching dataset show that all the three proposed approaches yield significant improvements on code-switching ASR individually. Moreover, all the proposed approaches can be combined with recent popular SpecAugment, and an addition gain can be obtained. WER is significantly reduced by relative 24.0 compared to the system without any data augmentation, and still relative 13.0 gain compared to the system with only SpecAugment

READ FULL TEXT
research
06/14/2023

Improving Code-Switching and Named Entity Recognition in ASR with Speech Editing based Data Augmentation

Recently, end-to-end (E2E) automatic speech recognition (ASR) models hav...
research
10/31/2018

Towards End-to-End Code-Switching Speech Recognition

Code-switching speech recognition has attracted an increasing interest r...
research
03/20/2023

Code-Switching Text Generation and Injection in Mandarin-English ASR

Code-switching speech refers to a means of expression by mixing two or m...
research
10/12/2020

Improving Low Resource Code-switched ASR using Augmented Code-switched TTS

Building Automatic Speech Recognition (ASR) systems for code-switched sp...
research
10/28/2020

Decoupling Pronunciation and Language for End-to-end Code-switching Automatic Speech Recognition

Despite the recent significant advances witnessed in end-to-end (E2E) AS...
research
05/30/2022

Adversarial synthesis based data-augmentation for code-switched spoken language identification

Spoken Language Identification (LID) is an important sub-task of Automat...
research
05/26/2023

Code-Switched Text Synthesis in Unseen Language Pairs

Existing efforts on text synthesis for code-switching mostly require tra...

Please sign up or login with your details

Forgot password? Click here to reset