Optimizing Bilingual Neural Transducer with Synthetic Code-switching Text Generation

10/21/2022
by Thien Nguyen, et al.

Code-switching describes the practice of using more than one language in the same sentence. In this study, we investigate how to optimize a neural-transducer-based bilingual automatic speech recognition (ASR) model for code-switching speech. Focusing on the scenario where the ASR model is trained without supervised code-switching data, we find that semi-supervised training and synthetic code-switched data can improve the bilingual ASR system on code-switching speech. We analyze how each of the neural transducer's encoders contributes to code-switching performance by measuring encoder-specific recall values, and we evaluate our English/Mandarin system on the ASCEND data set. Our final system achieves a 25% mixed error rate (MER) on the English/Mandarin code-switching test set, reducing the MER by 2.1% compared to the previous literature, while maintaining good accuracy on the monolingual test sets.
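
The abstract reports results as MER but does not spell out how it is computed. As a rough illustration only, the sketch below uses the common convention for English/Mandarin code-switching evaluation: Mandarin is scored per character, English per word, and the error rate is the edit distance divided by the number of reference tokens. The tokenization rule, the lowercasing, and the function names (`tokenize_mixed`, `edit_distance`, `mixed_error_rate`) are assumptions made for this example and are not taken from the paper.

```python
import re

# Assumption: each CJK ideograph is one token; each run of Latin letters,
# digits, or apostrophes is one word token. Punctuation is dropped.
_TOKEN = re.compile(r"[\u4e00-\u9fff]|[A-Za-z0-9']+")


def tokenize_mixed(text):
    """Split mixed text into Mandarin characters and lowercased English words."""
    return _TOKEN.findall(text.lower())


def edit_distance(ref, hyp):
    """Levenshtein distance between two token lists (row-by-row DP)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def mixed_error_rate(references, hypotheses):
    """Total edit errors over total reference tokens, across all utterances."""
    errors = 0
    total = 0
    for ref, hyp in zip(references, hypotheses):
        ref_tokens = tokenize_mixed(ref)
        errors += edit_distance(ref_tokens, tokenize_mixed(hyp))
        total += len(ref_tokens)
    if total == 0:
        raise ValueError("references contain no tokens")
    return errors / total


if __name__ == "__main__":
    refs = ["我想要 open the window"]
    hyps = ["我想 open a window"]
    print(f"MER: {mixed_error_rate(refs, hyps):.3f}")
```

In the toy pair above, the reference has six tokens (three characters and three words) and the hypothesis drops one character and substitutes one word, so two errors are counted against six reference tokens. A real evaluation would also need the text normalization used by the benchmark, which is not described here.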
