Towards End-to-End Training of Automatic Speech Recognition for Nigerian Pidgin

10/21/2020
by   Daniel Ajisafe, et al.
0

Nigerian Pidgin remains one of the most popular languages in West Africa. With at least 75 million speakers along the West African coast, the language has spread to diasporic communities through Nigerian immigrants in England, Canada, and America, amongst others. In contrast, the language remains an under-resourced one in the field of natural language processing, particularly on speech recognition and translation tasks. In this work, we present the first parallel (speech-to-text) data on Nigerian pidgin. We also trained the first end-to-end speech recognition system (QuartzNet and Jasper model) on this language which were both optimized using Connectionist Temporal Classification (CTC) loss. With baseline results, we were able to achieve a low word error rate (WER) of 0.77 open-source the data and code along with this publication in order to encourage future research in this direction.

READ FULL TEXT
research
12/21/2022

End-to-End Automatic Speech Recognition model for the Sudanese Dialect

Designing a natural voice interface rely mostly on Speech recognition fo...
research
06/02/2023

Improved Training for End-to-End Streaming Automatic Speech Recognition Model with Punctuation

Punctuated text prediction is crucial for automatic speech recognition a...
research
07/07/2022

End-to-end Speech-to-Punctuated-Text Recognition

Conventional automatic speech recognition systems do not produce punctua...
research
08/23/2019

Deploying Technology to Save Endangered Languages

Computer scientists working on natural language processing, native speak...
research
05/08/2021

Robustness of end-to-end Automatic Speech Recognition Models – A Case Study using Mozilla DeepSpeech

When evaluating the performance of automatic speech recognition models, ...
research
05/11/2021

Investigating the Reordering Capability in CTC-based Non-Autoregressive End-to-End Speech Translation

We study the possibilities of building a non-autoregressive speech-to-te...
research
10/07/2016

A Baseline for Detecting Misclassified and Out-of-Distribution Examples in Neural Networks

We consider the two related problems of detecting if an example is miscl...

Please sign up or login with your details

Forgot password? Click here to reset