Multi-Dialect Arabic Speech Recognition

12/25/2021
by Abbas Raza Ali, et al.

This paper presents the design and development of a multi-dialect automatic speech recognition system for Arabic. Deep neural networks are becoming an effective tool for sequential data problems, particularly when the system is trained end to end. Arabic speech recognition is a complex task because of the existence of multiple dialects, the lack of large corpora, and missing vocalization. The first contribution of this work is therefore the development of a large multi-dialectal corpus with fully or at least partially vocalized transcriptions. The open-source corpus has been gathered from multiple sources, which introduce non-standard Arabic characters into the transcriptions; these are normalized by defining a common character set. The second contribution is the development of a framework to train an acoustic model that achieves state-of-the-art performance. The network architecture combines convolutional and recurrent layers. Spectrogram features of the audio data are extracted in the frequency vs. time domain and fed into the network. The output frames produced by the recurrent model are further trained to align the audio features with their corresponding transcription sequences. The sequence alignment is performed using a beam search decoder with a tetra-gram (4-gram) language model. The proposed system achieved a 14
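The character-set normalization mentioned in the abstract could look roughly like the following minimal sketch. The abstract does not list the actual mapping, so the `CHAR_MAP` table, the `normalize_transcript` function, and the `keep_diacritics` flag are all hypothetical and chosen only for illustration: look-alike letters that often come from Persian or Urdu keyboards are folded onto standard Arabic code points, the tatweel elongation mark is dropped, and diacritics are kept by default because the corpus is described as at least partially vocalized.

```python
# Hypothetical mapping (an assumption, not taken from the paper): fold
# look-alike letters onto their standard Arabic code points.
CHAR_MAP = {
    "\u06CC": "\u064A",  # Farsi yeh -> Arabic yeh
    "\u06A9": "\u0643",  # keheh -> Arabic kaf
    "\u0640": "",        # tatweel (elongation mark) is purely typographic
}

# Arabic vocalization marks (tanween, harakat, shadda, sukun): U+064B..U+0652.
DIACRITICS = {chr(code) for code in range(0x064B, 0x0653)}


def normalize_transcript(text, keep_diacritics=True):
    """Map a transcript onto a common character set (illustrative only)."""
    out = []
    for ch in text:
        ch = CHAR_MAP.get(ch, ch)  # replace or drop non-standard characters
        if not keep_diacritics and ch in DIACRITICS:
            continue  # optionally strip vocalization
        out.append(ch)
    # Collapse runs of whitespace and trim the ends.
    return " ".join("".join(out).split())


# Usage: keheh + tatweel + teh + alef + beh becomes kaf + teh + alef + beh.
print(normalize_transcript("\u06A9\u0640\u062A\u0627\u0628") == "\u0643\u062A\u0627\u0628")
```

A lookup table keeps the rule set easy to audit and extend as new sources contribute further variants; whether diacritics should be preserved or stripped depends on how the vocalized transcriptions are used in training, which the abstract leaves open.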


