Neural Arabic Text Diacritization: State of the Art Results and a Novel Approach for Machine Translation

11/08/2019
by   Ali Fadel, et al.
0

In this work, we present several deep learning models for the automatic diacritization of Arabic text. Our models are built using two main approaches, viz. Feed-Forward Neural Network (FFNN) and Recurrent Neural Network (RNN), with several enhancements such as 100-hot encoding, embeddings, Conditional Random Field (CRF) and Block-Normalized Gradient (BNG). The models are tested on the only freely available benchmark dataset and the results show that our models are either better or on par with other models, which require language-dependent post-processing steps, unlike ours. Moreover, we show that diacritics in Arabic can be used to enhance the models of NLP tasks such as Machine Translation (MT) by proposing the Translation over Diacritization (ToD) approach.

READ FULL TEXT
research
09/21/2023

OSN-MDAD: Machine Translation Dataset for Arabic Multi-Dialectal Conversations on Online Social Media

While resources for English language are fairly sufficient to understand...
research
01/09/2023

Automatic Standardization of Arabic Dialects for Machine Translation

Based on an annotated multimedia corpus, television series Marāyā 2013, ...
research
07/14/2019

Simple Automatic Post-editing for Arabic-Japanese Machine Translation

A common bottleneck for developing machine translation (MT) systems for ...
research
05/07/2019

Learning meters of Arabic and English poems with Recurrent Neural Networks: a step forward for language understanding and synthesis

Recognizing a piece of writing as a poem or prose is usually easy for th...
research
05/27/2022

TURJUMAN: A Public Toolkit for Neural Arabic Machine Translation

We present TURJUMAN, a neural toolkit for translating from 20 languages ...
research
03/22/2017

Classification-based RNN machine translation using GRUs

We report the results of our classification-based machine translation mo...
research
10/04/2017

Cross-Language Question Re-Ranking

We study how to find relevant questions in community forums when the lan...

Please sign up or login with your details

Forgot password? Click here to reset