Learning meters of Arabic and English poems with Recurrent Neural Networks: a step forward for language understanding and synthesis

05/07/2019
by   Waleed A. Yousef, et al.
0

Recognizing a piece of writing as a poem or prose is usually easy for the majority of people; however, only specialists can determine which meter a poem belongs to. In this paper, we build Recurrent Neural Network (RNN) models that can classify poems according to their meters from plain text. The input text is encoded at the character level and directly fed to the models without feature handcrafting. This is a step forward for machine understanding and synthesis of languages in general, and Arabic language in particular. Among the 16 poem meters of Arabic and the 4 meters of English the networks were able to correctly classify poem with an overall accuracy of 96.38% and 82.31% respectively. The poem datasets used to conduct this research were massive, over 1.5 million of verses, and were crawled from different nontechnical sources, almost Arabic and English literature sites, and in different heterogeneous and unstructured formats. These datasets are now made publicly available in clean, structured, and documented format for other future research. To the best of the authors' knowledge, this research is the first to address classifying poem meters in a machine learning approach, in general, and in RNN featureless based approach, in particular. In addition, the dataset is the first publicly available dataset ready for the purpose of future computational research.

READ FULL TEXT
research
12/21/2022

ORCA: A Challenging Benchmark for Arabic Language Understanding

Due to their crucial role in all NLP, several benchmarks have been propo...
research
04/23/2020

Transliteration of Judeo-Arabic Texts into Arabic Script Using Recurrent Neural Networks

Many of the great Jewish works of the Middle Ages were written in Judeo-...
research
11/08/2019

Neural Arabic Text Diacritization: State of the Art Results and a Novel Approach for Machine Translation

In this work, we present several deep learning models for the automatic ...
research
07/22/2020

A Transfer Learning End-to-End ArabicText-To-Speech (TTS) Deep Architecture

Speech synthesis is the artificial production of human speech. A typical...
research
07/13/2018

Image Classification for Arabic: Assessing the Accuracy of Direct English to Arabic Translations

Image classification is an ongoing research challenge. Most of the avail...
research
06/17/2021

A Deep Belief Network Classification Approach for Automatic Diacritization of Arabic Text

Deep learning has emerged as a new area of machine learning research. It...
research
01/08/2019

Computational Register Analysis and Synthesis

The study of register in computational language research has historicall...

Please sign up or login with your details

Forgot password? Click here to reset