Printed Arabic Text Recognition using Linear and Nonlinear Regression

02/05/2017
by Ashraf A. Shahin, et al.

Arabic is one of the most widely spoken languages in the world: hundreds of millions of people in many countries speak it as their native language. However, owing to the complexity of the Arabic language, recognition of printed and handwritten Arabic text remained largely unexplored for a very long time compared with English and Chinese. Although a significant amount of research on recognizing printed and handwritten Arabic text has been done in the last few years, it is still an open research field because of the cursive nature of Arabic script. This paper proposes an automatic printed Arabic text recognition technique based on linear and ellipse regression. After all possible forms of each character are collected, a unique code is generated to represent each character form. Each code contains a sequence of lines and ellipses. To recognize fonts, a unique list of codes is identified for each font and used as its fingerprint. The proposed technique has been evaluated on over 14000 different Arabic words in different fonts, and experimental results show that its average recognition rate is 86
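The idea of describing a character form as a sequence of lines and ellipses can be illustrated with a minimal sketch. This is not the paper's implementation: the abstract does not give the coding scheme, so the function names, the "L"/"E" symbols, the tolerance value, and the assumption that a character's contour has already been split into point segments are all hypothetical. The sketch fits a line by total least squares, falls back to an algebraic least-squares conic fit, and accepts the conic as an ellipse only when its discriminant is negative.

```python
import numpy as np

def line_rms(points):
    """RMS orthogonal distance of the points to their best-fit line."""
    pts = np.asarray(points, dtype=float)
    centered = pts - pts.mean(axis=0)
    # The smallest singular value measures the spread perpendicular to the line.
    singular_values = np.linalg.svd(centered, compute_uv=False)
    return singular_values[-1] / np.sqrt(len(pts))

def fit_conic(points):
    """Least-squares conic a*x^2 + b*x*y + c*y^2 + d*x + e*y + f = 0 (unit-norm coefficients)."""
    pts = np.asarray(points, dtype=float)
    x, y = pts[:, 0], pts[:, 1]
    design = np.column_stack([x * x, x * y, y * y, x, y, np.ones_like(x)])
    # The right singular vector of the smallest singular value minimizes ||design @ coeffs||.
    _, _, vt = np.linalg.svd(design)
    return vt[-1]

def is_ellipse(coeffs):
    """A conic is an ellipse when its discriminant b^2 - 4ac is negative."""
    a, b, c = coeffs[:3]
    return b * b - 4 * a * c < 0

def encode_segments(segments, line_tol=0.05):
    """Map each segment to 'L' (line), 'E' (ellipse), or '?' (neither). Symbols are hypothetical."""
    code = []
    for seg in segments:
        if line_rms(seg) <= line_tol:  # line_tol is an arbitrary illustrative threshold
            code.append("L")
        elif is_ellipse(fit_conic(seg)):
            code.append("E")
        else:
            code.append("?")
    return "".join(code)

# Usage: one straight stroke followed by a half-circle stroke.
t = np.linspace(0.0, 1.0, 10)
straight = np.column_stack([t, 2 * t + 1])
theta = np.linspace(0.0, np.pi, 20)
arc = np.column_stack([np.cos(theta), np.sin(theta)])
print(encode_segments([straight, arc]))
```

In a scheme of this kind, the resulting string would be compared against the stored codes of known character forms; how the paper performs that matching and how it builds font fingerprints is not stated in the abstract.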

research
08/22/2013

A review on handwritten character and numeral recognition for Roman, Arabic, Chinese and Indian scripts

There are a lot of intensive researches on handwritten character recogni...
research
09/04/2020

A Hybrid Deep Learning Model for Arabic Text Recognition

Arabic text recognition is a challenging task because of the cursive nat...
research
01/20/2013

Recurrent Neural Network Method in Arabic Words Recognition System

The recognition of unconstrained handwriting continues to be a difficult...
research
05/12/2023

Towards Transliteration between Sindhi Scripts from Devanagari to Perso-Arabic

In this paper, we have shown a script conversion (transliteration) techn...
research
08/02/2021

Correcting Arabic Soft Spelling Mistakes using BiLSTM-based Machine Learning

Soft spelling errors are a class of spelling mistakes that is widespread...
research
02/03/2021

A Trainless Recognition of Handwritten Persian/Arabic Letters using Primitive Elements

This paper aim at applying primitive elements composing Persian/Arabic l...
research
11/06/2021

CALText: Contextual Attention Localization for Offline Handwritten Text

Recognition of Arabic-like scripts such as Persian and Urdu is more chal...
