Off-Line Arabic Handwritten Words Segmentation using Morphological Operators

01/07/2021
by Nisreen AbdAllah, et al.

This study assesses and discusses a model for segmenting offline handwritten Arabic words. The proposed framework has three steps: pre-processing, segmentation, and evaluation. In the pre-processing step, morphological operators are applied to connect gaps (CGs) in written words. Gaps arise when the pen is lifted during writing, when documents are scanned, or when images are converted to binary. In the segmentation step, small diacritics are removed first, and then each connected component is bounded to segment the offline words. A large dataset covering a variety of handwriting styles was used so that the model is more compatible with real-life applications. For the automatic evaluation stage, 1,131 images were randomly selected from the IESK-ArDB database and segmented into sub-words. After small gaps were connected, the model's performance reached 88% on the database. The proposed model achieved the highest accuracy when compared with the related works.
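The pre-processing and segmentation steps described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the horizontal structuring element, the 8-connectivity, the area threshold for diacritics, and all function names (close_gaps, components, segment_subwords) are assumptions chosen for illustration, and the image is a toy binary grid rather than an IESK-ArDB sample.

```python
from collections import deque

def dilate(img, k):
    """Horizontal dilation with a 1 x (2k+1) window (assumed element shape)."""
    w = len(img[0])
    return [[int(any(row[max(0, x - k):min(w, x + k + 1)])) for x in range(w)]
            for row in img]

def erode(img, k):
    """Horizontal erosion with the same window; pixels past the border are ignored."""
    w = len(img[0])
    return [[int(all(row[max(0, x - k):min(w, x + k + 1)])) for x in range(w)]
            for row in img]

def close_gaps(img, k):
    """Morphological closing (dilation, then erosion): bridges gaps up to 2k wide."""
    return erode(dilate(img, k), k)

def components(img):
    """Label 8-connected components; return (area, (top, left, bottom, right))."""
    h, w = len(img), len(img[0])
    seen = [[False] * w for _ in range(h)]
    found = []
    for y in range(h):
        for x in range(w):
            if not img[y][x] or seen[y][x]:
                continue
            seen[y][x] = True
            queue = deque([(y, x)])
            area, top, left, bottom, right = 0, y, x, y, x
            while queue:
                cy, cx = queue.popleft()
                area += 1
                top, bottom = min(top, cy), max(bottom, cy)
                left, right = min(left, cx), max(right, cx)
                for ny in range(max(0, cy - 1), min(h, cy + 2)):
                    for nx in range(max(0, cx - 1), min(w, cx + 2)):
                        if img[ny][nx] and not seen[ny][nx]:
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            found.append((area, (top, left, bottom, right)))
    return found

def segment_subwords(img, gap=1, min_area=2):
    """Close small gaps, drop small components (treated here as diacritics),
    and return bounding boxes ordered right to left, as Arabic is read."""
    closed = close_gaps(img, gap)
    boxes = [box for area, box in components(closed) if area >= min_area]
    return sorted(boxes, key=lambda b: b[3], reverse=True)

# Toy image: one dot (row 1), a stroke broken by a 1-pixel gap, and a
# second stroke separated by a 3-pixel gap (row 3).
demo = [
    [0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0],
    [0, 0, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0],
    [0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0],
    [1, 1, 1, 0, 1, 1, 0, 0, 0, 1, 1, 1],
    [0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0],
]

print(segment_subwords(demo))  # [(3, 9, 3, 11), (3, 0, 3, 5)]
```

In this toy case the 1-pixel break is bridged so the left stroke is reported as one sub-word, while the 3-pixel gap is wider than the closing can span and still separates the two sub-words. With gap=0 the same image yields three boxes, which is the over-segmentation that gap connection is meant to avoid.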


