AlexU-Word: A New Dataset for Isolated-Word Closed-Vocabulary Offline Arabic Handwriting Recognition

11/17/2014
by Mohamed E. Hussein, et al.

In this paper, we introduce the first phase of a new dataset for offline Arabic handwriting recognition. The aim is to collect a very large dataset of isolated Arabic words that covers all letters of the alphabet in all possible shapes using a small number of simple words. The end goal is a very large dataset of segmented letter images, which can be used to build and evaluate Arabic handwriting recognition systems based on segmented letter recognition. The current version of the dataset contains 25,114 samples of 109 unique Arabic words that cover all possible shapes of all alphabet letters. The samples were collected from 907 writers. In its current form, the dataset can be used for closed-vocabulary word recognition. We evaluated a number of window-based descriptors and classifiers on this task and obtained an accuracy of 92.16% using a SIFT-based descriptor and ANN.
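
The coverage requirement described above (every letter in every shape, using few words) can be illustrated with a minimal sketch. It is not the authors' tooling, and the 109-word list is not reproduced here. The sketch assumes the 28 base letters only and the standard joining rule in which six letters never connect to the following letter; hamza variants, ta marbuta, alef maqsura, and the lam-alef ligature are ignored, so the paper's own definition of "shape" may differ. The function names `letter_shapes`, `required_shapes`, and `missing_shapes` are hypothetical.

```python
# Assumption: the 28 base Arabic letters, without hamza forms or ligatures.
ALPHABET = "ابتثجحخدذرزسشصضطظعغفقكلمنهوي"
# Letters that never join to the letter that follows them.
NON_CONNECTING = set("ادذرزو")


def letter_shapes(word):
    """Return (letter, form) pairs for a word written in base letters."""
    shapes = []
    for i, ch in enumerate(word):
        # Joined to the previous letter only if that letter connects forward.
        joins_prev = i > 0 and word[i - 1] not in NON_CONNECTING
        # Joined to the next letter only if this letter connects forward.
        joins_next = i < len(word) - 1 and ch not in NON_CONNECTING
        if joins_prev and joins_next:
            form = "medial"
        elif joins_prev:
            form = "final"
        elif joins_next:
            form = "initial"
        else:
            form = "isolated"
        shapes.append((ch, form))
    return shapes


def required_shapes():
    """All (letter, form) pairs that a covering word list must contain."""
    needed = set()
    for ch in ALPHABET:
        # Non-connecting letters have no initial or medial form.
        forms = ("isolated", "final") if ch in NON_CONNECTING else (
            "isolated", "initial", "medial", "final")
        needed.update((ch, f) for f in forms)
    return needed


def missing_shapes(words):
    """Return the required shapes that no word in `words` exhibits."""
    covered = set()
    for w in words:
        covered.update(letter_shapes(w))
    return required_shapes() - covered
```

Under these assumptions, `letter_shapes("باب")` yields an initial ب, a final ا, and an isolated ب, and a word list covers the alphabet when `missing_shapes(words)` returns an empty set.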
