A prototype system for handwritten sub-word recognition: Toward Arabic-manuscript transliteration

11/14/2011
by   Reza Farrahi Moghaddam, et al.
0

A prototype system for the transliteration of diacritics-less Arabic manuscripts at the sub-word or part of Arabic word (PAW) level is developed. The system is able to read sub-words of the input manuscript using a set of skeleton-based features. A variation of the system is also developed which reads archigraphemic Arabic manuscripts, which are dot-less, into archigraphemes transliteration. In order to reduce the complexity of the original highly multiclass problem of sub-word recognition, it is redefined into a set of binary descriptor classifiers. The outputs of trained binary classifiers are combined to generate the sequence of sub-word letters. SVMs are used to learn the binary classifiers. Two specific Arabic databases have been developed to train and test the system. One of them is a database of the Naskh style. The initial results are promising. The systems could be trained on other scripts found in Arabic manuscripts.

READ FULL TEXT
research
06/21/2017

Deep Learning Autoencoder Approach for Handwritten Arabic Digits Recognition

This paper presents a new unsupervised learning approach with stacked au...
research
05/20/2014

Dynamic Hierarchical Bayesian Network for Arabic Handwritten Word Recognition

This paper presents a new probabilistic graphical model used to model an...
research
11/17/2014

AlexU-Word: A New Dataset for Isolated-Word Closed-Vocabulary Offline Arabic Handwriting Recognition

In this paper, we introduce the first phase of a new dataset for offline...
research
05/06/2010

Multistage Hybrid Arabic/Indian Numeral OCR System

The use of OCR in postal services is not yet universal and there are sti...
research
04/11/2018

Problem of Multiple Diacritics Design for Arabic Script

This study focuses on the design of multiple Arabic diacritical marks an...
research
08/10/2018

Hybrid approach for transliteration of Algerian arabizi: a primary study

A hybrid approach for the transliteration of Algerian Arabizi: A primary...
research
04/11/2018

Aesthetical Attributes for Segmenting Arabic Word

The connected allograph representing calligraphic Arabic word does not a...

Please sign up or login with your details

Forgot password? Click here to reset