A Hybrid Swarm and Gravitation based feature selection algorithm for Handwritten Indic Script Classification problem

05/10/2020
by   Ritam Guha, et al.
1

In any multi-script environment, handwritten script classification is of paramount importance before the document images are fed to their respective Optical Character Recognition (OCR) engines. Over the years, this complex pattern classification problem has been solved by researchers proposing various feature vectors mostly having large dimension, thereby increasing the computation complexity of the whole classification model. Feature Selection (FS) can serve as an intermediate step to reduce the size of the feature vectors by restricting them only to the essential and relevant features. In our paper, we have addressed this issue by introducing a new FS algorithm, called Hybrid Swarm and Gravitation based FS (HSGFS). This algorithm is made to run on 3 feature vectors introduced in the literature recently - Distance-Hough Transform (DHT), Histogram of Oriented Gradients (HOG) and Modified log-Gabor (MLG) filter Transform. Three state-of-the-art classifiers namely, Multi-Layer Perceptron (MLP), K-Nearest Neighbour (KNN) and Support Vector Machine (SVM) are used for the handwritten script classification. Handwritten datasets, prepared at block, text-line and word level, consisting of officially recognized 12 Indic scripts are used for the evaluation of our method. An average improvement in the range of 2-5 accuracies by utilizing only about 75-80 all three datasets. The proposed methodology also shows better performance when compared to some popularly used FS models.

READ FULL TEXT

page 19

page 20

page 34

page 35

page 36

page 37

research
01/25/2022

A Classical Approach to Handcrafted Feature Extraction Techniques for Bangla Handwritten Digit Recognition

Bangla Handwritten Digit recognition is a significant step forward in th...
research
07/26/2017

A Harmony Search Based Wrapper Feature Selection Method for Holistic Bangla word Recognition

A lot of search approaches have been explored for the selection of featu...
research
04/26/2016

A New Approach in Persian Handwritten Letters Recognition Using Error Correcting Output Coding

Classification Ensemble, which uses the weighed polling of outputs, is t...
research
09/16/2020

Handwritten Script Identification from Text Lines

In a multilingual country like India where 12 different official scripts...
research
05/10/2020

Atom Search Optimization with Simulated Annealing – a Hybrid Metaheuristic Approach for Feature Selection

'Hybrid meta-heuristics' is one of the most interesting recent trends in...
research
10/29/2017

A Saak Transform Approach to Efficient, Scalable and Robust Handwritten Digits Recognition

An efficient, scalable and robust approach to the handwritten digits rec...
research
04/03/2020

Sparse Concept Coded Tetrolet Transform for Unconstrained Odia Character Recognition

Feature representation in the form of spatio-spectral decomposition is o...

Please sign up or login with your details

Forgot password? Click here to reset