Automatic Training Data Synthesis for Handwriting Recognition Using the Structural Crossing-Over Technique

10/09/2014
by   Sirisak Visessenee, et al.
0

The paper presents a novel technique called "Structural Crossing-Over" to synthesize qualified data for training machine learning-based handwriting recognition. The proposed technique can provide a greater variety of patterns of training data than the existing approaches such as elastic distortion and tangent-based affine transformation. A couple of training characters are chosen, then they are analyzed by their similar and different structures, and finally are crossed over to generate the new characters. The experiments are set to compare the performances of tangent-based affine transformation and the proposed approach in terms of the variety of generated characters and percent of recognition errors. The standard MNIST corpus including 60,000 training characters and 10,000 test characters is employed in the experiments. The proposed technique uses 1,000 characters to synthesize 60,000 characters, and then uses these data to train and test the benchmark handwriting recognition system that exploits Histogram of Gradient (HOG) as features and Support Vector Machine (SVM) as recognizer. The experimental result yields 8.06 significantly outperforms the tangent-based affine transformation and the original MNIST training data, which are 11.74

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/28/2020

A complete character recognition and transliteration technique for Devanagari script

Transliteration involves transformation of one script to another based o...
research
06/30/2010

A Two Stage Classification Approach for Handwritten Devanagari Characters

The paper presents a two stage classification approach for handwritten d...
research
05/08/2012

Spectral Analysis of Projection Histogram for Enhancing Close matching character Recognition in Malayalam

The success rates of Optical Character Recognition (OCR) systems for pri...
research
03/10/2022

Towards Open-Set Text Recognition via Label-to-Prototype Learning

Scene text recognition is a popular topic and can benefit various tasks....
research
07/25/2022

Riemannian Geometry Approach for Minimizing Distortion and its Applications

Given an affine transformation T, we define its Fisher distortion Dist_F...
research
09/22/2009

A Method for Extraction and Recognition of Isolated License Plate Characters

A method to extract and recognize isolated characters in license plates ...
research
04/30/2019

Handwritten Chinese Font Generation with Collaborative Stroke Refinement

Automatic character generation is an appealing solution for new typeface...

Please sign up or login with your details

Forgot password? Click here to reset