Recognition of handwritten Roman Numerals using Tesseract open source OCR engine

03/30/2010
by Sandip Rakshit, et al.

The objective of the paper is to recognize handwritten samples of Roman numerals using the Tesseract open source Optical Character Recognition (OCR) engine. Tesseract is trained with data samples from different persons to generate one user-independent language model representing the handwritten Roman digit set. The system is trained with 1226 digit samples collected from the different users. Performance is tested on two datasets: one consisting of samples collected from known users (those who prepared the training data samples) and the other consisting of handwritten samples from unknown users. The overall recognition accuracy is obtained as 92.1 on these test datasets respectively.
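
The evaluation described above, scoring recognizer output separately on a known-user and an unknown-user test set, can be sketched roughly as follows. This is only an illustration under stated assumptions: the function name `recognition_accuracy` and the toy label lists are hypothetical, and nothing here reproduces the paper's data or its Tesseract training procedure.

```python
from typing import Sequence


def recognition_accuracy(predicted: Sequence[str], expected: Sequence[str]) -> float:
    """Return the percentage of samples whose predicted label matches the ground truth."""
    if len(predicted) != len(expected):
        raise ValueError("predicted and expected must have the same length")
    if not expected:
        raise ValueError("cannot compute accuracy on an empty dataset")
    # Count exact label matches, sample by sample.
    correct = sum(p == e for p, e in zip(predicted, expected))
    return 100.0 * correct / len(expected)


# Hypothetical toy labels (NOT data from the paper): (recognizer output, ground truth).
test_sets = {
    "known users": (["1", "2", "3", "4"], ["1", "2", "3", "4"]),
    "unknown users": (["1", "7", "3", "4"], ["1", "2", "3", "4"]),
}

for name, (predicted, expected) in test_sets.items():
    print(f"{name}: {recognition_accuracy(predicted, expected):.1f}%")
```

Reporting the two test sets separately keeps the writer-dependent and writer-independent cases distinguishable instead of blending them into one figure.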

03/30/2010

Development of a multi-user handwriting recognition system using Tesseract open source OCR engine

The objective of the paper is to recognize handwritten samples of lower ...
05/30/2015

An Open Source Testing Tool for Evaluating Handwriting Input Methods

This paper presents an open source tool for testing the recognition accu...
03/30/2010

Development of a Multi-User Recognition Engine for Handwritten Bangla Basic Characters and Digits

The objective of the paper is to recognize handwritten samples of basic ...
03/30/2010

Recognition of Handwritten Roman Script Using Tesseract Open source OCR Engine

In the present work, we have used Tesseract 2.01 open source Optical Cha...
09/18/2019

Unsupervised Writer Adaptation for Synthetic-to-Real Handwritten Word Recognition

Handwritten Text Recognition (HTR) is still a challenging problem becaus...
04/17/2019

TextCaps : Handwritten Character Recognition with Very Small Datasets

Many localized languages struggle to reap the benefits of recent advance...