Implicit segmentation of Kannada characters in offline handwriting recognition using hidden Markov models

10/16/2014
by   Manasij Venkatesh, et al.
0

We describe a method for classification of handwritten Kannada characters using Hidden Markov Models (HMMs). Kannada script is agglutinative, where simple shapes are concatenated horizontally to form a character. This results in a large number of characters making the task of classification difficult. Character segmentation plays a significant role in reducing the number of classes. Explicit segmentation techniques suffer when overlapping shapes are present, which is common in the case of handwritten text. We use HMMs to take advantage of the agglutinative nature of Kannada script, which allows us to perform implicit segmentation of characters along with recognition. All the experiments are performed on the Chars74k dataset that consists of 657 handwritten characters collected across multiple users. Gradient-based features are extracted from individual characters and are used to train character HMMs. The use of implicit segmentation technique at the character level resulted in an improvement of around 10 tested on the same dataset by around 16 showed that increasing the training data could result in better accuracy. Accordingly, we collected additional data and obtained an improvement of 4 with 6 additional samples.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/14/2012

Segmentation of Offline Handwritten Bengali Script

Character segmentation has long been one of the most critical areas of o...
research
09/28/2020

A complete character recognition and transliteration technique for Devanagari script

Transliteration involves transformation of one script to another based o...
research
11/17/2021

Augmentation of base classifier performance via HMMs on a handwritten character data set

This paper presents results of a study of the performance of several bas...
research
09/22/2015

Classification error in multiclass discrimination from Markov data

As a model for an on-line classification setting we consider a stochasti...
research
08/01/2017

HMM-based Indic Handwritten Word Recognition using Zone Segmentation

This paper presents a novel approach towards Indic handwritten word reco...
research
09/06/2023

Character Queries: A Transformer-based Approach to On-Line Handwritten Character Segmentation

On-line handwritten character segmentation is often associated with hand...
research
01/24/2020

Character-independent font identification

There are a countless number of fonts with various shapes and styles. In...

Please sign up or login with your details

Forgot password? Click here to reset