Text-independent writer identification using convolutional neural network

09/10/2020
by   Hung Tuan Nguyen, et al.
0

The text-independent approach to writer identification does not require the writer to write some predetermined text. Previous research on text-independent writer identification has been based on identifying writer-specific features designed by experts. However, in the last decade, deep learning methods have been successfully applied to learn features from data automatically. We propose here an end-to-end deep-learning method for text-independent writer identification that does not require prior identification of features. A Convolutional Neural Network (CNN) is trained initially to extract local features, which represent characteristics of individual handwriting in the whole character images and their sub-regions. Randomly sampled tuples of images from the training set are used to train the CNN and aggregate the extracted local features of images from the tuples to form global features. For every training epoch, the process of randomly sampling tuples is repeated, which is equivalent to a large number of training patterns being prepared for training the CNN for text-independent writer identification. We conducted experiments on the JEITA-HP database of offline handwritten Japanese character patterns. With 200 characters, our method achieved an accuracy of 99.97 writers. Even when using 50 characters for 100 writers or 100 characters for 400 writers, our method achieved accuracy levels of 92.80 respectively. We conducted further experiments on the Firemaker and IAM databases of offline handwritten English text. Using only one page per writer to train, our method achieved over 91.81 Overall, we achieved a better performance than the previously published best result based on handcrafted features and clustering algorithms, which demonstrates the effectiveness of our method for handwritten English text also.

READ FULL TEXT
research
07/11/2023

Handwritten Text Recognition Using Convolutional Neural Network

OCR (Optical Character Recognition) is a technology that offers comprehe...
research
11/20/2021

Exploiting Multi-Scale Fusion, Spatial Attention and Patch Interaction Techniques for Text-Independent Writer Identification

Text independent writer identification is a challenging problem that dif...
research
01/24/2020

Character-independent font identification

There are a countless number of fonts with various shapes and styles. In...
research
06/21/2016

DeepWriter: A Multi-Stream Deep CNN for Text-independent Writer Identification

Text-independent writer identification is challenging due to the huge va...
research
05/24/2021

TRACE: A Differentiable Approach to Line-level Stroke Recovery for Offline Handwritten Text

Stroke order and velocity are helpful features in the fields of signatur...
research
02/21/2022

Offline Text-Independent Writer Identification based on word level data

This paper proposes a novel scheme to identify the authorship of a docum...
research
10/30/2020

Automatic Counting and Identification of Train Wagons Based on Computer Vision and Deep Learning

In this work, we present a robust and efficient solution for counting an...

Please sign up or login with your details

Forgot password? Click here to reset