Recognizing Handwriting Styles in a Historical Scanned Document Using Scikit-Fuzzy c-means Clustering

10/30/2022
by   Sriparna Majumdar, et al.
0

The forensic attribution of the handwriting in a digitized document to multiple scribes is a challenging problem of high dimensionality. Unique handwriting styles may be dissimilar in a blend of several factors including character size, stroke width, loops, ductus, slant angles, and cursive ligatures. Previous work on labeled data with Hidden Markov models, support vector machines, and semi-supervised recurrent neural networks have provided moderate to high success. In this study, we successfully detect hand shifts in a historical manuscript through fuzzy soft clustering in combination with linear principal component analysis. This advance demonstrates the successful deployment of unsupervised methods for writer attribution of historical documents and forensic document analysis.

READ FULL TEXT

page 7

page 9

page 14

research
09/21/2019

Application of Fuzzy Clustering for Text Data Dimensionality Reduction

Large textual corpora are often represented by the document-term frequen...
research
12/15/2022

The Effects of Character-Level Data Augmentation on Style-Based Dating of Historical Manuscripts

Identifying the production dates of historical manuscripts is one of the...
research
04/25/2017

Automatic Compositor Attribution in the First Folio of Shakespeare

Compositor attribution, the clustering of pages in a historical printed ...
research
07/09/2014

Classifying Fonts and Calligraphy Styles Using Complex Wavelet Transform

Recognizing fonts has become an important task in document analysis, due...
research
01/06/2010

Document Clustering with K-tree

This paper describes the approach taken to the XML Mining track at INEX ...
research
01/26/2019

A Linear-complexity Multi-biometric Forensic Document Analysis System, by Fusing the Stylome and Signature Modalities

Forensic Document Analysis (FDA) addresses the problem of finding the au...
research
12/19/2014

Multiple Authors Detection: A Quantitative Analysis of Dream of the Red Chamber

Inspired by the authorship controversy of Dream of the Red Chamber and t...

Please sign up or login with your details

Forgot password? Click here to reset