Text Line Identification in Tagore's Manuscript

08/29/2014
by   Chandranath Adak, et al.
0

In this paper, a text line identification method is proposed. The text lines of printed document are easy to segment due to uniform straightness of the lines and sufficient gap between the lines. But in handwritten documents, the line is non-uniform and interline gaps are variable. We take Rabindranath Tagore's manuscript as it is one of the most difficult manuscripts that contain doodles. Our method consists of a pre-processing stage to clean the document image. Then we separate doodles from the manuscript to get the textual region. After that we identify the text lines on the manuscript. For text line identification, we use window examination, black run-length smearing, horizontal histogram and connected component analysis.

READ FULL TEXT

page 1

page 3

page 4

research
01/18/2021

Text line extraction using fully convolutional network and energy minimization

Text lines are important parts of handwritten document images and easier...
research
03/16/2021

Combining Morphological and Histogram based Text Line Segmentation in the OCR Context

Text line segmentation is one of the pre-stages of modern optical charac...
research
07/21/2017

HMM-based Writer Identification in Music Score Documents without Staff-Line Removal

Writer identification from musical score documents is a challenging task...
research
06/19/2018

A New COLD Feature based Handwriting Analysis for Ethnicity/Nationality Identification

Identifying crime for forensic investigating teams when crimes involve p...
research
05/19/2021

Unsupervised learning of text line segmentation by differentiating coarse patterns

Despite recent advances in the field of supervised deep learning for tex...
research
01/03/2019

Text line Segmentation in Compressed Representation of Handwritten Document using Tunneling Algorithm

In this research work, we perform text line segmentation directly in com...

Please sign up or login with your details

Forgot password? Click here to reset