Text Line Segmentation for Challenging Handwritten Document Images Using Fully Convolutional Network

01/20/2021
by   Berat Barakat, et al.
18

This paper presents a method for text line segmentation of challenging historical manuscript images. These manuscript images contain narrow interline spaces with touching components, interpenetrating vowel signs and inconsistent font types and sizes. In addition, they contain curved, multi-skewed and multi-directed side note lines within a complex page layout. Therefore, bounding polygon labeling would be very difficult and time consuming. Instead we rely on line masks that connect the components on the same text line. Then these line masks are predicted using a Fully Convolutional Network (FCN). In the literature, FCN has been successfully used for text line segmentation of regular handwritten document images. The present paper shows that FCN is useful with challenging manuscript images as well. Using a new evaluation metric that is sensitive to over segmentation as well as under segmentation, testing results on a publicly available challenging handwritten dataset are comparable with the results of a previous work on the same dataset.

READ FULL TEXT

page 1

page 3

page 4

page 5

research
01/18/2021

Text line extraction using fully convolutional network and energy minimization

Text lines are important parts of handwritten document images and easier...
research
03/19/2020

Unsupervised text line segmentation

We present an unsupervised text line segmentation method that is inspire...
research
04/18/2021

Line Segmentation from Unconstrained Handwritten Text Images using Adaptive Approach

Line segmentation from handwritten text images is one of the challenging...
research
04/14/2016

Multi-Oriented Text Detection with Fully Convolutional Networks

In this paper, we propose a novel approach for text detec- tion in natur...
research
07/09/2019

BADAM: A Public Dataset for Baseline Detection in Arabic-script Manuscripts

The application of handwritten text recognition to historical works is h...
research
11/21/2017

Fully Convolutional Neural Networks for Page Segmentation of Historical Document Images

We propose a high-performance fully convolutional neural network (FCN) f...
research
05/07/2018

A probabilistic framework for handwritten text line segmentation

We successfully combine Expectation-Maximization algorithm and variation...

Please sign up or login with your details

Forgot password? Click here to reset