Fully Convolutional Neural Networks for Page Segmentation of Historical Document Images

11/21/2017
by   Christoph Wick, et al.
0

We propose a high-performance fully convolutional neural network (FCN) for historical handwritten document segmentation that is designed to process a single page in one step. The advantage of this model beside its speed is its ability to directly learn from raw pixels instead of using preprocessing steps e. g. feature computation or superpixel generation. We show that this network yields better results than existing methods on different public data sets. For evaluation of this model we introduce a novel metric that is independent of ambiguous ground truth called Foreground Pixel Accuracy (FgPA). This pixel based measure only counts foreground pixels in the binarized page, any background pixel is omitted. The major advantage of this metric is, that it enables researchers to compare different segmentation methods on their ability to successfully segment text or pictures and not on their ability to learn and possibly overfit the peculiarities of an ambiguous hand-made ground truth segmentation.

READ FULL TEXT

page 1

page 3

page 6

research
04/05/2017

Convolutional Neural Networks for Page Segmentation of Historical Document Images

This paper presents a Convolutional Neural Network (CNN) based page segm...
research
11/23/2018

An Adaptive Approach for Automated Grapevine Phenotyping using VGG-based Convolutional Neural Networks

In (grapevine) breeding programs and research, periodic phenotyping and ...
research
02/28/2020

Neural Network Segmentation of Interstitial Fibrosis, Tubular Atrophy, and Glomerulosclerosis in Renal Biopsies

Glomerulosclerosis, interstitial fibrosis, and tubular atrophy (IFTA) ar...
research
01/20/2021

Text Line Segmentation for Challenging Handwritten Document Images Using Fully Convolutional Network

This paper presents a method for text line segmentation of challenging h...
research
08/10/2017

Document Image Binarization with Fully Convolutional Neural Networks

Binarization of degraded historical manuscript images is an important pr...
research
08/15/2019

SFSegNet: Parse Freehand Sketches using Deep Fully Convolutional Networks

Parsing sketches via semantic segmentation is attractive but challenging...
research
01/19/2021

Unsupervised Deep Learning for Handwritten Page Segmentation

Segmenting handwritten document images into regions with homogeneous pat...

Please sign up or login with your details

Forgot password? Click here to reset