Recognizing Challenging Handwritten Annotations with Fully Convolutional Networks

04/01/2018
by   Andreas Kölsch, et al.
0

This paper introduces a very challenging dataset of historic German documents and evaluates Fully Convolutional Neural Network (FCNN) based methods to locate handwritten annotations of any kind in these documents. The handwritten annotations can appear in form of underlines and text by using various writing instruments, e.g., the use of pencils makes the data more challenging. We train and evaluate various end-to-end semantic segmentation approaches and report the results. The task is to classify the pixels of documents into two classes: background and handwritten annotation. The best model achieves a mean Intersection over Union (IoU) score of 95.6 presented dataset. We also present a comparison of different strategies used for data augmentation and training on our presented dataset. For evaluation, we use the Layout Analysis Evaluator for the ICDAR 2017 Competition on Layout Analysis for Challenging Medieval Manuscripts.

READ FULL TEXT

page 1

page 2

page 3

page 5

research
01/19/2021

VML-MOC: Segmenting a multiply oriented and curved handwritten text lines dataset

This paper publishes a natural and very complicated dataset of handwritt...
research
09/30/2022

Towards End-to-end Handwritten Document Recognition

Handwritten text recognition has been widely studied in the last decades...
research
08/21/2021

Palmira: A Deep Deformable Network for Instance Segmentation of Dense and Uneven Layouts in Handwritten Manuscripts

Handwritten documents are often characterized by dense and uneven layout...
research
06/27/2017

Training a Fully Convolutional Neural Network to Route Integrated Circuits

We present a deep, fully convolutional neural network that learns to rou...
research
07/29/2022

Recognition of Handwritten Chinese Text by Segmentation: A Segment-annotation-free Approach

Online and offline handwritten Chinese text recognition (HTCR) has been ...
research
08/09/2014

Automatic Removal of Marginal Annotations in Printed Text Document

Recovering the original printed texts from a document with added handwri...
research
01/26/2018

PDNet: Semantic Segmentation integrated with a Primal-Dual Network for Document binarization

Binarization of digital documents is the task of classifying each pixel ...

Please sign up or login with your details

Forgot password? Click here to reset