Automatic Removal of Marginal Annotations in Printed Text Document

08/09/2014
by   Abdessamad Elboushaki, et al.
0

Recovering the original printed texts from a document with added handwritten annotations in the marginal area is one of the challenging problems, especially when the original document is not available. Therefore, this paper aims at salvaging automatically the original document from the annotated document by detecting and removing any handwritten annotations that appear in the marginal area of the document without any loss of information. Here a two stage algorithm is proposed, where in the first stage due to approximate marginal boundary detection with horizontal and vertical projection profiles, all of the marginal annotations along with some part of the original printed text that may appear very close to the marginal boundary are removed. Therefore as a second stage, using the connected components, a strategy is applied to bring back the printed text components cropped during the first stage. The proposed method is validated using a dataset of 50 documents having complex handwritten annotations, which gives an overall accuracy of 89.01 annotations and 97.74 document.

READ FULL TEXT
research
11/22/2017

TexT - Text Extractor Tool for Handwritten Document Transcription and Annotation

This paper presents a framework for semi-automatic transcription of larg...
research
01/10/2019

New Radon Transform Based Texture Features of Handwritten Document

In this paper, we present some new features describing the handwritten d...
research
06/06/2013

K-Algorithm A Modified Technique for Noise Removal in Handwritten Documents

OCR has been an active research area since last few decades. OCR perform...
research
11/09/2012

Localisation of Numerical Date Field in an Indian Handwritten Document

This paper describes a method to localise all those areas which may cons...
research
04/01/2018

Recognizing Challenging Handwritten Annotations with Fully Convolutional Networks

This paper introduces a very challenging dataset of historic German docu...
research
06/16/2021

ICDAR 2021 Competition on Components Segmentation Task of Document Photos

This paper describes the short-term competition on Components Segmentati...
research
10/14/2019

Vertebrae Detection and Localization in CT with Two-Stage CNNs and Dense Annotations

We propose a new, two-stage approach to the vertebrae centroid detection...

Please sign up or login with your details

Forgot password? Click here to reset