Radial Line Fourier Descriptor for Segmentation-free Handwritten Word Spotting

09/06/2017
by Anders Hast, et al.

Automatic recognition of historical handwritten manuscripts is a daunting task due to paper degradation over time. Recognition-free retrieval, or word spotting, is widely used for information retrieval and digitization of historical handwritten documents. However, the performance of word spotting algorithms depends heavily on feature detection and representation methods. Although popular feature descriptors exist, such as Scale Invariant Feature Transform (SIFT) and Speeded Up Robust Features (SURF), their invariance properties amplify the noise in degraded document images, making them more sensitive to the noise and complex characteristics of historical manuscripts. An efficient and relaxed feature descriptor is therefore required, because handwritten words across different documents are similar but not identical. This paper introduces a Radial Line Fourier (RLF) descriptor for handwritten word representation, with a short feature vector of 32 dimensions. It studies a segmentation-free and training-free handwritten word spotting method that relies on the proposed RLF descriptor, takes into account different keypoint representations, and uses a simple preconditioner-based feature matching algorithm. The effectiveness of the proposed RLF descriptor for segmentation-free handwritten word spotting is empirically evaluated on well-known historical handwritten datasets using standard evaluation measures.
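The abstract gives only the descriptor's name and its 32-dimensional length, so the following is a rough sketch of how a descriptor of that general kind could be assembled, not the authors' method. It samples intensity profiles along radial lines from a keypoint and keeps a few low-frequency Fourier magnitudes per line. The function name `radial_line_fourier`, the choice of 8 lines with 4 coefficients each (to reach 32 values), the sampling radius, and the nearest-neighbour sampling are all assumptions made for illustration.

```python
import numpy as np


def radial_line_fourier(image, y, x, n_lines=8, n_samples=16,
                        n_coeffs=4, radius=12.0):
    """Hypothetical radial-line Fourier feature at keypoint (y, x).

    All parameters are illustrative assumptions; the source abstract only
    states that the descriptor has 32 dimensions (here 8 lines * 4 values).
    """
    h, w = image.shape
    # Evenly spaced line directions around the keypoint.
    angles = np.linspace(0.0, 2.0 * np.pi, n_lines, endpoint=False)
    # Sample positions along each line, starting one pixel from the centre.
    radii = np.linspace(1.0, radius, n_samples)

    features = []
    for theta in angles:
        # Nearest-neighbour sampling, clamped to the image border.
        ys = np.clip(np.rint(y + radii * np.sin(theta)).astype(int), 0, h - 1)
        xs = np.clip(np.rint(x + radii * np.cos(theta)).astype(int), 0, w - 1)
        profile = image[ys, xs].astype(float)
        # Magnitudes of the real FFT; drop the DC term so a constant
        # brightness offset does not affect the feature.
        spectrum = np.abs(np.fft.rfft(profile))
        features.append(spectrum[1:n_coeffs + 1])

    vec = np.concatenate(features)
    norm = np.linalg.norm(vec)
    # L2-normalise unless the vector is (numerically) zero.
    return vec / norm if norm > 1e-12 else vec


# Usage on a synthetic grayscale patch (random data, for shape checking only).
rng = np.random.default_rng(0)
patch = rng.random((64, 64))
descriptor = radial_line_fourier(patch, y=32, x=32)
print(descriptor.shape)
```

Keeping only low-frequency magnitudes is one simple way to obtain a short vector that tolerates small stroke variations, which is in the spirit of the "relaxed" descriptor the abstract calls for; the actual construction is described in the full paper.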


