Attribute CNNs for Word Spotting in Handwritten Documents

12/20/2017
by   Sebastian Sudholt, et al.
0

Word spotting has become a field of strong research interest in document image analysis over the last years. Recently, AttributeSVMs were proposed which predict a binary attribute representation. At their time, this influential method defined the state-of-the-art in segmentation-based word spotting. In this work, we present an approach for learning attribute representations with Convolutional Neural Networks (CNNs). By taking a probabilistic perspective on training CNNs, we derive two different loss functions for binary and real-valued word string embeddings. In addition, we propose two different CNN architectures, specifically designed for word spotting. These architectures are able to be trained in an end-to-end fashion. In a number of experiments, we investigate the influence of different word string embeddings and optimization strategies. We show our Attribute CNNs to achieve state-of-the-art results for segmentation-based word spotting on a large variety of data sets.

READ FULL TEXT

page 6

page 9

research
04/01/2016

PHOCNet: A Deep Convolutional Neural Network for Word Spotting in Handwritten Documents

In recent years, deep convolutional neural networks have achieved state ...
research
03/26/2019

Exploring Confidence Measures for Word Spotting in Heterogeneous Datasets

In recent years, convolutional neural networks (CNNs) took over the fiel...
research
05/28/2015

Query by String word spotting based on character bi-gram indexing

In this paper we propose a segmentation-free query by string word spotti...
research
12/01/2017

Learning Deep Representations for Word Spotting Under Weak Supervision

Convolutional Neural Networks have made their mark in various fields of ...
research
06/28/2018

Expolring Architectures for CNN-Based Word Spotting

The goal in word spotting is to retrieve parts of document images which ...
research
01/05/2018

Deep learning for word-level handwritten Indic script identification

We propose a novel method that uses convolutional neural networks (CNNs)...
research
10/29/2020

A Comprehensive Comparison of End-to-End Approaches for Handwritten Digit String Recognition

Over the last decades, most approaches proposed for handwritten digit st...

Please sign up or login with your details

Forgot password? Click here to reset