Exploring Confidence Measures for Word Spotting in Heterogeneous Datasets

03/26/2019
by   Fabian Wolf, et al.
0

In recent years, convolutional neural networks (CNNs) took over the field of document analysis and they became the predominant model for word spotting. Especially attribute CNNs, which learn the mapping between a word image and an attribute representation, showed exceptional performances. The drawback of this approach is the overconfidence of neural networks when used out of their training distribution. In this paper, we explore different metrics for quantifying the confidence of a CNN in its predictions, specifically on the retrieval problem of word spotting. With these confidence measures, we limit the inability of a retrieval list to reject certain candidates. We investigate four different approaches that are either based on the network's attribute estimations or make use of a surrogate model. Our approach also aims at answering the question for which part of a dataset the retrieval system gives reliable results. We further show that there exists a direct relation between the proposed confidence measures and the quality of an estimated attribute representation.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/20/2017

Attribute CNNs for Word Spotting in Handwritten Documents

Word spotting has become a field of strong research interest in document...
research
04/23/2015

Multimodal Convolutional Neural Networks for Matching Image and Sentence

In this paper, we propose multimodal convolutional neural networks (m-CN...
research
06/28/2018

Expolring Architectures for CNN-Based Word Spotting

The goal in word spotting is to retrieve parts of document images which ...
research
07/25/2017

Exploring the Effectiveness of Convolutional Neural Networks for Answer Selection in End-to-End Question Answering

Most work on natural language question answering today focuses on answer...
research
06/08/2020

Combining word embeddings and convolutional neural networks to detect duplicated questions

Detecting semantic similarities between sentences is still a challenge t...
research
03/03/2020

What's the relationship between CNNs and communication systems?

The interpretability of Convolutional Neural Networks (CNNs) is an impor...

Please sign up or login with your details

Forgot password? Click here to reset