Bootstrapping Weakly Supervised Segmentation-free Word Spotting through HMM-based Alignment

03/24/2020
by Tomas Wilkinson, et al.

Recent work in word spotting in handwritten documents has yielded impressive results. This progress has largely been made by supervised learning systems, which depend on manually annotated data, making deployment to new collections a significant effort. In this paper, we propose an approach that utilises transcripts without bounding box annotations to train segmentation-free query-by-string word spotting models, given a partially trained model. This is done through a training-free alignment procedure based on hidden Markov models. The procedure creates a tentative mapping between word region proposals and the transcriptions to automatically create additional weakly annotated training data, without choosing any single alignment possibility as the correct one. Using only a small portion of the annotated training sets for partial convergence, we automatically annotate the remaining training data and successfully train using it. On all our datasets, our final trained model then comes within a few mAP percentage points of a model trained with the full training set used as ground truth. We believe that this will be a significant advance towards a more general use of word spotting, since digital transcription data will already exist for parts of many collections of interest.
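To make the idea of a soft, HMM-based alignment concrete, the following is a minimal sketch and not the authors' implementation. It assumes a simplified left-to-right HMM in which the hidden states are the transcript words in order, the observations are region proposals in reading order, and `emission[t, j]` is a positive similarity score between proposal `t` and word `j` (for instance, from a partially trained model). The function name `soft_align`, the `p_stay` parameter, and the toy scores are all hypothetical. The forward-backward pass returns a posterior over alignments rather than a single best path, which mirrors the abstract's point about not committing to one alignment.

```python
import numpy as np


def soft_align(emission, p_stay=0.5):
    """Forward-backward posteriors for a toy left-to-right HMM.

    emission[t, j] > 0 scores proposal t against transcript word j.
    Returns gamma with gamma[t, j] = P(proposal t aligns to word j).
    The HMM topology (stay or advance by one word) is an assumption.
    """
    emission = np.asarray(emission, dtype=float)
    T, J = emission.shape
    if T < J:
        raise ValueError("need at least as many proposals as words")
    if not 0.0 < p_stay < 1.0:
        raise ValueError("p_stay must lie strictly between 0 and 1")
    if np.any(emission <= 0):
        raise ValueError("emission scores must be positive")
    p_next = 1.0 - p_stay

    # Forward pass: the path must start at the first word.
    alpha = np.zeros((T, J))
    alpha[0, 0] = emission[0, 0]
    alpha[0] /= alpha[0].sum()
    for t in range(1, T):
        move = np.zeros(J)
        move[1:] = alpha[t - 1, :-1] * p_next
        alpha[t] = (alpha[t - 1] * p_stay + move) * emission[t]
        alpha[t] /= alpha[t].sum()  # rescale to avoid underflow

    # Backward pass: the path must end at the last word.
    beta = np.zeros((T, J))
    beta[T - 1, J - 1] = 1.0
    for t in range(T - 2, -1, -1):
        b = beta[t + 1] * emission[t + 1]
        move = np.zeros(J)
        move[:-1] = b[1:] * p_next
        beta[t] = b * p_stay + move
        beta[t] /= beta[t].sum()

    # Combine and normalise each row into a distribution over words.
    gamma = alpha * beta
    gamma /= gamma.sum(axis=1, keepdims=True)
    return gamma


# Toy example: three proposals, two transcript words (made-up scores).
scores = [[0.9, 0.1],
          [0.5, 0.5],
          [0.1, 0.9]]
posterior = soft_align(scores)
```

In this toy case the middle proposal is equally compatible with both words, so its posterior mass is split between them instead of being forced onto one. Such soft weights could then serve as weak labels, though how the paper actually uses them is not specified in the abstract.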
