AT-ST: Self-Training Adaptation Strategy for OCR in Domains with Limited Transcriptions

04/27/2021
by Martin Kišš, et al.

This paper addresses text recognition for domains with limited manual annotations using a simple self-training strategy. Our approach should reduce human annotation effort when target domain data is plentiful, such as when transcribing a collection of a single person's correspondence or a large manuscript. We propose to train a seed system on large-scale data from related domains mixed with the available annotated data from the target domain. The seed system transcribes the unannotated data from the target domain, and these machine transcriptions are then used to train a better system. We study several confidence measures and eventually decide to use the posterior probability of a transcription for data selection. Additionally, we propose to augment the data using an aggressive masking scheme. By self-training, we achieve up to 55 % reduction in character error rate for handwritten data and up to 38 % on printed data. The masking augmentation itself reduces the error rate by about 10 %, and its effect is more pronounced for difficult handwritten data.
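The two ingredients named in the abstract, confidence-based data selection and masking augmentation, can be illustrated with a minimal Python sketch. It is not the authors' implementation. The names `select_confident` and `mask_columns`, the `keep_fraction` parameter, and the choice of masking random vertical bands are all assumptions made for illustration; the abstract does not specify the selection threshold or the exact masking scheme.

```python
import random
from typing import List, Sequence, Tuple

# Hypothetical record: (line_id, machine transcription, log posterior probability).
Hypothesis = Tuple[str, str, float]


def select_confident(hyps: Sequence[Hypothesis], keep_fraction: float) -> List[Hypothesis]:
    """Keep the most confident fraction of machine-transcribed lines.

    Confidence is the log posterior of the whole transcription; how much
    data to keep (keep_fraction) is an assumed tuning parameter.
    """
    if not 0.0 < keep_fraction <= 1.0:
        raise ValueError("keep_fraction must be in (0, 1]")
    if not hyps:
        return []
    ranked = sorted(hyps, key=lambda h: h[2], reverse=True)
    n_keep = max(1, int(len(ranked) * keep_fraction))
    return ranked[:n_keep]


def mask_columns(
    image: Sequence[Sequence[int]],
    n_masks: int,
    max_width: int,
    rng: random.Random,
    fill: int = 0,
) -> List[List[int]]:
    """Return a copy of a text-line image with random vertical bands overwritten.

    The image is a list of pixel rows. Band masking is an assumed stand-in
    for the "aggressive masking scheme" mentioned in the abstract.
    """
    out = [list(row) for row in image]
    width = len(out[0]) if out else 0
    if width == 0:
        return out
    for _ in range(n_masks):
        band = rng.randint(1, min(max_width, width))  # band width in pixels
        start = rng.randint(0, width - band)          # left edge of the band
        for row in out:
            row[start:start + band] = [fill] * band
    return out
```

In a self-training loop under these assumptions, the seed system would transcribe the unannotated lines, `select_confident` would pick the lines to add to the training set, and `mask_columns` would be applied to line images while training the next system.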


