Text Recognition – Real World Data and Where to Find Them

07/06/2020
by   Klára Janoušková, et al.
13

We present a method for exploiting weakly annotated images to improve text extraction pipelines. The approach exploits an arbitrary existing end-to-end text recognition system to obtain text region proposals and their, possibly erroneous, transcriptions. A process that includes imprecise transcription to annotation matching and edit distance guided neighbourhood search produces nearly error-free, localised instances of scene text, which we treat as pseudo ground truth used for training. We apply the method to two weakly-annotated datasets and show that the process consistently improves the accuracy of a state of the art recognition model across different benchmark datasets (image domains) as well as providing a significant performance boost on the same dataset.

READ FULL TEXT

page 1

page 7

research
05/12/2021

TextOCR: Towards large-scale end-to-end reasoning for arbitrary-shaped scene text

A crucial component for the scene text based reasoning required for Text...
research
06/27/2023

UTRNet: High-Resolution Urdu Text Recognition In Printed Documents

In this paper, we propose a novel approach to address the challenges of ...
research
03/14/2023

Rethinking Image-based Table Recognition Using Weakly Supervised Methods

Most of the previous methods for table recognition rely on training data...
research
01/13/2022

Weakly Supervised Scene Text Detection using Deep Reinforcement Learning

The challenging field of scene text detection requires complex data anno...
research
04/23/2018

Guided Attention for Large Scale Scene Text Verification

Many tasks are related to determining if a particular text string exists...
research
08/26/2019

End-To-End Measure for Text Recognition

Measuring the performance of text recognition and text line detection en...
research
10/09/2018

Selective Distillation of Weakly Annotated GTD for Vision-based Slab Identification System

This paper proposes an algorithm for recognizing slab identification num...

Please sign up or login with your details

Forgot password? Click here to reset