STEP – Towards Structured Scene-Text Spotting

09/05/2023
by   Sergi Garcia-Bordils, et al.
0

We introduce the structured scene-text spotting task, which requires a scene-text OCR system to spot text in the wild according to a query regular expression. Contrary to generic scene text OCR, structured scene-text spotting seeks to dynamically condition both scene text detection and recognition on user-provided regular expressions. To tackle this task, we propose the Structured TExt sPotter (STEP), a model that exploits the provided text structure to guide the OCR process. STEP is able to deal with regular expressions that contain spaces and it is not bound to detection at the word-level granularity. Our approach enables accurate zero-shot structured text spotting in a wide variety of real-world reading scenarios and is solely trained on publicly available data. To demonstrate the effectiveness of our approach, we introduce a new challenging test dataset that contains several types of out-of-vocabulary structured text, reflecting important reading applications of fields such as prices, dates, serial numbers, license plates etc. We demonstrate that STEP can provide specialised OCR performance on demand in all tested scenarios.

READ FULL TEXT
research
12/13/2018

Advances of Scene Text Datasets

This article introduces publicly available datasets in scene text detect...
research
11/10/2021

Improving Structured Text Recognition with Regular Expression Biasing

We study the problem of recognizing structured text, i.e. text that foll...
research
05/16/2023

StructGPT: A General Framework for Large Language Model to Reason over Structured Data

In this paper, we study how to improve the zero-shot reasoning ability o...
research
02/08/2023

Geometric Perception based Efficient Text Recognition

Every Scene Text Recognition (STR) task consists of text localization & ...
research
07/29/2023

Separate Scene Text Detector for Unseen Scripts is Not All You Need

Text detection in the wild is a well-known problem that becomes more cha...
research
07/19/2023

Comparing with Python: Text Analysis in Stata

Text analysis is the process of constructing structured data from unstru...
research
01/23/2021

ARTH: Algorithm For Reading Text Handily – An AI Aid for People having Word Processing Issues

The objective of this project is to solve one of the major problems face...

Please sign up or login with your details

Forgot password? Click here to reset