
A3S: Adversarial learning of semantic representations for Scene-Text Spotting

by Masato Fujitake, et al.

Scene-text spotting is the task of locating text regions in natural scene images and recognizing the characters they contain at the same time. It has attracted much attention in recent years because of its wide range of applications. Existing research has mainly focused on improving text region detection rather than text recognition. As a result, detection accuracy has improved, but end-to-end accuracy remains insufficient. Text in natural scene images tends to be a meaningful string of characters, a word, rather than a random string. We therefore propose adversarial learning of semantic representations for scene text spotting (A3S) to improve end-to-end accuracy, including text recognition. Instead of performing text recognition based only on existing visual features, A3S simultaneously predicts semantic features in the detected text area. Experimental results on publicly available datasets show that the proposed method achieves better accuracy than other methods.


