2D-CTC for Scene Text Recognition

07/23/2019
by   Zhaoyi Wan, et al.
3

Scene text recognition has been an important, active research topic in computer vision for years. Previous approaches mainly consider text as 1D signals and cast scene text recognition as a sequence prediction problem, by feat of CTC or attention based encoder-decoder framework, which is originally designed for speech recognition. However, different from speech voices, which are 1D signals, text instances are essentially distributed in 2D image spaces. To adhere to and make use of the 2D nature of text for higher recognition accuracy, we extend the vanilla CTC model to a second dimension, thus creating 2D-CTC. 2D-CTC can adaptively concentrate on most relevant features while excluding the impact from clutters and noises in the background; It can also naturally handle text instances with various forms (horizontal, oriented and curved) while giving more interpretable intermediate predictions. The experiments on standard benchmarks for scene text recognition, such as IIIT-5K, ICDAR 2015, SVP-Perspective, and CUTE80, demonstrate that the proposed 2D-CTC model outperforms state-of-the-art methods on the text of both regular and irregular shapes. Moreover, 2D-CTC exhibits its superiority over prior art on training and testing speed. Our implementation and models of 2D-CTC will be made publicly available soon later.

READ FULL TEXT

page 1

page 2

page 3

page 7

research
09/18/2018

Scene Text Recognition from Two-Dimensional Perspective

Inspired by speech recognition, recent state-of-the-art algorithms mostl...
research
08/24/2023

LISTER: Neighbor Decoding for Length-Insensitive Scene Text Recognition

The diversity in length constitutes a significant characteristic of text...
research
03/18/2020

Scene Text Recognition via Transformer

Scene text recognition with arbitrary shape is very challenging due to l...
research
02/04/2020

GTC: Guided Training of CTC Towards Efficient and Accurate Scene Text Recognition

Connectionist Temporal Classification (CTC) and attention mechanism are ...
research
03/25/2020

SCATTER: Selective Context Attentional Scene Text Recognizer

Scene Text Recognition (STR), the task of recognizing text against compl...
research
11/09/2022

Portmanteauing Features for Scene Text Recognition

Scene text images have different shapes and are subjected to various dis...
research
05/09/2018

Edit Probability for Scene Text Recognition

We consider the scene text recognition problem under the attention-based...

Please sign up or login with your details

Forgot password? Click here to reset