TextScanner: Reading Characters in Order for Robust Scene Text Recognition

12/28/2019
by   Zhaoyi Wan, et al.
23

Driven by deep learning and the large volume of data, scene text recognition has evolved rapidly in recent years. Formerly, RNN-attention based methods have dominated this field, but suffer from the problem of attention drift in certain situations. Lately, semantic segmentation based algorithms have proven effective at recognizing text of different forms (horizontal, oriented and curved). However, these methods may produce spurious characters or miss genuine characters, as they rely heavily on a thresholding procedure operated on segmentation maps. To tackle these challenges, we propose in this paper an alternative approach, called TextScanner, for scene text recognition. TextScanner bears three characteristics: (1) Basically, it belongs to the semantic segmentation family, as it generates pixel-wise, multi-channel segmentation maps for character class, position and order; (2) Meanwhile, akin to RNN-attention based methods, it also adopts RNN for context modeling; (3) Moreover, it performs paralleled prediction for character position and class, and ensures that characters are transcripted in correct order. The experiments on standard benchmark datasets demonstrate that TextScanner outperforms the state-of-the-art methods. Moreover, TextScanner shows its superiority in recognizing more difficult text such Chinese transcripts and aligning with target characters.

READ FULL TEXT

page 1

page 6

research
03/07/2022

A Glyph-driven Topology Enhancement Network for Scene Text Recognition

Attention-based methods by establishing one-dimensional (1D) and two-dim...
research
02/28/2020

DGST : Discriminator Guided Scene Text detector

Scene text detection task has attracted considerable attention in comput...
research
09/18/2018

Scene Text Recognition from Two-Dimensional Perspective

Inspired by speech recognition, recent state-of-the-art algorithms mostl...
research
11/22/2021

CDistNet: Perceiving Multi-Domain Character Distance for Robust Text Recognition

The attention-based encoder-decoder framework is becoming popular in sce...
research
07/12/2019

Boosting Scene Character Recognition by Learning Canonical Forms of Glyphs

As one of the fundamental problems in document analysis, scene character...
research
09/05/2022

Scene Text Recognition with Single-Point Decoding Network

In recent years, attention-based scene text recognition methods have bee...
research
12/29/2015

Robust Scene Text Recognition Using Sparse Coding based Features

In this paper, we propose an effective scene text recognition method usi...

Please sign up or login with your details

Forgot password? Click here to reset