MASTER: Multi-Aspect Non-local Network for Scene Text Recognition

10/07/2019
by   Ning Lu, et al.
0

Attention based scene text recognizers have gained huge success, which leverage a more compact intermediate representations to learn 1d- or 2d- attention by a RNN-based encoder-decoder architecture. However, such methods suffer from attention-drift problem because high similarity among encoded features lead to attention confusion under the RNN-based local attention mechanism. Moreover RNN-based methods have low efficiency due to poor parallelization. To overcome these problems, we propose the MASTER, a self-attention based scene text recognizer that (1) not only encodes the input-output attention, but also learns self-attention which encodes feature-feature and target-target relationships inside the encoder and decoder and (2) learns a more powerful and robust intermediate representation to spatial distortion and (3) owns a better training and evaluation efficiency. Extensive experiments on various benchmarks demonstrate the superior performance of our MASTER on both regular and irregular scene text.

READ FULL TEXT

page 1

page 4

research
02/04/2020

GTC: Guided Training of CTC Towards Efficient and Accurate Scene Text Recognition

Connectionist Temporal Classification (CTC) and attention mechanism are ...
research
06/04/2018

NRTR: A No-Recurrence Sequence-to-Sequence Model For Scene Text Recognition

Scene text recognition has attracted a great many researches for decades...
research
01/01/2022

SAFL: A Self-Attention Scene Text Recognizer with Focal Loss

In the last decades, scene text recognition has gained worldwide attenti...
research
05/10/2021

Primitive Representation Learning for Scene Text Recognition

Scene text recognition is a challenging task due to diverse variations o...
research
09/07/2017

Focusing Attention: Towards Accurate Text Recognition in Natural Images

Scene text recognition has been a hot research topic in computer vision ...
research
09/05/2022

Scene Text Recognition with Single-Point Decoding Network

In recent years, attention-based scene text recognition methods have bee...
research
12/11/2017

Attention networks for image-to-text

The paper approaches the problem of image-to-text with attention-based e...

Please sign up or login with your details

Forgot password? Click here to reset