Parallel Scale-wise Attention Network for Effective Scene Text Recognition

04/25/2021
by Usman Sajid, et al.

This paper proposes a new text recognition network for scene-text images. Many state-of-the-art methods employ an attention mechanism in either the text encoder or the decoder for text alignment. Although encoder-based attention yields promising results, these schemes have noticeable limitations. They perform feature extraction (FE) and visual attention (VA) sequentially, which forces the attention mechanism to rely only on the final single-scale output of FE. Moreover, attention is applied directly to single-scale feature maps only, which limits how much it can contribute. To address these issues, we propose a new multi-scale, encoder-based attention network for text recognition that performs multi-scale FE and VA in parallel. The multi-scale channels are also regularly fused with one another so that they develop coordinated knowledge. Quantitative evaluation and robustness analysis on standard benchmarks demonstrate that the proposed network outperforms the state of the art in most cases.
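
The parallel multi-scale idea in the abstract can be illustrated with a toy sketch. This is not the authors' network: it works on a 1-D signal in pure Python, and every name here (`extract`, `attention_weights`, `fuse`, `parallel_encoder`) is hypothetical. It also assumes that FE and VA both read the same branch input and are combined by element-wise multiplication, and that fusion is a plain average across resized branches. The abstract specifies none of these details.

```python
import math

def extract(x):
    """Toy feature extraction (FE): 3-tap moving average with edge padding."""
    padded = [x[0]] + x + [x[-1]]
    return [(padded[i] + padded[i + 1] + padded[i + 2]) / 3.0 for i in range(len(x))]

def attention_weights(x):
    """Toy visual attention (VA): softmax over positions, rescaled by the
    length so that a flat input gets a weight of 1.0 everywhere."""
    peak = max(x)
    exps = [math.exp(v - peak) for v in x]
    total = sum(exps)
    return [len(x) * e / total for e in exps]

def downsample(x):
    """Halve the resolution by averaging adjacent pairs."""
    return [(x[i] + x[i + 1]) / 2.0 for i in range(0, len(x) - 1, 2)]

def upsample(x):
    """Double the resolution by repeating each value."""
    return [v for v in x for _ in range(2)]

def resize(x, n):
    """Resize to length n; assumes lengths differ by a power of two."""
    while len(x) > n:
        x = downsample(x)
    while len(x) < n:
        x = upsample(x)
    return x

def fuse(branches):
    """Assumed fusion rule: each scale becomes the average of all scales,
    with the other branches resized to its resolution."""
    fused = []
    for target in branches:
        resized = [resize(b, len(target)) for b in branches]
        fused.append([sum(vals) / len(branches) for vals in zip(*resized)])
    return fused

def parallel_encoder(signal, num_scales=3, num_stages=2):
    """Run FE and VA side by side on every scale, fusing after each stage."""
    branches = [list(signal)]
    for _ in range(num_scales - 1):
        branches.append(downsample(branches[-1]))
    for _ in range(num_stages):
        stage_out = []
        for b in branches:
            feats = extract(b)               # FE reads the branch input
            weights = attention_weights(b)   # VA reads the same input
            stage_out.append([f * w for f, w in zip(feats, weights)])
        branches = fuse(stage_out)           # regular cross-scale fusion
    return branches

if __name__ == "__main__":
    outputs = parallel_encoder([0.0, 0.0, 1.0, 3.0, 3.0, 1.0, 0.0, 0.0])
    print([len(b) for b in outputs])
```

In a sequential design, `attention_weights` would run once on the final output of the last `extract` call at a single resolution. Here attention is computed at every scale and every stage, and the fusion step lets each scale see what the others have highlighted.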

research
12/21/2019

Decoupled Attention Network for Text Recognition

Text recognition has attracted considerable research interests because o...
research
01/17/2019

SAFE: Scale Aware Feature Encoder for Scene Text Recognition

In this paper, we address the problem of having characters with differen...
research
03/01/2021

DR-TANet: Dynamic Receptive Temporal Attention Network for Street Scene Change Detection

Street scene change detection continues to capture researchers' interest...
research
05/10/2021

Primitive Representation Learning for Scene Text Recognition

Scene text recognition is a challenging task due to diverse variations o...
research
08/02/2018

Double Supervised Network with Attention Mechanism for Scene Text Recognition

In this paper, we propose Double Supervised Network with Attention Mecha...
research
04/20/2019

FACLSTM: ConvLSTM with Focused Attention for Scene Text Recognition

Scene text recognition has recently been widely treated as a sequence-to...
research
06/09/2020

Single Image Deraining via Scale-space Invariant Attention Neural Network

Image enhancement from degradation of rainy artifacts plays a critical r...
