Gaussian Constrained Attention Network for Scene Text Recognition

10/19/2020
by Zhi Qiao, et al.

Scene text recognition has been a hot topic in computer vision. Recent methods adopt the attention mechanism for sequence prediction and achieve convincing results. However, we argue that the existing attention mechanism suffers from attention diffusion, in which the model may fail to focus on a specific character region. In this paper, we propose the Gaussian Constrained Attention Network to deal with this problem. It is a 2D attention-based method integrated with a novel Gaussian Constrained Refinement Module, which predicts an additional Gaussian mask to refine the attention weights. Rather than simply adding supervision on the attention weights, our proposed method introduces an explicit refinement. As a result, the attention weights become more concentrated and the attention-based recognition network achieves better performance. The proposed Gaussian Constrained Refinement Module is flexible and can be applied directly to existing attention-based methods. Experiments on several benchmark datasets demonstrate the effectiveness of the proposed method. Our code is available at https://github.com/Pay20Y/GCAN.
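To make the idea of refining attention weights with a Gaussian mask concrete, the following is a minimal NumPy sketch and not the authors' implementation. The abstract does not say how the mask is parameterized or combined with the attention map, so this sketch assumes an isotropic Gaussian with a given center and width, element-wise multiplication, and renormalization. The names `gaussian_mask` and `refine_attention`, and the parameters `mu_x`, `mu_y`, and `sigma`, are hypothetical; in the actual network these values would presumably be predicted by the refinement module.

```python
import numpy as np

def gaussian_mask(height, width, mu_x, mu_y, sigma):
    """Isotropic 2D Gaussian over a (height, width) grid, peaking at 1 on (mu_y, mu_x)."""
    ys, xs = np.mgrid[0:height, 0:width]
    sq_dist = (xs - mu_x) ** 2 + (ys - mu_y) ** 2
    return np.exp(-sq_dist / (2.0 * sigma ** 2))

def refine_attention(attn, mu_x, mu_y, sigma, eps=1e-8):
    """Assumed combination rule: multiply attention by the mask, then renormalize to sum to 1."""
    mask = gaussian_mask(attn.shape[0], attn.shape[1], mu_x, mu_y, sigma)
    refined = attn * mask
    return refined / (refined.sum() + eps)

# Toy example: a fully diffuse (uniform) attention map over an 8x32 feature map.
attn = np.full((8, 32), 1.0 / (8 * 32))

# Hypothetical Gaussian parameters standing in for the module's prediction.
refined = refine_attention(attn, mu_x=10.0, mu_y=4.0, sigma=2.0)

# The refined map is still a distribution, but its mass is concentrated near (4, 10).
peak = np.unravel_index(refined.argmax(), refined.shape)
```

In this toy case the uniform map becomes sharply peaked around the assumed character center, which is the qualitative effect the abstract describes as more concentrated attention weights.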


