Edit Probability for Scene Text Recognition

05/09/2018
by   Fan Bai, et al.
0

We consider the scene text recognition problem under the attention-based encoder-decoder framework, which is the state of the art. The existing methods usually employ a frame-wise maximal likelihood loss to optimize the models. When we train the model, the misalignment between the ground truth strings and the attention's output sequences of probability distribution, which is caused by missing or superfluous characters, will confuse and mislead the training process, and consequently make the training costly and degrade the recognition accuracy. To handle this problem, we propose a novel method called edit probability (EP) for scene text recognition. EP tries to effectively estimate the probability of generating a string from the output sequence of probability distribution conditioned on the input image, while considering the possible occurrences of missing/superfluous characters. The advantage lies in that the training process can focus on the missing, superfluous and unrecognized characters, and thus the impact of the misalignment problem can be alleviated or even overcome. We conduct extensive experiments on standard benchmarks, including the IIIT-5K, Street View Text and ICDAR datasets. Experimental results show that the EP can substantially boost scene text recognition performance.

READ FULL TEXT
research
08/26/2019

Adaptive Embedding Gate for Attention-Based Scene Text Recognition

Scene text recognition has attracted particular research interest becaus...
research
06/04/2018

NRTR: A No-Recurrence Sequence-to-Sequence Model For Scene Text Recognition

Scene text recognition has attracted a great many researches for decades...
research
02/21/2023

A3S: Adversarial learning of semantic representations for Scene-Text Spotting

Scene-text spotting is a task that predicts a text area on natural scene...
research
11/04/2019

Scene Text Recognition with Temporal Convolutional Encoder

Texts from scene images typically consist of several characters and exhi...
research
03/07/2022

A Glyph-driven Topology Enhancement Network for Scene Text Recognition

Attention-based methods by establishing one-dimensional (1D) and two-dim...
research
07/23/2019

2D-CTC for Scene Text Recognition

Scene text recognition has been an important, active research topic in c...
research
01/17/2019

SAFE: Scale Aware Feature Encoder for Scene Text Recognition

In this paper, we address the problem of having characters with differen...

Please sign up or login with your details

Forgot password? Click here to reset