Reading Scene Text in Deep Convolutional Sequences

06/14/2015
by   Pan He, et al.
0

We develop a Deep-Text Recurrent Network (DTRN) that regards scene text reading as a sequence labelling problem. We leverage recent advances of deep convolutional neural networks to generate an ordered high-level sequence from a whole word image, avoiding the difficult character segmentation problem. Then a deep recurrent model, building on long short-term memory (LSTM), is developed to robustly recognize the generated CNN sequences, departing from most existing approaches recognising each character independently. Our model has a number of appealing properties in comparison to existing scene text recognition methods: (i) It can recognise highly ambiguous words by leveraging meaningful context information, allowing it to work reliably without either pre- or post-processing; (ii) the deep CNN feature is robust to various image distortions; (iii) it retains the explicit order information in word image, which is essential to discriminate word strings; (iv) the model does not depend on pre-defined dictionary, and it can process unknown words and arbitrary strings. Codes for the DTRN will be available.

READ FULL TEXT

page 1

page 5

page 6

research
01/06/2016

Memory Matters: Convolutional Recurrent Neural Network for Scene Text Recognition

Text recognition in natural scene is a challenging problem due to the ma...
research
01/21/2016

Reading Car License Plates Using Deep Convolutional Neural Networks and LSTMs

In this work, we tackle the problem of car license plate detection and r...
research
09/06/2017

Scene Text Recognition with Sliding Convolutional Character Models

Scene text recognition has attracted great interests from the computer v...
research
02/08/2017

Character-level Deep Conflation for Business Data Analytics

Connecting different text attributes associated with the same entity (co...
research
09/13/2017

Reading Scene Text with Attention Convolutional Sequence Modeling

Reading text in the wild is a challenging task in the field of computer ...
research
02/25/2018

One Single Deep Bidirectional LSTM Network for Word Sense Disambiguation of Text Data

Due to recent technical and scientific advances, we have a wealth of inf...
research
06/16/2018

Offline Extraction of Indic Regional Language from Natural Scene Image using Text Segmentation and Deep Convolutional Sequence

Regional language extraction from a natural scene image is always a chal...

Please sign up or login with your details

Forgot password? Click here to reset