STN-OCR: A single Neural Network for Text Detection and Text Recognition

07/27/2017
by   Christian Bartz, et al.
0

Detecting and recognizing text in natural scene images is a challenging, yet not completely solved task. In re- cent years several new systems that try to solve at least one of the two sub-tasks (text detection and text recognition) have been proposed. In this paper we present STN-OCR, a step towards semi-supervised neural networks for scene text recognition, that can be optimized end-to-end. In contrast to most existing works that consist of multiple deep neural networks and several pre-processing steps we propose to use a single deep neural network that learns to detect and recognize text from natural images in a semi-supervised way. STN-OCR is a network that integrates and jointly learns a spatial transformer network, that can learn to detect text regions in an image, and a text recognition network that takes the identified text regions and recognizes their textual content. We investigate how our model behaves on a range of different tasks (detection and recognition of characters, and lines of text). Experimental results on public benchmark datasets show the ability of our model to handle a variety of different tasks, without substantial changes in its overall network structure.

READ FULL TEXT

page 3

page 7

page 8

research
12/14/2017

SEE: Towards Semi-Supervised End-to-End Scene Text Recognition

Detecting and recognizing text in natural scene images is a challenging,...
research
07/30/2019

Towards Pure End-to-End Learning for Recognizing Multiple Text Sequences from an Image

Here we address a challenging problem: recognizing multiple text sequenc...
research
11/27/2018

A Compositional Textual Model for Recognition of Imperfect Word Images

Printed text recognition is an important problem for industrial OCR syst...
research
08/04/2019

Deep Neural Network for Semantic-based Text Recognition in Images

State-of-the-art text spotting systems typically aim to detect isolated ...
research
09/26/2017

Towards End-to-End Car License Plates Detection and Recognition with Deep Neural Networks

In this work, we tackle the problem of car license plate detection and r...
research
07/02/2019

Semi-Bagging Based Deep Neural Architecture to Extract Text from High Entropy Images

Extracting texts of various size and shape from images containing multip...
research
11/17/2020

Digging Deeper into CRNN Model in Chinese Text Images Recognition

Automatic text image recognition is a prevalent application in computer ...

Please sign up or login with your details

Forgot password? Click here to reset