Reading Text in the Wild with Convolutional Neural Networks

12/04/2014
by   Max Jaderberg, et al.
In this work we present an end-to-end system for text spotting -- localising and recognising text in natural scene images -- and text-based image retrieval. This system is based on a region proposal mechanism for detection and deep convolutional neural networks for recognition. Our pipeline uses a novel combination of complementary proposal generation techniques to ensure high recall, followed by a fast filtering stage to improve precision. For the recognition and ranking of proposals, we train very large convolutional neural networks to recognise the word in the whole proposal region at once, departing from the character-classifier-based systems of the past. These networks are trained solely on data produced by a synthetic text generation engine, requiring no human-labelled data. Analysing the stages of our pipeline, we show state-of-the-art performance throughout. We perform rigorous experiments across a number of standard end-to-end text spotting benchmarks and text-based image retrieval datasets, showing a large improvement over all previous methods. Finally, we demonstrate a real-world application of our text spotting system that makes thousands of hours of news footage instantly searchable via a text query.
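The stages named in the abstract (proposal generation, filtering, whole-word recognition, ranking) can be sketched as a small pipeline skeleton. This is only an illustrative outline, not the authors' implementation: `spot_text`, `Detection`, the callables `proposers`, `keep_score` and `recognise`, and the `keep_threshold` value are all hypothetical names and assumptions introduced here, and the real system's proposal generators and networks are stood in for by plain functions.

```python
from dataclasses import dataclass
from typing import Any, Callable, Iterable, List, Sequence, Tuple

# Axis-aligned box as (x0, y0, x1, y1); an assumed representation.
Box = Tuple[int, int, int, int]


@dataclass(frozen=True)
class Detection:
    box: Box
    word: str
    score: float


def spot_text(
    image: Any,
    proposers: Sequence[Callable[[Any], Iterable[Box]]],
    keep_score: Callable[[Any, Box], float],
    recognise: Callable[[Any, Box], Tuple[str, float]],
    keep_threshold: float = 0.5,  # hypothetical cut-off, not from the paper
) -> List[Detection]:
    """Hypothetical skeleton of a proposal-based text spotting pipeline."""
    # 1. Pool proposals from several complementary generators (for recall),
    #    dropping exact duplicates while preserving order.
    seen = set()
    proposals: List[Box] = []
    for propose in proposers:
        for box in propose(image):
            if box not in seen:
                seen.add(box)
                proposals.append(box)

    # 2. Cheap filtering stage: discard low-scoring boxes (for precision).
    kept = [box for box in proposals if keep_score(image, box) >= keep_threshold]

    # 3. Whole-word recognition: one (word, score) prediction per region,
    #    rather than per-character classification.
    detections = [Detection(box, *recognise(image, box)) for box in kept]

    # 4. Rank the surviving detections by recognition score.
    return sorted(detections, key=lambda d: d.score, reverse=True)
```

In this sketch the recogniser is any function that maps an image region to a word and a confidence, so a trained word-level network could be plugged in without changing the surrounding stages.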

research
06/09/2014

Synthetic Data and Artificial Neural Networks for Natural Scene Text Recognition

In this work we present a framework for the recognition of natural scene...
research
10/10/2017

AdaDNNs: Adaptive Ensemble of Deep Neural Networks for Scene Text Recognition

Recognizing text in the wild is a really challenging task because of com...
research
11/03/2019

Scene Graph based Image Retrieval – A case study on the CLEVR Dataset

With the prolification of multimodal interaction in various domains, rec...
research
11/25/2018

A pooling based scene text proposal technique for scene text reading in the wild

Automatic reading texts in scenes has attracted increasing interest in r...
research
04/10/2016

TextProposals: a Text-specific Selective Search Algorithm for Word Spotting in the Wild

Motivated by the success of powerful while expensive techniques to recog...
research
09/12/2016

Detecting Text in Natural Image with Connectionist Text Proposal Network

We propose a novel Connectionist Text Proposal Network (CTPN) that accur...
research
02/27/2017

DepthSynth: Real-Time Realistic Synthetic Data Generation from CAD Models for 2.5D Recognition

Recent progress in computer vision has been dominated by deep neural net...
