Text-Attentional Convolutional Neural Networks for Scene Text Detection

10/12/2015
by Tong He, et al.

Recent deep learning models have demonstrated strong capabilities for classifying text and non-text components in natural images. They extract a high-level feature computed globally from a whole image component (patch), where cluttered background information may dominate the true text features in the deep representation. This reduces discriminative power and robustness. In this work, we present a new system for scene text detection built on a novel Text-Attentional Convolutional Neural Network (Text-CNN) that focuses on extracting text-related regions and features from the image components. We develop a new learning mechanism to train the Text-CNN with rich, multi-level supervised information, including the text region mask, the character label, and binary text/non-text information. This rich supervision gives the Text-CNN a strong capability for discriminating ambiguous text, and also increases its robustness against complicated background components. The training process is formulated as a multi-task learning problem, where the low-level supervised information greatly facilitates the main task of text/non-text classification. In addition, a powerful low-level detector called Contrast-Enhancement Maximally Stable Extremal Regions (CE-MSERs) is developed, which extends the widely used MSERs by enhancing the intensity contrast between text patterns and background. This allows it to detect highly challenging text patterns, resulting in higher recall. Our approach achieved promising results on the ICDAR 2013 dataset, with an F-measure of 0.82, improving the state-of-the-art results substantially.
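
The multi-task formulation described above can be illustrated with a minimal sketch. It assumes a weighted sum of three terms: a binary text/non-text loss (the main task), a character-label loss, and a text-region-mask loss. The specific loss functions, the weights `w_char` and `w_mask`, and all function names here are illustrative assumptions; the abstract does not specify them.

```python
import math

def binary_cross_entropy(p, y):
    """Main task: text (y=1) vs. non-text (y=0), given predicted probability p."""
    eps = 1e-12
    p = min(max(p, eps), 1.0 - eps)  # clamp to avoid log(0)
    return -(y * math.log(p) + (1 - y) * math.log(1.0 - p))

def softmax_cross_entropy(logits, label):
    """Auxiliary task: character-label classification from raw logits."""
    m = max(logits)  # subtract the max for numerical stability
    log_sum = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum - logits[label]

def mask_l2(pred_mask, true_mask):
    """Auxiliary task: mean squared error between flattened region masks."""
    return sum((p - t) ** 2 for p, t in zip(pred_mask, true_mask)) / len(pred_mask)

def multi_task_loss(text_prob, is_text, char_logits, char_label,
                    pred_mask, true_mask, w_char=1.0, w_mask=1.0):
    """Weighted sum of the main loss and the two auxiliary losses.

    Assumption: w_char and w_mask are hypothetical balancing weights, and
    the character term is skipped when no character label is available
    (char_label is None), e.g. for a non-text component.
    """
    loss = binary_cross_entropy(text_prob, is_text)
    if char_label is not None:
        loss += w_char * softmax_cross_entropy(char_logits, char_label)
    loss += w_mask * mask_l2(pred_mask, true_mask)
    return loss

# Usage: one text component with a two-class character head and a 2-pixel mask.
example = multi_task_loss(0.5, 1, [0.0, 0.0], 1, [1.0, 0.0], [1.0, 1.0])
```

In a sketch like this, the auxiliary terms only shape the shared features during training; at test time only the text/non-text output would be needed.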


