R2CNN: Rotational Region CNN for Orientation Robust Scene Text Detection

by Yingying Jiang, et al.

In this paper, we propose a novel method called Rotational Region CNN (R2CNN) for detecting arbitrarily oriented text in natural scene images. The framework is based on the Faster R-CNN [1] architecture. First, we use the Region Proposal Network (RPN) to generate axis-aligned bounding boxes that enclose text of different orientations. Second, for each axis-aligned text box proposed by the RPN, we extract pooled features at several pooled sizes and concatenate them to simultaneously predict the text/non-text score, the axis-aligned box, and the inclined minimum-area box. Finally, we apply inclined non-maximum suppression to obtain the detection results. Our approach achieves competitive results on the ICDAR 2015 and ICDAR 2013 text detection benchmarks.
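The inclined non-maximum suppression step can be illustrated with a minimal Python sketch. The abstract does not say how overlap between inclined boxes is computed, so the sketch assumes each box is a convex quadrilateral given as four (x, y) corners and measures overlap by polygon clipping (Sutherland-Hodgman). The function names and the threshold value of 0.3 are hypothetical choices for illustration, not the authors' implementation.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, float]
Polygon = Sequence[Point]  # convex polygon, vertices listed in order


def signed_area(poly: Polygon) -> float:
    """Shoelace formula; the sign encodes the vertex orientation."""
    n = len(poly)
    total = 0.0
    for i in range(n):
        x1, y1 = poly[i]
        x2, y2 = poly[(i + 1) % n]
        total += x1 * y2 - x2 * y1
    return total / 2.0


def clip_convex(subject: Polygon, clip: Polygon) -> List[Point]:
    """Sutherland-Hodgman clipping of `subject` against a convex `clip`
    polygon whose signed area is positive."""
    output = list(subject)
    for i in range(len(clip)):
        if not output:
            break  # nothing left to clip: the polygons do not overlap
        ax, ay = clip[i]
        bx, by = clip[(i + 1) % len(clip)]

        def side(p: Point) -> float:
            # >= 0 when p lies on the inner side of the edge a -> b
            return (bx - ax) * (p[1] - ay) - (by - ay) * (p[0] - ax)

        inputs, output = output, []
        for j in range(len(inputs)):
            cur, prev = inputs[j], inputs[j - 1]
            s_cur, s_prev = side(cur), side(prev)
            if (s_cur >= 0) != (s_prev >= 0):
                # the segment prev -> cur crosses the edge: keep the crossing
                t = s_prev / (s_prev - s_cur)
                output.append((prev[0] + t * (cur[0] - prev[0]),
                               prev[1] + t * (cur[1] - prev[1])))
            if s_cur >= 0:
                output.append(cur)
    return output


def inclined_iou(a: Polygon, b: Polygon) -> float:
    """Intersection over union of two convex polygons."""
    if signed_area(b) < 0:
        b = list(reversed(b))  # normalise the orientation of the clip polygon
    inter_poly = clip_convex(a, b)
    inter = abs(signed_area(inter_poly)) if len(inter_poly) >= 3 else 0.0
    union = abs(signed_area(a)) + abs(signed_area(b)) - inter
    return inter / union if union > 0 else 0.0


def inclined_nms(boxes: Sequence[Polygon], scores: Sequence[float],
                 iou_threshold: float = 0.3) -> List[int]:
    """Greedy NMS: visit boxes by descending score and keep a box only if
    its IoU with every already kept box is at most the threshold."""
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep: List[int] = []
    for i in order:
        if all(inclined_iou(boxes[i], boxes[k]) <= iou_threshold for k in keep):
            keep.append(i)
    return keep
```

Calling `inclined_nms(boxes, scores)` returns the indices of the surviving boxes, highest score first. The loop is the same greedy procedure as axis-aligned NMS; only the overlap measure changes, which lets closely spaced inclined text lines survive when their axis-aligned envelopes would overlap heavily.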



ArbiText: Arbitrary-Oriented Text Detection in Unconstrained Scene

Arbitrary-oriented text detection in the wild is a very challenging task...

DeepText: A Unified Framework for Text Proposal Generation and Text Detection in Natural Images

In this paper, we develop a novel unified framework called DeepText for ...

Sliding Line Point Regression for Shape Robust Scene Text Detection

Traditional text detection methods mostly focus on quadrangle text. In t...

A Hierarchical Location Normalization System for Text

It's natural these days for people to know the local events from massive...

Learning Orientation-Estimation Convolutional Neural Network for Building Detection in Optical Remote Sensing Image

Benefiting from the great success of deep learning in computer vision, C...

Leveraging Orientation for Weakly Supervised Object Detection with Application to Firearm Localization

Automatic detection of firearms is important for enhancing the security ...

Text Detection on Roughly Placed Books by Leveraging a Learning-based Model Trained with Another Domain Data

Text detection enables us to extract rich information from images. In th...