R2CNN: Rotational Region CNN for Orientation Robust Scene Text Detection

06/29/2017
by   Yingying Jiang, et al.
0

In this paper, we propose a novel method called Rotational Region CNN (R2CNN) for detecting arbitrary-oriented texts in natural scene images. The framework is based on Faster R-CNN [1] architecture. First, we use the Region Proposal Network (RPN) to generate axis-aligned bounding boxes that enclose the texts with different orientations. Second, for each axis-aligned text box proposed by RPN, we extract its pooled features with different pooled sizes and the concatenated features are used to simultaneously predict the text/non-text score, axis-aligned box and inclined minimum area box. At last, we use an inclined non-maximum suppression to get the detection results. Our approach achieves competitive results on text detection benchmarks: ICDAR 2015 and ICDAR 2013.

READ FULL TEXT

page 1

page 3

page 4

page 7

page 8

11/30/2017

ArbiText: Arbitrary-Oriented Text Detection in Unconstrained Scene

Arbitrary-oriented text detection in the wild is a very challenging task...
05/24/2016

DeepText: A Unified Framework for Text Proposal Generation and Text Detection in Natural Images

In this paper, we develop a novel unified framework called DeepText for ...
01/30/2018

Sliding Line Point Regression for Shape Robust Scene Text Detection

Traditional text detection methods mostly focus on quadrangle text. In t...
01/21/2020

A Hierarchical Location Normalization System for Text

It's natural these days for people to know the local events from massive...
03/14/2019

Learning Orientation-Estimation Convolutional Neural Network for Building Detection in Optical Remote Sensing Image

Benefiting from the great success of deep learning in computer vision, C...
02/28/2021

Leveraging Orientation for Weakly Supervised Object Detection with Application to Firearm Localization

Automatic detection of firearms is important for enhancing the security ...
06/26/2020

Text Detection on Roughly Placed Books by Leveraging a Learning-based Model Trained with Another Domain Data

Text detection enables us to extract rich information from images. In th...