R2CNN: Rotational Region CNN for Orientation Robust Scene Text Detection

06/29/2017
by   Yingying Jiang, et al.
0

In this paper, we propose a novel method called Rotational Region CNN (R2CNN) for detecting arbitrary-oriented texts in natural scene images. The framework is based on Faster R-CNN [1] architecture. First, we use the Region Proposal Network (RPN) to generate axis-aligned bounding boxes that enclose the texts with different orientations. Second, for each axis-aligned text box proposed by RPN, we extract its pooled features with different pooled sizes and the concatenated features are used to simultaneously predict the text/non-text score, axis-aligned box and inclined minimum area box. At last, we use an inclined non-maximum suppression to get the detection results. Our approach achieves competitive results on text detection benchmarks: ICDAR 2015 and ICDAR 2013.

READ FULL TEXT

page 1

page 3

page 4

page 7

page 8

research
11/30/2017

ArbiText: Arbitrary-Oriented Text Detection in Unconstrained Scene

Arbitrary-oriented text detection in the wild is a very challenging task...
research
05/24/2016

DeepText: A Unified Framework for Text Proposal Generation and Text Detection in Natural Images

In this paper, we develop a novel unified framework called DeepText for ...
research
01/30/2018

Sliding Line Point Regression for Shape Robust Scene Text Detection

Traditional text detection methods mostly focus on quadrangle text. In t...
research
01/21/2020

A Hierarchical Location Normalization System for Text

It's natural these days for people to know the local events from massive...
research
02/28/2021

Leveraging Orientation for Weakly Supervised Object Detection with Application to Firearm Localization

Automatic detection of firearms is important for enhancing the security ...
research
07/06/2021

Plot2Spectra: an Automatic Spectra Extraction Tool

Different types of spectroscopies, such as X-ray absorption near edge st...
research
06/26/2020

Text Detection on Roughly Placed Books by Leveraging a Learning-based Model Trained with Another Domain Data

Text detection enables us to extract rich information from images. In th...

Please sign up or login with your details

Forgot password? Click here to reset