RRPN++: Guidance Towards More Accurate Scene Text Detection

09/28/2020
by   Jianqi Ma, et al.
0

RRPN is among the outstanding scene text detection approaches, but the manually-designed anchor and coarse proposal refinement make the performance still far from perfection. In this paper, we propose RRPN++ to exploit the potential of RRPN-based model by several improvements. Based on RRPN, we propose the Anchor-free Pyramid Proposal Networks (APPN) to generate first-stage proposals, which adopts the anchor-free design to reduce proposal number and accelerate the inference speed. In our second stage, both the detection branch and the recognition branch are incorporated to perform multi-task learning. In inference stage, the detection branch outputs the proposal refinement and the recognition branch predicts the transcript of the refined text region. Further, the recognition branch also helps rescore the proposals and eliminate the false positive proposals by the jointing filtering strategy. With these enhancements, we boost the detection results by 6% of F-measure in ICDAR2015 compared to RRPN. Experiments conducted on other benchmarks also illustrate the superior performance and efficiency of our model.

READ FULL TEXT

page 1

page 4

page 8

research
04/24/2018

An Anchor-Free Region Proposal Network for Faster R-CNN based Text Detection Approaches

The anchor mechanism of Faster R-CNN and SSD framework is considered not...
research
07/18/2020

Bounding Maps for Universal Lesion Detection

Universal Lesion Detection (ULD) in computed tomography plays an essenti...
research
12/13/2018

FA-RPN: Floating Region Proposals for Face Detection

We propose a novel approach for generating region proposals for performi...
research
03/07/2022

Screentone-Preserved Manga Retargeting

As a popular comic style, manga offers a unique impression by utilizing ...
research
08/19/2020

Scene Text Detection with Selected Anchor

Object proposal technique with dense anchoring scheme for scene text det...
research
12/16/2020

Two-Stage Copy-Move Forgery Detection with Self Deep Matching and Proposal SuperGlue

Copy-move forgery detection identifies a tampered image by detecting pas...
research
11/16/2022

R-Pred: Two-Stage Motion Prediction Via Tube-Query Attention-Based Trajectory Refinement

Predicting the future motion of dynamic agents is of paramount importanc...

Please sign up or login with your details

Forgot password? Click here to reset