Pyramid Mask Text Detector

03/28/2019
by   Jingchao Liu, et al.
0

Scene text detection, an essential step of scene text recognition system, is to locate text instances in natural scene images automatically. Some recent attempts benefiting from Mask R-CNN formulate scene text detection task as an instance segmentation problem and achieve remarkable performance. In this paper, we present a new Mask R-CNN based framework named Pyramid Mask Text Detector (PMTD) to handle the scene text detection. Instead of binary text mask generated by the existing Mask R-CNN based methods, our PMTD performs pixel-level regression under the guidance of location-aware supervision, yielding a more informative soft text mask for each text instance. As for the generation of text boxes, PMTD reinterprets the obtained 2D soft mask into 3D space and introduces a novel plane clustering algorithm to derive the optimal text box on the basis of 3D shape. Experiments on standard datasets demonstrate that the proposed PMTD brings consistent and noticeable gain and clearly outperforms state-of-the-art methods. Specifically, it achieves an F-measure of 80.13

READ FULL TEXT

page 1

page 3

page 5

page 7

page 8

research
11/21/2018

Scene Text Detection with Supervised Pyramid Context Network

Scene text detection methods based on deep learning have achieved remark...
research
11/22/2018

Mask R-CNN with Pyramid Attention Network for Scene Text Detection

In this paper, we present a new Mask R-CNN based text detection approach...
research
02/28/2020

DGST : Discriminator Guided Scene Text detector

Scene text detection task has attracted considerable attention in comput...
research
05/10/2018

Boosting up Scene Text Detectors with Guided CNN

Deep CNNs have achieved great success in text detection. Most of existin...
research
11/30/2020

BOTD: Bold Outline Text Detector

Recently, text detection for arbitrary shape has attracted more and more...
research
11/25/2019

Shape-aware Feature Extraction for Instance Segmentation

Modern instance segmentation approaches mainly adopt a sequential paradi...
research
09/08/2021

Mask is All You Need: Rethinking Mask R-CNN for Dense and Arbitrary-Shaped Scene Text Detection

Due to the large success in object detection and instance segmentation, ...

Please sign up or login with your details

Forgot password? Click here to reset