MSR: Multi-Scale Shape Regression for Scene Text Detection

by   Chuhui Xue, et al.

State-of-the-art scene text detection techniques predict quadrilateral boxes which are prone to localization errors while dealing with long or curved text lines in scenes. This paper presents a novel multi-scale shape regression network (MSR) that is capable of locating scene texts of arbitrary orientations, shapes and lengths accurately. The MSR detects scene texts by predicting dense text boundary points instead of sparse quadrilateral vertices which often suffers from regression errors while dealing with long text lines. The detection by linking of dense boundary points also enables accurate localization of scene texts of arbitrary orientations and shapes whereas most existing techniques using quadrilaterals often include undesired background to the ensuing text recognition. Additionally, the multi-scale network extracts and fuses features at different scales concurrently and seamlessly which demonstrates superb tolerance to the text scale variation. Extensive experiments over several public datasets show that MSR obtains superior detection performance for both curved and arbitrarily oriented text lines of different lengths, e.g. 80.7 f-score for the CTW1500, 81.7 f-score for the MSRA-TD500, etc.


page 1

page 2

page 4

page 6

page 7


Detection and Rectification of Arbitrary Shaped Scene Texts by using Text Keypoints and Links

Detection and recognition of scene texts of arbitrary shapes remain a gr...

Accurate Scene Text Detection through Border Semantics Awareness and Bootstrapping

This paper presents a scene text detection technique that exploits boots...

ESIR: End-to-end Scene Text Recognition via Iterative Image Rectification

Automated recognition of texts in scenes has been a research challenge f...

FC2RN: A Fully Convolutional Corner Refinement Network for Accurate Multi-Oriented Scene Text Detection

Recent scene text detection works mainly focus on curve text detection. ...

A pooling based scene text proposal technique for scene text reading in the wild

Automatic reading texts in scenes has attracted increasing interest in r...

Accurate Text Localization in Natural Image with Cascaded Convolutional Text Network

We introduce a new top-down pipeline for scene text detection. We propos...

A Single Shot Text Detector with Scale-adaptive Anchors

Currently, most top-performing text detection networks tend to employ fi...

Please sign up or login with your details

Forgot password? Click here to reset