A Single Shot Text Detector with Scale-adaptive Anchors

07/05/2018
by   Qi Yuan, et al.

Currently, most top-performing text detection networks employ fixed-size anchor boxes to guide the search for text instances. They usually rely on a large number of anchors with different scales to discover text in scene images, which leads to high computational cost. In this paper, we propose an end-to-end box-based text detector with scale-adaptive anchors, which dynamically adjusts the scales of anchors according to the sizes of the underlying texts by introducing an additional scale regression layer. The proposed scale-adaptive anchors allow us to use a small number of anchors to handle multi-scale texts and therefore significantly improve computational efficiency. Moreover, compared to the discrete scales used in previous methods, the learned continuous scales are more reliable, especially for small text detection. Additionally, we propose Anchor convolution to better exploit the necessary feature information by dynamically adjusting the sizes of receptive fields according to the learned scales. Extensive experiments demonstrate that the proposed detector is fast, taking only 0.28 seconds per image, while outperforming most state-of-the-art methods in accuracy.
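The idea of a scale-adaptive anchor can be illustrated with a minimal sketch. The abstract does not give the exact parameterization, so everything here is an assumption made for illustration: the names `Box`, `adapt_anchor`, and `decode` are hypothetical, the scale is assumed to be regressed in log space, and the box decoding follows the common Faster R-CNN-style offset convention rather than anything confirmed for this paper.

```python
import math
from dataclasses import dataclass


@dataclass
class Box:
    cx: float  # center x
    cy: float  # center y
    w: float   # width
    h: float   # height


def adapt_anchor(base: Box, log_scale: float) -> Box:
    """Rescale a base anchor about its center by exp(log_scale).

    Assumption: the scale regression layer outputs one continuous
    log-scale value per anchor location.
    """
    s = math.exp(log_scale)
    return Box(base.cx, base.cy, base.w * s, base.h * s)


def decode(anchor: Box, dx: float, dy: float, dw: float, dh: float) -> Box:
    """Decode box offsets relative to the (already rescaled) anchor.

    Assumption: Faster R-CNN-style parameterization of the offsets.
    """
    return Box(
        anchor.cx + dx * anchor.w,
        anchor.cy + dy * anchor.h,
        anchor.w * math.exp(dw),
        anchor.h * math.exp(dh),
    )


# One base anchor covers small and large text by changing only the scale.
base = Box(cx=64.0, cy=64.0, w=32.0, h=32.0)
small = adapt_anchor(base, math.log(0.25))  # roughly 8 x 8
large = adapt_anchor(base, math.log(4.0))   # roughly 128 x 128
```

Under these assumptions, a single base anchor per location stands in for a whole set of fixed-size anchors, which is the source of the efficiency gain the abstract describes. Because the scale is continuous, it is also not restricted to a predefined set of discrete sizes.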


