Arbitrary Shape Text Detection via Boundary Transformer

05/11/2022
by   Shi-Xue Zhang, et al.
6

Arbitrary shape text detection is a challenging task due to its complexity and variety, e.g, various scales, random rotations, and curve shapes. In this paper, we propose an arbitrary shape text detector with a boundary transformer, which can accurately and directly locate text boundaries without any post-processing. Our method mainly consists of a boundary proposal module and an iteratively optimized boundary transformer module. The boundary proposal module consisting of multi-layer dilated convolutions will compute important prior information (including classification map, distance field, and direction field) for generating coarse boundary proposals meanwhile guiding the optimization of boundary transformer. The boundary transformer module adopts an encoder-decoder structure, in which the encoder is constructed by multi-layer transformer blocks with residual connection while the decoder is a simple multi-layer perceptron network (MLP). Under the guidance of prior information, the boundary transformer module will gradually refine the coarse boundary proposals via boundary deformation in an iterative manner. Furthermore, we propose a novel boundary energy loss (BEL) which introduces an energy minimization constraint and an energy monotonically decreasing constraint for every boundary optimization step. Extensive experiments on publicly available and challenging datasets demonstrate the state-of-the-art performance and promising efficiency of our method.

READ FULL TEXT

page 1

page 3

page 4

page 5

page 8

page 9

page 11

page 12

research
07/27/2021

Adaptive Boundary Proposal Network for Arbitrary Shape Text Detection

Arbitrary shape text detection is a challenging task due to the high com...
research
02/23/2023

A Convolutional-Transformer Network for Crack Segmentation with Boundary Awareness

Cracks play a crucial role in assessing the safety and durability of man...
research
10/06/2020

On the Sub-Layer Functionalities of Transformer Decoder

There have been significant efforts to interpret the encoder of Transfor...
research
12/07/2021

DCAN: Improving Temporal Action Detection via Dual Context Aggregation

Temporal action detection aims to locate the boundaries of action in the...
research
03/23/2019

Curve Text Detection with Local Segmentation Network and Curve Connection

Curve text or arbitrary shape text is very common in real-world scenario...
research
08/26/2022

Arbitrary Shape Text Detection via Segmentation with Probability Maps

Arbitrary shape text detection is a challenging task due to the signific...
research
06/25/2022

SC-Transformer++: Structured Context Transformer for Generic Event Boundary Detection

This report presents the algorithm used in the submission of Generic Eve...

Please sign up or login with your details

Forgot password? Click here to reset