PGNet: Real-time Arbitrarily-Shaped Text Spotting with Point Gathering Network

04/12/2021
by   Pengfei Wang, et al.
0

The reading of arbitrarily-shaped text has received increasing research attention. However, existing text spotters are mostly built on two-stage frameworks or character-based methods, which suffer from either Non-Maximum Suppression (NMS), Region-of-Interest (RoI) operations, or character-level annotations. In this paper, to address the above problems, we propose a novel fully convolutional Point Gathering Network (PGNet) for reading arbitrarily-shaped text in real-time. The PGNet is a single-shot text spotter, where the pixel-level character classification map is learned with proposed PG-CTC loss avoiding the usage of character-level annotations. With a PG-CTC decoder, we gather high-level character classification vectors from two-dimensional space and decode them into text symbols without NMS and RoI operations involved, which guarantees high efficiency. Additionally, reasoning the relations between each character and its neighbors, a graph refinement module (GRM) is proposed to optimize the coarse recognition and improve the end-to-end performance. Experiments prove that the proposed method achieves competitive accuracy, meanwhile significantly improving the running speed. In particular, in Total-Text, it runs at 46.7 FPS, surpassing the previous spotters with a large margin.

READ FULL TEXT

page 2

page 3

page 4

page 7

page 8

research
02/24/2020

ABCNet: Real-time Scene Text Spotting with Adaptive Bezier-Curve Network

Scene text detection and recognition has received increasing research at...
research
12/08/2020

MANGO: A Mask Attention Guided One-Stage Scene Text Spotter

Recently end-to-end scene text spotting has become a popular research to...
research
08/15/2019

A Single-Shot Arbitrarily-Shaped Text Detector based on Context Attended Multi-Task Learning

Detecting scene text of arbitrary shapes has been a challenging task ove...
research
04/14/2015

Efficient Scene Text Localization and Recognition with Local Character Refinement

An unconstrained end-to-end text localization and recognition method is ...
research
05/08/2021

ABCNet v2: Adaptive Bezier-Curve Network for Real-time End-to-end Text Spotting

End-to-end text-spotting, which aims to integrate detection and recognit...
research
06/06/2023

TextFormer: A Query-based End-to-End Text Spotter with Mixed Supervision

End-to-end text spotting is a vital computer vision task that aims to in...
research
10/07/2020

Unconstrained Text Detection in Manga

The detection and recognition of unconstrained text is an open problem i...

Please sign up or login with your details

Forgot password? Click here to reset