Deformation Robust Text Spotting with Geometric Prior

08/31/2023
by   Xixuan Hao, et al.
0

The goal of text spotting is to perform text detection and recognition in an end-to-end manner. Although the diversity of luminosity and orientation in scene texts has been widely studied, the font diversity and shape variance of the same character are ignored in recent works, since most characters in natural images are rendered in standard fonts. To solve this problem, we present a Chinese Artistic Dataset, termed as ARText, which contains 33,000 artistic images with rich shape deformation and font diversity. Based on this database, we develop a deformation robust text spotting method (DR TextSpotter) to solve the recognition problem of complex deformation of characters in different fonts. Specifically, we propose a geometric prior module to highlight the important features based on the unsupervised landmark detection sub-network. A graph convolution network is further constructed to fuse the character features and landmark features, and then performs semantic reasoning to enhance the discrimination for different characters. The experiments are conducted on ARText and IC19-ReCTS datasets. Our results demonstrate the effectiveness of our proposed method.

READ FULL TEXT

page 4

page 9

page 10

research
02/21/2023

A3S: Adversarial learning of semantic representations for Scene-Text Spotting

Scene-text spotting is a task that predicts a text area on natural scene...
research
06/02/2022

Disentangled Generation Network for Enlarged License Plate Recognition and A Unified Dataset

License plate recognition plays a critical role in many practical applic...
research
09/03/2023

Orientation-Independent Chinese Text Recognition in Scene Images

Scene text recognition (STR) has attracted much attention due to its bro...
research
05/18/2021

I2C2W: Image-to-Character-to-Word Transformers for Accurate Scene Text Recognition

Leveraging the advances of natural language processing, most recent scen...
research
04/20/2020

Landmark Detection and 3D Face Reconstruction for Caricature using a Nonlinear Parametric Model

Caricature is an artistic abstraction of the human face by distorting or...
research
05/21/2023

Social Context-aware GCN for Video Character Search via Scene-prior Enhancement

With the increasing demand for intelligent services of online video plat...
research
03/04/2019

STEFANN: Scene Text Editor using Font Adaptive Neural Network

Textual information in a captured scene play important role in scene int...

Please sign up or login with your details

Forgot password? Click here to reset