REZCR: A Zero-shot Character Recognition Method via Radical Extraction

07/12/2022
by   Xiaolei Diao, et al.
10

The long-tail effect is a common issue that limits the performance of deep learning models on real-world datasets. Character image dataset development is also affected by such unbalanced data distribution due to differences in character usage frequency. Thus, current character recognition methods are limited when applying to real-world datasets, in particular to the character categories in the tail which are lacking training samples, e.g., uncommon characters, or characters from historical documents. In this paper, we propose a zero-shot character recognition framework via radical extraction, i.e., REZCR, to improve the recognition performance of few-sample character categories, in which we exploit information on radicals, the graphical units of characters, by decomposing and reconstructing characters following orthography. REZCR consists of an attention-based radical information extractor (RIE) and a knowledge graph-based character reasoner (KGR). The RIE aims to recognize candidate radicals and their possible structural relations from character images. The results will be fed into KGR to recognize the target character by reasoning with a pre-designed character knowledge graph. We validate our method on multiple datasets, REZCR shows promising experimental results, especially for few-sample character datasets.

READ FULL TEXT

page 1

page 4

research
06/22/2021

Zero-Shot Chinese Character Recognition with Stroke-Level Decomposition

Chinese character recognition has attracted much research interest due t...
research
04/06/2021

Hippocampus-heuristic Character Recognition Network for Zero-shot Learning

The recognition of Chinese characters has always been a challenging task...
research
05/03/2021

Recognition of Oracle Bone Inscriptions by using Two Deep Learning Models

Oracle bone inscriptions (OBIs) contain some of the oldest characters in...
research
04/12/2022

Open-set Text Recognition via Character-Context Decoupling

The open-set text recognition task is an emerging challenge that require...
research
05/31/2021

Pho(SC)Net: An Approach Towards Zero-shot Word Image Recognition in Historical Documents

Annotating words in a historical document image archive for word image r...
research
07/21/2023

Character Time-series Matching For Robust License Plate Recognition

Automatic License Plate Recognition (ALPR) is becoming a popular study a...
research
12/22/2020

MailLeak: Obfuscation-Robust Character Extraction Using Transfer Learning

The following work presents a new algorithm for character recognition fr...

Please sign up or login with your details

Forgot password? Click here to reset