Zero-Shot Chinese Character Recognition with Stroke-Level Decomposition

06/22/2021
by   Jingye Chen, et al.
0

Chinese character recognition has attracted much research interest due to its wide applications. Although it has been studied for many years, some issues in this field have not been completely resolved yet, e.g. the zero-shot problem. Previous character-based and radical-based methods have not fundamentally addressed the zero-shot problem since some characters or radicals in test sets may not appear in training sets under a data-hungry condition. Inspired by the fact that humans can generalize to know how to write characters unseen before if they have learned stroke orders of some characters, we propose a stroke-based method by decomposing each character into a sequence of strokes, which are the most basic units of Chinese characters. However, we observe that there is a one-to-many relationship between stroke sequences and Chinese characters. To tackle this challenge, we employ a matching-based strategy to transform the predicted stroke sequence to a specific character. We evaluate the proposed method on handwritten characters, printed artistic characters, and scene characters. The experimental results validate that the proposed method outperforms existing methods on both character zero-shot and radical zero-shot tasks. Moreover, the proposed method can be easily generalized to other languages whose characters can be decomposed into strokes.

READ FULL TEXT
research
04/06/2021

Hippocampus-heuristic Character Recognition Network for Zero-shot Learning

The recognition of Chinese characters has always been a challenging task...
research
10/16/2022

STAR: Zero-Shot Chinese Character Recognition with Stroke- and Radical-Level Decompositions

Zero-shot Chinese character recognition has attracted rising attention i...
research
09/03/2023

Chinese Text Recognition with A Pre-Trained CLIP-Like Model Through Image-IDS Aligning

Scene text recognition has been studied for decades due to its broad app...
research
07/17/2022

Stroke-Based Autoencoders: Self-Supervised Learners for Efficient Zero-Shot Chinese Character Recognition

Chinese characters carry a wealth of morphological and semantic informat...
research
11/24/2022

Chinese Character Recognition with Radical-Structured Stroke Trees

The flourishing blossom of deep learning has witnessed the rapid develop...
research
07/12/2022

REZCR: A Zero-shot Character Recognition Method via Radical Extraction

The long-tail effect is a common issue that limits the performance of de...
research
12/12/2022

Diff-Font: Diffusion Model for Robust One-Shot Font Generation

Font generation is a difficult and time-consuming task, especially in th...

Please sign up or login with your details

Forgot password? Click here to reset