STAR: Zero-Shot Chinese Character Recognition with Stroke- and Radical-Level Decompositions

10/16/2022
by Jinshan Zeng, et al.

Zero-shot Chinese character recognition has attracted rising attention in recent years. Existing methods for this problem are mainly based on either low-level stroke-based decomposition or medium-level radical-based decomposition. Since stroke- and radical-level decompositions provide different levels of information, we propose an effective zero-shot Chinese character recognition method that combines them. The proposed method consists of a training stage and an inference stage. In the training stage, we adopt two similar encoder-decoder models to estimate the stroke and radical encodings; these estimates, together with the true encodings, are used to formulate the associated stroke and radical losses for training. A similarity loss is introduced to regularize the stroke and radical encoders so that the features they produce for the same character are highly correlated. In the inference stage, two key modules, i.e., the stroke screening module (SSM) and the feature matching module (FMM), are introduced to tackle the deterministic and confusing cases, respectively. In particular, we introduce an effective stroke rectification scheme in FMM to enlarge the candidate set of characters for final inference. Extensive experiments over three benchmark datasets covering the handwritten, printed artistic and street view scenarios are conducted to demonstrate the effectiveness of the proposed method. Numerical results show that the proposed method outperforms the state-of-the-art methods in both the character and radical zero-shot settings, and maintains competitive performance in the traditional seen character setting.
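
The abstract does not spell out how SSM and FMM work internally, so the following is only a minimal sketch of one plausible reading of the inference stage, not the authors' implementation. It assumes that the stroke decoder outputs a sequence of stroke-category IDs, that a prediction matching exactly one dictionary entry is the "deterministic" case, and that otherwise the candidate set is enlarged to all characters within a small edit distance of the prediction (standing in for stroke rectification) and ranked by cosine similarity of features. The names `recognize`, `max_edits`, `STROKE_DICT`, and `SUPPORT_FEATURES`, as well as the toy feature vectors, are hypothetical.

```python
from math import sqrt


def edit_distance(a, b):
    """Levenshtein distance between two stroke-ID sequences."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, start=1):
        curr = [i]
        for j, y in enumerate(b, start=1):
            curr.append(min(prev[j] + 1,              # deletion
                            curr[j - 1] + 1,          # insertion
                            prev[j - 1] + (x != y)))  # substitution
        prev = curr
    return prev[-1]


def cosine(u, v):
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    dot = sum(p * q for p, q in zip(u, v))
    norm = sqrt(sum(p * p for p in u)) * sqrt(sum(q * q for q in v))
    return dot / norm if norm else 0.0


def recognize(pred_strokes, query_feature, stroke_dict, support_features,
              max_edits=1):
    """Two-step inference sketch (assumed behaviour, see lead-in)."""
    # Screening step: an exact, unique dictionary match is taken as final.
    exact = [c for c, s in stroke_dict.items() if s == pred_strokes]
    if len(exact) == 1:
        return exact[0]
    # Confusing case: tolerate a few stroke errors to enlarge the candidates.
    candidates = [c for c, s in stroke_dict.items()
                  if edit_distance(s, pred_strokes) <= max_edits]
    if not candidates:
        candidates = list(stroke_dict)  # fall back to the full dictionary
    # Matching step: pick the candidate whose feature is most similar.
    return max(candidates,
               key=lambda c: cosine(query_feature, support_features[c]))


# Toy data. Stroke IDs: 1 horizontal, 2 vertical, 3 left-falling, 4 dot/right-falling.
STROKE_DICT = {"十": [1, 2], "土": [1, 2, 1], "士": [1, 2, 1], "大": [1, 3, 4]}
SUPPORT_FEATURES = {"十": [1.0, 0.0, 0.0], "土": [0.0, 1.0, 0.2],
                    "士": [0.0, 0.2, 1.0], "大": [1.0, 1.0, 0.0]}

unique = recognize([1, 3, 4], [0.0, 0.0, 0.0], STROKE_DICT, SUPPORT_FEATURES)
ambiguous = recognize([1, 2, 1], [0.0, 0.9, 0.1], STROKE_DICT, SUPPORT_FEATURES)
rectified = recognize([1, 2, 1, 1], [0.0, 0.1, 1.0], STROKE_DICT, SUPPORT_FEATURES)
```

In this toy dictionary 土 and 士 share the same stroke sequence, so stroke information alone cannot separate them and the feature comparison decides. The third call shows a prediction with one spurious stroke still reaching the right candidates through the edit-distance tolerance.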

research
06/22/2021

Zero-Shot Chinese Character Recognition with Stroke-Level Decomposition

Chinese character recognition has attracted much research interest due t...
research
11/24/2022

Chinese Character Recognition with Radical-Structured Stroke Trees

The flourishing blossom of deep learning has witnessed the rapid develop...
research
07/17/2022

Stroke-Based Autoencoders: Self-Supervised Learners for Efficient Zero-Shot Chinese Character Recognition

Chinese characters carry a wealth of morphological and semantic informat...
research
10/21/2021

HENet: Forcing a Network to Think More for Font Recognition

Although lots of progress were made in Text Recognition/OCR in recent ye...
research
11/03/2017

Radical analysis network for zero-shot learning in printed Chinese character recognition

Chinese characters have a huge set of character categories, more than 20...
research
11/11/2022

StrokeGAN+: Few-Shot Semi-Supervised Chinese Font Generation with Stroke Encoding

The generation of Chinese fonts has a wide range of applications. The cu...
research
07/30/2023

Count, Decode and Fetch: A New Approach to Handwritten Chinese Character Error Correction

Recently, handwritten Chinese character error correction has been greatl...
