Count, Decode and Fetch: A New Approach to Handwritten Chinese Character Error Correction

07/30/2023
by   Pengfei Hu, et al.
0

Recently, handwritten Chinese character error correction has been greatly improved by employing encoder-decoder methods to decompose a Chinese character into an ideographic description sequence (IDS). However, existing methods implicitly capture and encode linguistic information inherent in IDS sequences, leading to a tendency to generate IDS sequences that match seen characters. This poses a challenge when dealing with an unseen misspelled character, as the decoder may generate an IDS sequence that matches a seen character instead. Therefore, we introduce Count, Decode and Fetch (CDF), a novel approach that exhibits better generalization towards unseen misspelled characters. CDF is mainly composed of three parts: the counter, the decoder, and the fetcher. In the first stage, the counter predicts the number of each radical class without the symbol-level position annotations. In the second stage, the decoder employs the counting information and generates the IDS sequence step by step. Moreover, by updating the counting information at each time step, the decoder becomes aware of the existence of each radical. With the decomposed IDS sequence, we can determine whether the given character is misspelled. If it is misspelled, the fetcher under the transductive transfer learning strategy predicts the ideal character that the user originally intended to write. We integrate our method into existing encoder-decoder models and significantly enhance their performance.

READ FULL TEXT
research
01/22/2018

Trajectory-based Radical Analysis Network for Online Handwritten Chinese Character Recognition

Recently, great progress has been made for online handwritten Chinese ch...
research
08/13/2018

DenseRAN for Offline Handwritten Chinese Character Recognition

Recently, great success has been achieved in offline handwritten Chinese...
research
11/03/2017

RAN: Radical analysis networks for zero-shot learning of Chinese characters

Chinese characters have a huge set of character categories, more than 20...
research
11/03/2017

Radical analysis network for zero-shot learning in printed Chinese character recognition

Chinese characters have a huge set of character categories, more than 20...
research
11/17/2021

Augmentation of base classifier performance via HMMs on a handwritten character data set

This paper presents results of a study of the performance of several bas...
research
11/24/2022

Chinese Character Recognition with Radical-Structured Stroke Trees

The flourishing blossom of deep learning has witnessed the rapid develop...
research
10/16/2022

STAR: Zero-Shot Chinese Character Recognition with Stroke- and Radical-Level Decompositions

Zero-shot Chinese character recognition has attracted rising attention i...

Please sign up or login with your details

Forgot password? Click here to reset