Read, Listen, and See: Leveraging Multimodal Information Helps Chinese Spell Checking

05/26/2021
by   Heng-Da Xu, et al.
0

Chinese Spell Checking (CSC) aims to detect and correct erroneous characters for user-generated text in the Chinese language. Most of the Chinese spelling errors are misused semantically, phonetically or graphically similar characters. Previous attempts noticed this phenomenon and try to use the similarity for this task. However, these methods use either heuristics or handcrafted confusion sets to predict the correct character. In this paper, we propose a Chinese spell checker called ReaLiSe, by directly leveraging the multimodal information of the Chinese characters. The ReaLiSe model tackles the CSC task by (1) capturing the semantic, phonetic and graphic information of the input characters, and (2) selectively mixing the information in these modalities to predict the correct output. Experiments on the SIGHAN benchmarks show that the proposed model outperforms strong baselines by a large margin.

READ FULL TEXT

page 3

page 12

research
07/17/2022

Contextual Similarity is More Valuable than Character Similarity: Curriculum Learning for Chinese Spell Checking

Chinese Spell Checking (CSC) task aims to detect and correct Chinese spe...
research
04/26/2020

SpellGCN: Incorporating Phonological and Visual Similarities into Language Models for Chinese Spelling Check

Chinese Spelling Check (CSC) is a task to detect and correct spelling er...
research
05/24/2023

Disentangled Phonetic Representation for Chinese Spelling Correction

Chinese Spelling Correction (CSC) aims to detect and correct erroneous c...
research
08/26/2022

AiM: Taking Answers in Mind to Correct Chinese Cloze Tests in Educational Applications

To automatically correct handwritten assignments, the traditional approa...
research
05/05/2023

Block the Label and Noise: An N-Gram Masked Speller for Chinese Spell Checking

Recently, Chinese Spell Checking(CSC), a task to detect erroneous charac...
research
02/01/2021

Polyphone Disambiguition in Mandarin Chinese with Semi-Supervised Learning

The majority of Chinese characters are monophonic, i.e.their pronunciati...
research
10/25/2022

A Chinese Spelling Check Framework Based on Reverse Contrastive Learning

Chinese spelling check is a task to detect and correct spelling mistakes...

Please sign up or login with your details

Forgot password? Click here to reset