Character decomposition to resolve class imbalance problem in Hangul OCR

08/12/2022
by   Geonuk Kim, et al.
0

We present a novel approach to OCR(Optical Character Recognition) of Korean character, Hangul. As a phonogram, Hangul can represent 11,172 different characters with only 52 graphemes, by describing each character with a combination of the graphemes. As the total number of the characters could overwhelm the capacity of a neural network, the existing OCR encoding methods pre-define a smaller set of characters that are frequently used. This design choice naturally compromises the performance on long-tailed characters in the distribution. In this work, we demonstrate that grapheme encoding is not only efficient but also performant for Hangul OCR. Benchmark tests show that our approach resolves two main problems of Hangul OCR: class imbalance and target class selection.

READ FULL TEXT
research
09/28/2020

A complete character recognition and transliteration technique for Devanagari script

Transliteration involves transformation of one script to another based o...
research
02/22/2010

Handwritten Bangla Basic and Compound character recognition using MLP and SVM classifier

A novel approach for recognition of handwritten compound Bangla characte...
research
06/23/2021

CharacterChat: Supporting the Creation of Fictional Characters through Conversation and Progressive Manifestation with a Chatbot

We present CharacterChat, a concept and chatbot to support writers in cr...
research
06/04/2023

Encryption by using base-n systems with many characters

It is possible to interpret text as numbers (and vice versa) if one inte...
research
07/04/2019

A Novel Approach to OCR using Image Recognition based Classification for Ancient Tamil Inscriptions in Temples

Recognition of ancient Tamil characters has always been a challenge for ...
research
06/16/2021

Automatic Main Character Recognition for Photographic Studies

Main characters in images are the most important humans that catch the v...
research
04/29/2020

Measuring Information Propagation in Literary Social Networks

We present the task of modeling information propagation in literature, i...

Please sign up or login with your details

Forgot password? Click here to reset