DeepAI AI Chat
Log In Sign Up

Boosting Scene Character Recognition by Learning Canonical Forms of Glyphs

by   Yizhi Wang, et al.
Peking University

As one of the fundamental problems in document analysis, scene character recognition has attracted considerable interests in recent years. But the problem is still considered to be extremely challenging due to many uncontrollable factors including glyph transformation, blur, noisy background, uneven illumination, etc. In this paper, we propose a novel methodology for boosting scene character recognition by learning canonical forms of glyphs, based on the fact that characters appearing in scene images are all derived from their corresponding canonical forms. Our key observation is that more discriminative features can be learned by solving specially-designed generative tasks compared to traditional classification-based feature learning frameworks. Specifically, we design a GAN-based model to make the learned deep feature of a given scene character be capable of reconstructing corresponding glyphs in a number of standard font styles. In this manner, we obtain deep features for scene characters that are more discriminative in recognition and less sensitive against the above-mentioned factors. Our experiments conducted on several publicly-available databases demonstrate the superiority of our method compared to the state of the art.


page 2

page 4

page 8

page 9

page 10


Exploring Font-independent Features for Scene Text Recognition

Scene text recognition (STR) has been extensively studied in last few ye...

Toward Understanding WordArt: Corner-Guided Transformer for Scene Text Recognition

Artistic text recognition is an extremely challenging task with a wide r...

Scene Text Recognition with Sliding Convolutional Character Models

Scene text recognition has attracted great interests from the computer v...

TextScanner: Reading Characters in Order for Robust Scene Text Recognition

Driven by deep learning and the large volume of data, scene text recogni...

Word Recognition with Deep Conditional Random Fields

Recognition of handwritten words continues to be an important problem in...

SAFE: Scale Aware Feature Encoder for Scene Text Recognition

In this paper, we address the problem of having characters with differen...

Character Keypoint-based Homography Estimation in Scanned Documents for Efficient Information Extraction

Precise homography estimation between multiple images is a pre-requisite...