A CNN Based Scene Chinese Text Recognition Algorithm With Synthetic Data Engine

04/07/2016
by Xiaohang Ren, et al.

Scene text recognition plays an important role in many computer vision applications. The small size of publicly available scene text datasets is the main challenge when training a text recognition CNN model. In this paper, we propose a CNN-based Chinese text recognition algorithm. To enlarge the dataset for training the CNN model, we design a synthetic data engine for Chinese scene character generation, which generates representative character images according to the font usage frequency of Chinese texts. Because Chinese text is more complex than English text, we modify an English text recognition CNN architecture for Chinese text. To ensure that the small natural character dataset and the large artificial character dataset are comparable in training, the CNN model is trained progressively. The proposed Chinese text recognition algorithm is evaluated on two Chinese text datasets. The algorithm achieves better recognition accuracy than the baseline methods.
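The abstract does not give the details of the synthetic data engine, so the following is only a minimal sketch of one idea it names: choosing fonts in proportion to how often they are used. The font names, the frequency values, and the helper names `sample_fonts` and `plan_synthetic_samples` are all assumptions made for illustration. Rendering the character images themselves is left out.

```python
import random

# Hypothetical font usage frequencies (illustrative values, not from the paper).
FONT_FREQUENCIES = {
    "SimSun": 0.5,
    "SimHei": 0.3,
    "KaiTi": 0.2,
}


def sample_fonts(frequencies, n, seed=None):
    """Draw n font names with probability proportional to usage frequency."""
    rng = random.Random(seed)
    names = list(frequencies)
    weights = [frequencies[name] for name in names]
    return rng.choices(names, weights=weights, k=n)


def plan_synthetic_samples(characters, frequencies, per_char, seed=None):
    """Pair each character with per_char fonts sampled by frequency.

    Returns a list of (character, font_name) tuples. A real engine would
    render each pair to an image; that step is not shown here.
    """
    rng = random.Random(seed)
    names = list(frequencies)
    weights = [frequencies[name] for name in names]
    plan = []
    for ch in characters:
        for font in rng.choices(names, weights=weights, k=per_char):
            plan.append((ch, font))
    return plan
```

Under these assumptions, `plan_synthetic_samples("中文", FONT_FREQUENCIES, per_char=3)` would return six (character, font) pairs, with frequently used fonts appearing more often on average.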


