Data Incubation – Synthesizing Missing Data for Handwriting Recognition

10/13/2021
by   Jen-Hao Rick Chang, et al.
0

In this paper, we demonstrate how a generative model can be used to build a better recognizer through the control of content and style. We are building an online handwriting recognizer from a modest amount of training samples. By training our controllable handwriting synthesizer on the same data, we can synthesize handwriting with previously underrepresented content (e.g., URLs and email addresses) and style (e.g., cursive and slanted). Moreover, we propose a framework to analyze a recognizer that is trained with a mixture of real and synthetic training data. We use the framework to optimize data synthesis and demonstrate significant improvement on handwriting recognition over a model trained on real data only. Overall, we achieve a 66 Error Rate.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/06/2021

Style Equalization: Unsupervised Learning of Controllable Generative Sequence Models

Controllable generative sequence models with the capability to extract a...
research
09/29/2019

Controllable Data Synthesis Method for Grammatical Error Correction

Due to the lack of parallel data in current Grammatical Error Correction...
research
09/28/2022

Synthesizing Annotated Image and Video Data Using a Rendering-Based Pipeline for Improved License Plate Recognition

An insufficient number of training samples is a common problem in neural...
research
04/12/2022

Content and Style Aware Generation of Text-line Images for Handwriting Recognition

Handwritten Text Recognition has achieved an impressive performance in p...
research
03/22/2022

Dataset Distillation by Matching Training Trajectories

Dataset distillation is the task of synthesizing a small dataset such th...
research
10/02/2019

A Deep Factorization of Style and Structure in Fonts

We propose a deep factorization model for typographic analysis that dise...
research
11/14/2014

Deep Belief Network Training Improvement Using Elite Samples Minimizing Free Energy

Nowadays this is very popular to use deep architectures in machine learn...

Please sign up or login with your details

Forgot password? Click here to reset