Fonts-2-Handwriting: A Seed-Augment-Train framework for universal digit classification

05/16/2019
by   Vinay Uday Prabhu, et al.
7

In this paper, we propose a Seed-Augment-Train/Transfer (SAT) framework that contains a synthetic seed image dataset generation procedure for languages with different numeral systems using freely available open font file datasets. This seed dataset of images is then augmented to create a purely synthetic training dataset, which is in turn used to train a deep neural network and test on held-out real world handwritten digits dataset spanning five Indic scripts, Kannada, Tamil, Gujarati, Malayalam, and Devanagari. We showcase the efficacy of this approach both qualitatively, by training a Boundary-seeking GAN (BGAN) that generates realistic digit images in the five languages, and also quantitatively by testing a CNN trained on the synthetic data on the real-world datasets. This establishes not only an interesting nexus between the font-datasets-world and transfer learning but also provides a recipe for universal-digit classification in any script.

READ FULL TEXT

page 7

page 8

page 9

page 11

research
12/12/2022

Synthetic Image Data for Deep Learning

Realistic synthetic image data rendered from 3D models can be used to au...
research
04/17/2018

Synthetic data generation for Indic handwritten text recognition

This paper presents a novel approach to generate synthetic dataset for h...
research
12/24/2020

Seed Phenotyping on Neural Networks using Domain Randomization and Transfer Learning

Seed phenotyping is the idea of analyzing the morphometric characteristi...
research
11/30/2020

Sim2SG: Sim-to-Real Scene Graph Generation for Transfer Learning

Scene graph (SG) generation has been gaining a lot of traction recently....
research
03/29/2021

Classification of Seeds using Domain Randomization on Self-Supervised Learning Frameworks

The first step toward Seed Phenotyping i.e. the comprehensive assessment...
research
10/06/2021

Seed Classification using Synthetic Image Datasets Generated from Low-Altitude UAV Imagery

Plant breeding programs extensively monitor the evolution of seed kernel...
research
02/05/2021

Validating Seed Data Samples for Synthetic Identities – Methodology and Uniqueness Metrics

This work explores the identity attribute of synthetic face samples deri...

Please sign up or login with your details

Forgot password? Click here to reset