MNIST-MIX: A Multi-language Handwritten Digit Recognition Dataset

04/08/2020
by Weiwei Jiang, et al.

In this letter, we contribute a multi-language handwritten digit recognition dataset named MNIST-MIX, which is the largest dataset of its type in terms of both the number of languages and the number of data samples. Because it shares the same data format as MNIST, MNIST-MIX can be applied seamlessly in existing studies of handwritten digit recognition. By introducing digits from 10 different languages, MNIST-MIX becomes a more challenging dataset, and its class imbalance calls for more careful model design. We also present baseline results obtained by applying a LeNet model pre-trained on MNIST.
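
One common way to account for class imbalance of the kind mentioned above is to weight each class inversely to its frequency when computing the training loss. The following is a minimal sketch of that idea, not the method used in the letter. The label scheme (`language_index * 10 + digit`), the helper names, and the toy label counts are all hypothetical and chosen only for illustration.

```python
from collections import Counter


def combined_label(language_index, digit):
    # Hypothetical label scheme: one class per (language, digit) pair.
    # The actual MNIST-MIX label layout may differ.
    return language_index * 10 + digit


def balanced_class_weights(labels):
    """Return {class: weight} with weight = N / (K * n_c).

    N is the number of samples, K the number of distinct classes,
    and n_c the number of samples in class c. Rare classes receive
    larger weights, frequent classes smaller ones.
    """
    counts = Counter(labels)
    n_samples = len(labels)
    n_classes = len(counts)
    return {c: n_samples / (n_classes * n) for c, n in counts.items()}


# Toy, made-up label list: three classes with 6, 3, and 1 samples.
labels = (
    [combined_label(0, 3)] * 6
    + [combined_label(1, 3)] * 3
    + [combined_label(2, 7)] * 1
)

weights = balanced_class_weights(labels)
for cls in sorted(weights):
    print(cls, round(weights[cls], 3))
```

With this weighting, the weighted sample counts of all classes are equal, so each class contributes equally to a weighted loss regardless of how many samples it has.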

research
08/18/2020

Image Pre-processing on NumtaDB for Bengali Handwritten Digit Recognition

NumtaDB is by far the largest data-set collection for handwritten digits...
research
08/03/2019

Kannada-MNIST: A new handwritten digits dataset for the Kannada language

In this paper, we disseminate a new handwritten digits-dataset, termed K...
research
07/24/2018

Handwritten Digit Recognition by Elastic Matching

A simple model of MNIST handwritten digit recognition is presented here....
research
04/17/2019

TextCaps : Handwritten Character Recognition with Very Small Datasets

Many localized languages struggle to reap the benefits of recent advance...
research
09/11/2015

Learning Sparse Feature Representations using Probabilistic Quadtrees and Deep Belief Nets

Learning sparse feature representations is a useful instrument for solvi...
research
02/01/2011

High-Performance Neural Networks for Visual Object Classification

We present a fast, fully parameterizable GPU implementation of Convoluti...
research
04/27/2022

An Improved Nearest Neighbour Classifier

A windowed version of the Nearest Neighbour (WNN) classifier for images ...
