NumtaDB - Assembled Bengali Handwritten Digits

06/06/2018
by   Samiul Alam, et al.
0

To benchmark Bengali digit recognition algorithms, a large publicly available dataset is required which is free from biases originating from geographical location, gender, and age. With this aim in mind, NumtaDB, a dataset consisting of more than 85,000 images of hand-written Bengali digits, has been assembled. This paper documents the collection and curation process of numerals along with the salient statistics of the dataset.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/22/2017

BanglaLekha-Isolated: A Comprehensive Bangla Handwritten Character Dataset

Bangla handwriting recognition is becoming a very important issue nowada...
research
04/29/2020

MatrriVasha: Bangla Handwritten Compound Character Dataset and Recognition

At present, recognition of the Bangla handwriting compound character has...
research
04/29/2020

MatriVasha: A Multipurpose Comprehensive Database for Bangla Handwritten Compound Characters

At present, recognition of the Bangla handwriting compound character has...
research
01/23/2018

The WiLI benchmark dataset for written language identification

This paper describes the WiLI-2018 benchmark dataset for monolingual wri...
research
05/25/2023

CENSUS-HWR: a large training dataset for offline handwriting recognition

Progress in Automated Handwriting Recognition has been hampered by the l...
research
08/16/2022

The LAM Dataset: A Novel Benchmark for Line-Level Handwritten Text Recognition

Handwritten Text Recognition (HTR) is an open problem at the intersectio...
research
06/20/2022

Open Set Classification of Untranscribed Handwritten Documents

Huge amounts of digital page images of important manuscripts are preserv...

Please sign up or login with your details

Forgot password? Click here to reset