Brno Mobile OCR Dataset

07/02/2019
by   Martin Kišš, et al.
2

We introduce the Brno Mobile OCR Dataset (B-MOD) for document Optical Character Recognition from low-quality images captured by handheld mobile devices. While OCR of high-quality scanned documents is a mature field where many commercial tools are available, and large datasets of text in the wild exist, no existing datasets can be used to develop and test document OCR methods robust to non-uniform lighting, image blur, strong noise, built-in denoising, sharpening, compression and other artifacts present in many photographs from mobile devices. This dataset contains 2 113 unique pages from random scientific papers, which were photographed by multiple people using 23 different mobile devices. The resulting 19 728 photographs of various visual quality are accompanied by precise positions and text annotations of 500k text lines. We further provide an evaluation methodology, including an evaluation server and a testset with non-public annotations. We provide a state-of-the-art text recognition baseline build on convolutional and recurrent neural networks trained with Connectionist Temporal Classification loss. This baseline achieves 2 on easy, medium and hard parts of the dataset, respectively, confirming that the dataset is challenging. The presented dataset will enable future development and evaluation of document analysis for low-quality images. It is primarily intended for line-level text recognition, and can be further used for line localization, layout analysis, image restoration and text binarization.

READ FULL TEXT

page 1

page 3

page 4

page 5

research
07/16/2018

MIDV-500: A Dataset for Identity Documents Analysis and Recognition on Mobile Devices in Video Stream

A lot of research has been devoted to identity documents analysis and re...
research
12/25/2019

DDI-100: Dataset for Text Detection and Recognition

Nowadays document analysis and recognition remain challenging tasks. How...
research
10/06/2022

Towards Better Semantic Understanding of Mobile Interfaces

Improving the accessibility and automation capabilities of mobile device...
research
10/09/2019

MIDV-2019: Challenges of the modern mobile-based document OCR

Recognition of identity documents using mobile devices has become a topi...
research
03/15/2019

GolfDB: A Video Database for Golf Swing Sequencing

The golf swing is a complex movement requiring considerable full-body co...
research
03/02/2010

Binarizing Business Card Images for Mobile Devices

Business card images are of multiple natures as these often contain grap...
research
11/27/2019

Methods of Weighted Combination for Text Field Recognition in a Video Stream

Due to a noticeable expansion of document recognition applicability, the...

Please sign up or login with your details

Forgot password? Click here to reset