An Image Dataset of Text Patches in Everyday Scenes

10/20/2016
by   Ahmed Ibrahim, et al.
0

This paper describes a dataset containing small images of text from everyday scenes. The purpose of the dataset is to support the development of new automated systems that can detect and analyze text. Although much research has been devoted to text detection and recognition in scanned documents, relatively little attention has been given to text detection in other types of images, such as photographs that are posted on social-media sites. This new dataset, known as COCO-Text-Patch, contains approximately 354,000 small images that are each labeled as "text" or "non-text". This dataset particularly addresses the problem of text verification, which is an essential stage in the end-to-end text detection and recognition pipeline. In order to evaluate the utility of this dataset, it has been used to train two deep convolution neural networks to distinguish text from non-text. One network is inspired by the GoogLeNet architecture, and the second one is based on CaffeNet. Accuracy levels of 90.2 and 90.9 images, source code, and deep-learning trained models described in this paper will be publicly available

READ FULL TEXT
research
03/16/2021

Digital Peter: Dataset, Competition and Handwriting Recognition Methods

This paper presents a new dataset of Peter the Great's manuscripts and d...
research
07/08/2022

Detection of Furigana Text in Images

Furigana are pronunciation notes used in Japanese writing. Being able to...
research
01/26/2016

COCO-Text: Dataset and Benchmark for Text Detection and Recognition in Natural Images

This paper describes the COCO-Text dataset. In recent years large-scale ...
research
05/13/2022

An empirical study of CTC based models for OCR of Indian languages

Recognition of text on word or line images, without the need for sub-wor...
research
08/20/2020

Detecting natural disasters, damage, and incidents in the wild

Responding to natural disasters, such as earthquakes, floods, and wildfi...
research
11/14/2016

A DNN Framework For Text Image Rectification From Planar Transformations

In this paper, a novel neural network architecture is proposed attemptin...
research
09/02/2022

Which country is this picture from? New data and methods for DNN-based country recognition

Predicting the country where a picture has been taken from has many pote...

Please sign up or login with your details

Forgot password? Click here to reset