COCO-Text: Dataset and Benchmark for Text Detection and Recognition in Natural Images

01/26/2016
by Andreas Veit, et al.

This paper describes the COCO-Text dataset. In recent years, large-scale datasets such as SUN and ImageNet have driven advances in scene understanding and object recognition. The goal of COCO-Text is to advance the state of the art in text detection and recognition in natural images. The dataset is based on the MS COCO dataset, which contains images of complex everyday scenes. The images were not collected with text in mind and thus contain a broad variety of text instances. To reflect the diversity of text in natural scenes, we annotate text with (a) location in terms of a bounding box, (b) fine-grained classification into machine printed text and handwritten text, (c) classification into legible and illegible text, (d) script of the text, and (e) transcriptions of legible text. The dataset contains over 173k text annotations in over 63k images. We provide a statistical analysis of the accuracy of our annotations. In addition, we present an analysis of three leading state-of-the-art photo Optical Character Recognition (OCR) approaches on our dataset. While scene text detection and recognition have seen strong advances in recent years, we identify significant shortcomings that motivate future work.
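
As an illustration of the five annotation attributes (a) through (e) listed above, the following is a minimal sketch of how one text instance might be represented in Python. The class name, field names, label strings, and the (x, y, width, height) box layout are assumptions made for this example and are not taken from the dataset's actual file format or API.

```python
from dataclasses import dataclass
from typing import Optional, Tuple

# Hypothetical label strings, mirroring the two classes named in the abstract.
TEXT_CLASSES = {"machine printed", "handwritten"}


@dataclass(frozen=True)
class TextAnnotation:
    """One text instance, following attributes (a)-(e) of the abstract.

    All names here are illustrative assumptions, not the dataset's schema.
    """
    bbox: Tuple[float, float, float, float]  # (a) assumed (x, y, width, height)
    text_class: str                          # (b) machine printed or handwritten
    legible: bool                            # (c) legible vs. illegible
    script: str                              # (d) script of the text
    transcription: Optional[str] = None      # (e) only given for legible text

    def __post_init__(self) -> None:
        if self.text_class not in TEXT_CLASSES:
            raise ValueError(f"unknown text class: {self.text_class!r}")
        # The abstract states that transcriptions cover legible text only.
        if not self.legible and self.transcription is not None:
            raise ValueError("illegible text should not carry a transcription")


sign = TextAnnotation(
    bbox=(10.0, 20.0, 50.0, 12.0),
    text_class="machine printed",
    legible=True,
    script="English",
    transcription="STOP",
)
```

The validation in `__post_init__` encodes only what the abstract states: each instance belongs to one of the two named classes, and a transcription accompanies legible text.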


