Benchmarking recognition results on word image datasets

08/30/2012
by   Deepak Kumar, et al.
0

We have benchmarked the maximum obtainable recognition accuracy on various word image datasets using manual segmentation and a currently available commercial OCR. We have developed a Matlab program, with graphical user interface, for semi-automated pixel level segmentation of word images. We discuss the advantages of pixel level annotation. We have covered five databases adding up to over 3600 word images. These word images have been cropped from camera captured scene, born-digital and street view images. We recognize the segmented word image using the trial version of Nuance Omnipage OCR. We also discuss, how the degradations introduced during acquisition or inaccuracies introduced during creation of word images affect the recognition of the word present in the image. Word images for different kinds of degradations and correction for slant and curvy nature of words are also discussed. The word recognition rates obtained on ICDAR 2003, Sign evaluation, Street view, Born-digital and ICDAR 2011 datasets are 83.9 88.5

READ FULL TEXT
research
06/27/2019

Adversarial Pixel-Level Generation of Semantic Images

Generative Adversarial Networks (GANs) have obtained extraordinary succe...
research
04/09/2021

Benchmarking Scene Text Recognition in Devanagari, Telugu and Malayalam

Inspired by the success of Deep Learning based approaches to English sce...
research
09/11/2023

Towards Content-based Pixel Retrieval in Revisited Oxford and Paris

This paper introduces the first two pixel retrieval benchmarks. Pixel re...
research
06/30/2021

Word-level Sign Language Recognition with Multi-stream Neural Networks Focusing on Local Regions

In recent years, Word-level Sign Language Recognition (WSLR) research ha...
research
04/25/2021

A novel segmentation dataset for signatures on bank checks

The dataset presented provides high-resolution images of real, filled ou...
research
10/01/2020

Joint Persian Word Segmentation Correction and Zero-Width Non-Joiner Recognition Using BERT

Words are properly segmented in the Persian writing system; in practice,...
research
10/07/2021

Design of an Intelligent Vision Algorithm for Recognition and Classification of Apples in an Orchard Scene

Apple is one of the remarkable fresh fruit that contains a high degree o...

Please sign up or login with your details

Forgot password? Click here to reset