OCR accuracy improvement on document images through a novel pre-processing approach

09/11/2015
by Abdeslam El Harraj, et al.

Document image acquisition with digital cameras and mobile devices is a growing trend in Optical Character Recognition (OCR) and text detection. In some cases, this acquisition process introduces many distortions and produces poorly scanned text, mixed text-photo images, or natural images, which leads to unreliable OCR digitization. In this paper, we present a novel nonparametric and unsupervised method that compensates for undesirable document image distortions in order to improve OCR accuracy. Our approach relies on an efficient stack of document image enhancement techniques to recover from deformations across the entire document image. First, we propose a local brightness and contrast adjustment method to handle lighting variations and the irregular distribution of image illumination. Second, we use an optimized greyscale conversion algorithm to convert the document image to greyscale. Third, we sharpen the useful information in the resulting greyscale image using unsharp masking. Finally, an optimal global binarization approach prepares the final document image for OCR. The proposed approach can significantly improve the text detection rate and OCR accuracy. To demonstrate the efficiency of our approach, we present extensive experiments on a standard dataset.
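
The last three stages of the pipeline described above (greyscale conversion, unsharp masking, global binarization) can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation: the abstract does not specify the algorithms, so the sketch assumes BT.601 luminance weights for greyscale conversion, a 3x3 box blur in place of a Gaussian for the unsharp mask, and Otsu's method as the global threshold. The local brightness and contrast adjustment step is omitted because the abstract gives no detail on it, and all function names are hypothetical.

```python
def to_grey(rgb):
    """Convert rows of (r, g, b) tuples to 0-255 grey levels.
    Assumption: BT.601 luminance weights, not the paper's optimized conversion."""
    return [[int(round(0.299 * r + 0.587 * g + 0.114 * b)) for r, g, b in row]
            for row in rgb]


def box_blur(img):
    """3x3 mean blur with edge replication (a stand-in for a Gaussian blur)."""
    h, w = len(img), len(img[0])
    out = []
    for y in range(h):
        row = []
        for x in range(w):
            total = 0
            for dy in (-1, 0, 1):
                for dx in (-1, 0, 1):
                    yy = min(max(y + dy, 0), h - 1)
                    xx = min(max(x + dx, 0), w - 1)
                    total += img[yy][xx]
            row.append(total / 9.0)
        out.append(row)
    return out


def unsharp_mask(img, amount=1.0):
    """Sharpen by adding back the difference between the image and its blur."""
    blurred = box_blur(img)
    return [[min(255, max(0, int(round(p + amount * (p - b)))))
             for p, b in zip(row, blurred_row)]
            for row, blurred_row in zip(img, blurred)]


def otsu_threshold(img):
    """Pick the grey level that maximizes between-class variance.
    Assumption: Otsu's method stands in for the paper's global binarization."""
    hist = [0] * 256
    for row in img:
        for p in row:
            hist[p] += 1
    total = sum(hist)
    sum_all = sum(i * hist[i] for i in range(256))
    best_t, best_var = 0, -1.0
    w0, sum0 = 0, 0
    for t in range(256):
        w0 += hist[t]          # pixels at or below t
        if w0 == 0:
            continue
        w1 = total - w0        # pixels above t
        if w1 == 0:
            break
        sum0 += t * hist[t]
        m0 = sum0 / w0
        m1 = (sum_all - sum0) / w1
        var = w0 * w1 * (m0 - m1) ** 2
        if var > best_var:
            best_var, best_t = var, t
    return best_t


def binarize(img):
    """Map pixels above the Otsu threshold to 255 (background), others to 0."""
    t = otsu_threshold(img)
    return [[255 if p > t else 0 for p in row] for row in img]


def preprocess(rgb):
    """Greyscale -> unsharp mask -> global binarization."""
    return binarize(unsharp_mask(to_grey(rgb)))
```

Calling `preprocess` on a nested list of RGB tuples returns a same-sized grid of 0 and 255 values that could then be passed to an OCR engine. For real images, a library such as OpenCV or scikit-image would be a more practical choice than nested lists.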


research
12/25/2019

DDI-100: Dataset for Text Detection and Recognition

Nowadays document analysis and recognition remain challenging tasks. How...
research
11/14/2019

Character Keypoint-based Homography Estimation in Scanned Documents for Efficient Information Extraction

Precise homography estimation between multiple images is a pre-requisite...
research
05/17/2021

Unknown-box Approximation to Improve Optical Character Recognition Performance

Optical character recognition (OCR) is a widely used pattern recognition...
research
12/05/2019

A Document Skew Detection Method Using Fast Hough Transform

The majority of document image analysis systems use a document skew dete...
research
07/28/2017

FontCode: Embedding Information in Text Documents using Glyph Perturbation

We introduce FontCode, an information embedding technique for text docum...
research
04/17/2020

Image Processing Based Scene-Text Detection and Recognition with Tesseract

Text Recognition is one of the challenging tasks of computer vision with...
research
01/30/2015

An Analytical Study of different Document Image Binarization Methods

Document image has been the area of research for a couple of decades bec...
