A Gaussian Process Upsampling Model for Improvements in Optical Character Recognition

05/07/2020
by   Steven I Reeves, et al.
9

Optical Character Recognition and extraction is a key tool in the automatic evaluation of documents in a financial context. However, the image data provided to automated systems can have unreliable quality, and can be inherently low-resolution or downsampled and compressed by a transmitting program. In this paper, we illustrate the efficacy of a Gaussian Process upsampling model for the purposes of improving OCR and extraction through upsampling low resolution documents.

READ FULL TEXT
research
05/10/2021

An end-to-end Optical Character Recognition approach for ultra-low-resolution printed text images

Some historical and more recent printed documents have been scanned or s...
research
04/21/2017

Scatteract: Automated extraction of data from scatter plots

Charts are an excellent way to convey patterns and trends in data, but t...
research
10/05/2022

Intelligent Information Retrieval: Techniques for Character Recognition and Structured Data Extraction

The day-to-day activities of every corporation in-volve working with a h...
research
05/20/2019

Naive probability

We describe a rational, but low resolution model of probability....
research
09/10/2020

OCR Graph Features for Manipulation Detection in Documents

Detecting manipulations in digital documents is becoming increasingly im...
research
03/18/2020

Constraints in Developing a Complete Bengali Optical Character Recognition System

Technological advancement has led to digitizing hard copies of media eff...
research
02/26/2023

User-Centric Evaluation of OCR Systems for Kwak'wala

There has been recent interest in improving optical character recognitio...

Please sign up or login with your details

Forgot password? Click here to reset