Efficient, Lexicon-Free OCR using Deep Learning

06/05/2019
by   Marcin Namysl, et al.
0

Contrary to popular belief, Optical Character Recognition (OCR) remains a challenging problem when text occurs in unconstrained environments, like natural scenes, due to geometrical distortions, complex backgrounds, and diverse fonts. In this paper, we present a segmentation-free OCR system that combines deep learning methods, synthetic training data generation, and data augmentation techniques. We render synthetic training data using large text corpora and over 2000 fonts. To simulate text occurring in complex natural scenes, we augment extracted samples with geometric distortions and with a proposed data augmentation technique - alpha-compositing with background textures. Our models employ a convolutional neural network encoder to extract features from text images. Inspired by the recent progress in neural machine translation and language modeling, we examine the capabilities of both recurrent and convolutional neural networks in modeling the interactions between input elements.

READ FULL TEXT
research
07/26/2023

Data Augmentation for Neural Machine Translation using Generative Language Model

Despite the rapid growth in model architecture, the scarcity of large pa...
research
04/01/2022

CipherDAug: Ciphertext based Data Augmentation for Neural Machine Translation

We propose a novel data-augmentation technique for neural machine transl...
research
08/18/2020

EASTER: Efficient and Scalable Text Recognizer

Recent progress in deep learning has led to the development of Optical C...
research
12/07/2020

GenScan: A Generative Method for Populating Parametric 3D Scan Datasets

The availability of rich 3D datasets corresponding to the geometrical co...
research
09/20/2015

Telugu OCR Framework using Deep Learning

In this paper, we address the task of Optical Character Recognition(OCR)...
research
01/14/2022

ExtraPhrase: Efficient Data Augmentation for Abstractive Summarization

Neural models trained with large amount of parallel data have achieved i...
research
01/10/2017

Deep Learning for Logo Recognition

In this paper we propose a method for logo recognition using deep learni...

Please sign up or login with your details

Forgot password? Click here to reset