Calamari - A High-Performance Tensorflow-based Deep Learning Package for Optical Character Recognition

07/05/2018
by Christoph Wick, et al.

Optical Character Recognition (OCR) on contemporary and historical data is still in the focus of many researchers. Historical prints in particular require book-specific trained OCR models to achieve applicable results (Springmann and Lüdeling, 2016; Reul et al., 2017a). To reduce the human effort of manually annotating ground truth (GT), various techniques such as voting and pretraining have been shown to be very efficient (Reul et al., 2018a; Reul et al., 2018b). Calamari is a new open-source OCR line recognition software that both uses state-of-the-art Deep Neural Networks (DNNs) implemented in TensorFlow and gives native support for techniques such as pretraining and voting. The customizable network architectures, constructed of Convolutional Neural Network (CNN) and Long Short-Term Memory (LSTM) layers, are trained by the so-called Connectionist Temporal Classification (CTC) algorithm of Graves et al. (2006). Optional usage of a GPU drastically reduces the computation times for both training and prediction. We use two different datasets to compare the performance of Calamari to OCRopy, OCRopus3, and Tesseract 4. Calamari reaches a Character Error Rate (CER) of 0.11% on the UW3 dataset written in modern English and 0.18% on the DTA19 dataset written in German Fraktur, which considerably outperforms the results of the existing software.
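
The Character Error Rate reported above is commonly defined as the edit (Levenshtein) distance between the recognized text and the ground truth, divided by the length of the ground truth. The following is a minimal sketch of that common definition, not Calamari's own evaluation code; the function names `edit_distance` and `character_error_rate` are hypothetical and chosen only for illustration.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: minimum number of single-character
    insertions, deletions, and substitutions turning a into b."""
    # prev[j] holds the distance between the processed prefix of a and b[:j].
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]  # distance from a[:i] to the empty string
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(
                prev[j] + 1,         # delete ca
                curr[j - 1] + 1,     # insert cb
                prev[j - 1] + cost,  # substitute (or match)
            ))
        prev = curr
    return prev[-1]


def character_error_rate(ground_truth: str, prediction: str) -> float:
    """CER as a fraction of the ground-truth length (assumed definition).
    Multiply by 100 to obtain a percentage."""
    if not ground_truth:
        raise ValueError("ground truth must not be empty")
    return edit_distance(ground_truth, prediction) / len(ground_truth)


if __name__ == "__main__":
    # One substituted character in an eight-character line.
    print(character_error_rate("calamari", "calamary"))
```

Under this definition, a CER of 0.11% corresponds to roughly one character error per 900 ground-truth characters.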


