A limited-size ensemble of homogeneous CNN/LSTMs for high-performance word classification

12/06/2019
by   Mahya Ameryan, et al.
0

In recent years, long short-term memory neural networks (LSTMs) have been applied quite successfully to problems in handwritten text recognition. However, their strength is more located in handling sequences of variable length than in handling geometric variability of the image patterns. Furthermore, the best results for LSTMs are often based on large-scale training of an ensemble of network instances. In this paper, an end-to-end convolutional LSTM Neural Network is used to handle both geometric variation and sequence variability. We show that high performances can be reached on a common benchmark set by using proper data augmentation for just five such networks using a proper coding scheme and a proper voting scheme. The networks have similar architectures (Convolutional Neural Network (CNN): five layers, bidirectional LSTM (BiLSTM): three layers followed by a connectionist temporal classification (CTC) processing step). The approach assumes differently-scaled input images and different feature map sizes. Two datasets are used for evaluation of the performance of our algorithm: A standard benchmark RIMES dataset (French), and a historical handwritten dataset KdK (Dutch). Final performance obtained for the word-recognition test of RIMES was 96.6 improvement over other state-of-the-art approaches. On the KdK dataset, our approach also shows good results. The proposed approach is deployed in the Monk search engine for historical-handwriting collections.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/18/2019

Effects of padding on LSTMs and CNNs

Long Short-Term Memory (LSTM) Networks and Convolutional Neural Networks...
research
10/01/2019

A Computationally Efficient Pipeline Approach to Full Page Offline Handwritten Text Recognition

Offline handwriting recognition with deep neural networks is usually lim...
research
01/03/2020

Question Type Classification Methods Comparison

The paper presents a comparative study of state-of-the-art approaches fo...
research
01/04/2022

HWRCNet: Handwritten Word Recognition in JPEG Compressed Domain using CNN-BiLSTM Network

The handwritten word recognition from images using deep learning is an a...
research
11/17/2019

The Proper Care and Feeding of CAMELS: How Limited Training Data Affects Streamflow Prediction

Accurate streamflow prediction largely relies on historical records of b...
research
12/10/2021

An Ensemble 1D-CNN-LSTM-GRU Model with Data Augmentation for Speech Emotion Recognition

In this paper, we propose an ensemble of deep neural networks along with...
research
07/16/2018

Longitudinal detection of radiological abnormalities with time-modulated LSTM

Convolutional neural networks (CNNs) have been successfully employed in ...

Please sign up or login with your details

Forgot password? Click here to reset