
Scan, Attend and Read: End-to-End Handwritten Paragraph Recognition with MDLSTM Attention

by Théodore Bluche, et al.

We present an attention-based model for end-to-end handwriting recognition. Our system does not require any segmentation of the input paragraph. The model is inspired by the differentiable attention models presented recently for speech recognition, image captioning, and translation. The main difference is that the covert and overt attention is implemented as a multi-dimensional LSTM (MDLSTM) network. Our principal contribution to handwriting recognition is automatic transcription without prior segmentation into lines, a step that was crucial in previous approaches. To the best of our knowledge, this is the first successful attempt at end-to-end multi-line handwriting recognition. We carried out experiments on the well-known IAM Database. The results are encouraging and suggest that full paragraph transcription may be achievable in the near future.
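To make the idea of differentiable attention over an unsegmented image concrete, here is a minimal sketch of generic soft attention on a 2D grid of feature vectors. It is not the model from the paper: in the paper the attention is produced by an MDLSTM network, whereas this sketch simply takes unnormalized scores as input. The function name `soft_attention` and the toy shapes are assumptions made for illustration.

```python
import math


def soft_attention(features, scores):
    """Collapse an H x W grid of feature vectors into a single context vector.

    features: H x W x D nested lists (stand-in for an image encoder's output).
    scores:   H x W nested lists of unnormalized attention scores
              (assumed given here; the paper computes attention with an MDLSTM).
    Returns (context, attention_map), where attention_map sums to 1.
    """
    flat_scores = [s for row in scores for s in row]
    # Softmax over all grid positions, shifted by the max for numerical stability.
    peak = max(flat_scores)
    exps = [math.exp(s - peak) for s in flat_scores]
    total = sum(exps)
    weights = [e / total for e in exps]

    # Weighted sum of the feature vectors: no hard crop or line segmentation.
    flat_feats = [f for row in features for f in row]
    dim = len(flat_feats[0])
    context = [
        sum(w * f[d] for w, f in zip(weights, flat_feats)) for d in range(dim)
    ]

    # Reshape the weights back to H x W so they can be inspected as a map.
    width = len(scores[0])
    attention_map = [weights[i:i + width] for i in range(0, len(weights), width)]
    return context, attention_map


# Toy 2 x 2 grid of 2-dimensional features.
features = [[[1.0, 0.0], [0.0, 1.0]],
            [[2.0, 2.0], [4.0, 0.0]]]

# Equal scores give uniform weights, so the context is the mean feature.
uniform_context, uniform_map = soft_attention(features, [[0.0, 0.0], [0.0, 0.0]])

# One dominant score makes the context approach that position's feature.
peaked_context, peaked_map = soft_attention(features, [[0.0, 0.0], [0.0, 100.0]])
```

Because the weights come from a softmax, every step is differentiable, which is what allows a model of this kind to be trained end to end without line-level segmentation.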



Joint Line Segmentation and Transcription for End-to-End Handwritten Paragraph Recognition

Offline handwriting recognition systems require cropped text line images...

Streaming Attention-Based Models with Augmented Memory for End-to-End Speech Recognition

Attention-based models have been gaining popularity recently for their s...

Attention-Based Models for Text-Dependent Speaker Verification

Attention-based models have recently shown great performance on a range ...

SPAN: a Simple Predict Align Network for Handwritten Paragraph Recognition

Unconstrained handwriting recognition is an essential task in document a...

End-to-End Attention-based Image Captioning

In this paper, we address the problem of image captioning specifically f...

End-to-End Approach for Recognition of Historical Digit Strings

The plethora of digitalised historical document datasets released in rec...

A Comprehensive Comparison of End-to-End Approaches for Handwritten Digit String Recognition

Over the last decades, most approaches proposed for handwritten digit st...

Code Repositories


Open source implementation of Scan, Attend and Read.
