SPAN: a Simple Predict Align Network for Handwritten Paragraph Recognition

02/17/2021
by   Denis Coquenet, et al.
0

Unconstrained handwriting recognition is an essential task in document analysis. It is usually carried out in two steps. First, the document is segmented into text lines. Second, an Optical Character Recognition model is applied on these line images. We propose the Simple Predict Align Network: an end-to-end recurrence-free Fully Convolutional Network performing OCR at paragraph level without any prior segmentation stage. The framework is as simple as the one used for the recognition of isolated lines and we achieve competitive results on three popular datasets: RIMES, IAM and READ 2016. The proposed model does not require any dataset adaptation, it can be trained from scratch, without segmentation labels, and it does not require line breaks in the transcription labels. Our code and trained model weights are available at https://github.com/FactoDeepLearning/SPAN.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/07/2020

End-to-end Handwritten Paragraph Text Recognition Using a Vertical Attention Network

Unconstrained handwritten text recognition remains challenging for compu...
research
03/23/2022

DAN: a Segmentation-free Document Attention Network for Handwritten Document Recognition

Unconstrained handwritten document recognition is a challenging computer...
research
01/25/2023

Faster DAN: Multi-target Queries with Document Positional Encoding for End-to-end Handwritten Document Recognition

Recent advances in handwritten text recognition enabled to recognize who...
research
05/24/2021

TRACE: A Differentiable Approach to Line-level Stroke Recovery for Offline Handwritten Text

Stroke order and velocity are helpful features in the fields of signatur...
research
12/09/2020

Recurrence-free unconstrained handwritten text recognition using gated fully convolutional network

Unconstrained handwritten text recognition is a major step in most docum...
research
04/12/2016

Scan, Attend and Read: End-to-End Handwritten Paragraph Recognition with MDLSTM Attention

We present an attention-based model for end-to-end handwriting recogniti...
research
06/12/2020

OrigamiNet: Weakly-Supervised, Segmentation-Free, One-Step, Full Page Text Recognition by learning to unfold

Text recognition is a major computer vision task with a big set of assoc...

Please sign up or login with your details

Forgot password? Click here to reset