Boosting Modern and Historical Handwritten Text Recognition with Deformable Convolutions

08/17/2022
by   Silvia Cascianelli, et al.
0

Handwritten Text Recognition (HTR) in free-layout pages is a challenging image understanding task that can provide a relevant boost to the digitization of handwritten documents and reuse of their content. The task becomes even more challenging when dealing with historical documents due to the variability of the writing style and degradation of the page quality. State-of-the-art HTR approaches typically couple recurrent structures for sequence modeling with Convolutional Neural Networks for visual feature extraction. Since convolutional kernels are defined on fixed grids and focus on all input pixels independently while moving over the input image, this strategy disregards the fact that handwritten characters can vary in shape, scale, and orientation even within the same document and that the ink pixels are more relevant than the background ones. To cope with these specific HTR difficulties, we propose to adopt deformable convolutions, which can deform depending on the input at hand and better adapt to the geometric variations of the text. We design two deformable architectures and conduct extensive experiments on both modern and historical datasets. Experimental results confirm the suitability of deformable convolutions for the HTR task.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/02/2022

DARE: A large-scale handwritten date recognition system

Handwritten text recognition for historical documents is an important ta...
research
03/11/2021

Full Page Handwriting Recognition via Image to Sequence Extraction

We present a Neural Network based Handwritten Text Recognition (HTR) mod...
research
05/26/2016

CITlab ARGUS for historical handwritten documents

We describe CITlab's recognition system for the HTRtS competition attach...
research
09/30/2022

Towards End-to-end Handwritten Document Recognition

Handwritten text recognition has been widely studied in the last decades...
research
11/13/2019

BiNet: Degraded-Manuscript Binarization in Diverse Document Textures and Layouts using Deep Encoder-Decoder Networks

Handwritten document-image binarization is a semantic segmentation proce...
research
12/26/2021

Continuous Offline Handwriting Recognition using Deep Learning Models

Handwritten text recognition is an open problem of great interest in the...
research
08/21/2021

Palmira: A Deep Deformable Network for Instance Segmentation of Dense and Uneven Layouts in Handwritten Manuscripts

Handwritten documents are often characterized by dense and uneven layout...

Please sign up or login with your details

Forgot password? Click here to reset