AttentionHTR: Handwritten Text Recognition Based on Attention Encoder-Decoder Networks

01/23/2022
by   Dmitrijs Kass, et al.
22

This work proposes an attention-based sequence-to-sequence model for handwritten word recognition and explores transfer learning for data-efficient training of HTR systems. To overcome training data scarcity, this work leverages models pre-trained on scene text images as a starting point towards tailoring the handwriting recognition models. ResNet feature extraction and bidirectional LSTM-based sequence modeling stages together form an encoder. The prediction stage consists of a decoder and a content-based attention mechanism. The effectiveness of the proposed end-to-end HTR system has been empirically evaluated on a novel multi-writer dataset Imgur5K and the IAM dataset. The experimental results evaluate the performance of the HTR framework, further supported by an in-depth analysis of the error cases. Source code and pre-trained models are available at https://github.com/dmitrijsk/AttentionHTR.

READ FULL TEXT
research
07/03/2022

M-Adapter: Modality Adaptation for End-to-End Speech-to-Text Translation

End-to-end speech-to-text translation models are often initialized with ...
research
08/12/2020

Attention-based Fully Gated CNN-BGRU for Russian Handwritten Text

This research approaches the task of handwritten text with attention enc...
research
01/21/2021

DAF:re: A Challenging, Crowd-Sourced, Large-Scale, Long-Tailed Dataset For Anime Character Recognition

In this work we tackle the challenging problem of anime character recogn...
research
08/16/2023

High-Fidelity Lake Extraction via Two-Stage Prompt Enhancement: Establishing a Novel Baseline and Benchmark

The extraction of lakes from remote sensing images is a complex challeng...
research
08/14/2023

A Novel Ehanced Move Recognition Algorithm Based on Pre-trained Models with Positional Embeddings

The recognition of abstracts is crucial for effectively locating the con...
research
04/23/2017

Deep Keyphrase Generation

Keyphrase provides highly-summative information that can be effectively ...
research
12/23/2020

ConvMath: A Convolutional Sequence Network for Mathematical Expression Recognition

Despite the recent advances in optical character recognition (OCR), math...

Please sign up or login with your details

Forgot password? Click here to reset