Offline Detection of Misspelled Handwritten Words by Convolving Recognition Model Features with Text Labels

09/18/2023
by   Andrey Totev, et al.
0

Offline handwriting recognition (HWR) has improved significantly with the advent of deep learning architectures in recent years. Nevertheless, it remains a challenging problem and practical applications often rely on post-processing techniques for restricting the predicted words via lexicons or language models. Despite their enhanced performance, such systems are less usable in contexts where out-of-vocabulary words are anticipated, e.g. for detecting misspelled words in school assessments. To that end, we introduce the task of comparing a handwriting image to text. To solve the problem, we propose an unrestricted binary classifier, consisting of a HWR feature extractor and a multimodal classification head which convolves the feature extractor output with the vector representation of the input text. Our model's classification head is trained entirely on synthetic data created using a state-of-the-art generative adversarial network. We demonstrate that, while maintaining high recall, the classifier can be calibrated to achieve an average precision increase of 19.5 compared to addressing the task by directly using state-of-the-art HWR models. Such massive performance gains can lead to significant productivity increases in applications utilizing human-in-the-loop automation.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/31/2019

Synthetic Image Augmentation for Improved Classification using Generative Adversarial Networks

Object detection and recognition has been an ongoing research topic for ...
research
11/23/2020

Using Synthetic Audio to Improve The Recognition of Out-Of-Vocabulary Words in End-To-End ASR Systems

Today, many state-of-the-art automatic speech recognition (ASR) systems ...
research
03/01/2019

Adversarial Generation of Handwritten Text Images Conditioned on Sequences

State-of-the-art offline handwriting text recognition systems tend to us...
research
10/24/2020

Discriminative feature generation for classification of imbalanced data

The data imbalance problem is a frequent bottleneck in the classificatio...
research
10/22/2022

A Visual Tour Of Current Challenges In Multimodal Language Models

Transformer models trained on massive text corpora have become the de fa...
research
05/26/2020

Pay Attention to What You Read: Non-recurrent Handwritten Text-Line Recognition

The advent of recurrent neural networks for handwriting recognition mark...
research
04/25/2023

AI-Synthesized Voice Detection Using Neural Vocoder Artifacts

Advancements in AI-synthesized human voices have created a growing threa...

Please sign up or login with your details

Forgot password? Click here to reset