Enhance to Read Better: An Improved Generative Adversarial Network for Handwritten Document Image Enhancement

05/26/2021
by Sana Khamekhem Jemni, et al.

Handwritten document images can be heavily degraded for many reasons: paper ageing, daily-life wear (wrinkles, dust, etc.), poor scanning, and so on. These artifacts raise many readability issues for current Handwritten Text Recognition (HTR) algorithms and severely degrade their performance. In this paper, we propose an end-to-end architecture based on Generative Adversarial Networks (GANs) to recover degraded documents into a clean and readable form. Unlike most well-known document binarization methods, which try to improve only the visual quality of the degraded document, the proposed architecture integrates a handwritten text recognizer that encourages the generated document image to be more readable. To the best of our knowledge, this is the first work to use text information while binarizing handwritten documents. Extensive experiments on degraded Arabic and Latin handwritten documents demonstrate the usefulness of integrating the recognizer within the GAN architecture, which improves both the visual quality and the readability of the degraded document images. Moreover, after fine-tuning our pre-trained model with synthetically degraded Latin handwritten images, we outperform the state of the art on the H-DIBCO 2018 challenge.
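
The idea of letting a recognizer guide the generator can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not give the loss formulation, so the function names, the weighting parameter `rec_weight`, and the use of binary cross-entropy for the adversarial term are all assumptions made for illustration. The sketch only shows how a generator objective might add a recognition penalty to the usual adversarial term, so that an enhanced image is rewarded both for fooling the discriminator and for being readable.

```python
import math


def bce(prob, target):
    """Binary cross-entropy for a single probability, clamped for stability."""
    eps = 1e-12
    prob = min(max(prob, eps), 1.0 - eps)
    return -(target * math.log(prob) + (1.0 - target) * math.log(1.0 - prob))


def generator_loss(disc_prob_on_fake, recognition_loss, rec_weight=1.0):
    """Hypothetical combined generator objective (an assumption, not the paper's).

    disc_prob_on_fake: discriminator's probability that the enhanced image is clean.
    recognition_loss:  loss of a text recognizer on the enhanced image
                       (for example a CTC loss; taken as a given number here).
    rec_weight:        illustrative trade-off between the two terms.
    """
    # The generator wants the discriminator to label its output as real (1.0).
    adversarial = bce(disc_prob_on_fake, 1.0)
    # The recognition term penalizes outputs the recognizer cannot read.
    return adversarial + rec_weight * recognition_loss
```

Under these assumptions, two outputs that fool the discriminator equally well are ranked by how readable the recognizer finds them: `generator_loss(0.5, 2.0)` is larger than `generator_loss(0.5, 0.5)`.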

research
10/17/2020

DE-GAN: A Conditional Generative Adversarial Network for Document Enhancement

Documents often exhibit various forms of degradation, which make it hard...
research
03/01/2019

Adversarial Generation of Handwritten Text Images Conditioned on Sequences

State-of-the-art offline handwriting text recognition systems tend to us...
research
10/11/2019

Illegible Text to Readable Text: An Image-to-Image Transformation using Conditional Sliced Wasserstein Adversarial Networks

Automatic text recognition from ancient handwritten record images is an ...
research
05/27/2023

CCDWT-GAN: Generative Adversarial Networks Based on Color Channel Using Discrete Wavelet Transform for Document Image Binarization

To efficiently extract the textual information from color degraded docum...
research
01/25/2022

DocEnTr: An End-to-End Document Image Enhancement Transformer

Document images can be affected by many degradation scenarios, which cau...
research
11/09/2012

Localisation of Numerical Date Field in an Indian Handwritten Document

This paper describes a method to localise all those areas which may cons...
research
11/29/2022

Three-stage binarization of color document images based on discrete wavelet transform and generative adversarial networks

The efficient segmentation of foreground text information from the backg...
