Two-Stage Generative Adversarial Networks for Document Image Binarization with Color Noise and Background Removal

10/20/2020
by   Sungho Suh, et al.
3

Document image enhancement and binarization methods are often used to improve the accuracy and efficiency of document image analysis tasks such as text recognition. Traditional non-machine-learning methods are constructed on low-level features in an unsupervised manner but have difficulty with binarization on documents with severely degraded backgrounds. Convolutional neural network-based methods focus only on grayscale images and on local textual features. In this paper, we propose a two-stage color document image enhancement and binarization method using generative adversarial neural networks. In the first stage, four color-independent adversarial networks are trained to extract color foreground information from an input image for document image enhancement. In the second stage, two independent adversarial networks with global and local features are trained for image binarization of documents of variable size. For the adversarial neural networks, we formulate loss functions between a discriminator and generators having an encoder-decoder structure. Experimental results show that the proposed method achieves better performance than many classical and state-of-the-art algorithms over the Document Image Binarization Contest (DIBCO) datasets, the LRDE Document Binarization Dataset (LRDE DBD), and our shipping label image dataset.

READ FULL TEXT

page 2

page 4

page 10

page 12

research
11/29/2022

Three-stage binarization of color document images based on discrete wavelet transform and generative adversarial networks

The efficient segmentation of foreground text information from the backg...
research
09/13/2022

Document Image Binarization in JPEG Compressed Domain using Dual Discriminator Generative Adversarial Networks

Image binarization techniques are being popularly used in enhancement of...
research
01/18/2019

DeepOtsu: Document Enhancement and Binarization using Iterative Deep Learning

This paper presents a novel iterative deep learning framework and apply ...
research
05/27/2023

CCDWT-GAN: Generative Adversarial Networks Based on Color Channel Using Discrete Wavelet Transform for Document Image Binarization

To efficiently extract the textual information from color degraded docum...
research
01/30/2018

Predicting Rapid Fire Growth (Flashover) Using Conditional Generative Adversarial Networks

A flashover occurs when a fire spreads very rapidly through crevices due...
research
10/17/2020

DE-GAN: A Conditional Generative Adversarial Network for Document Enhancement

Documents often exhibit various forms of degradation, which make it hard...
research
09/19/2019

Deeply Matting-based Dual Generative Adversarial Network for Image and Document Label Supervision

Although many methods have been proposed to deal with nature image super...

Please sign up or login with your details

Forgot password? Click here to reset