Three-stage binarization of color document images based on discrete wavelet transform and generative adversarial networks

11/29/2022
by   Yu-Shian Lin, et al.
0

The efficient segmentation of foreground text information from the background in degraded color document images is a hot research topic. Due to the imperfect preservation of ancient documents over a long period of time, various types of degradation, including staining, yellowing, and ink seepage, have seriously affected the results of image binarization. In this paper, a three-stage method is proposed for image enhancement and binarization of degraded color document images by using discrete wavelet transform (DWT) and generative adversarial network (GAN). In Stage-1, we use DWT and retain the LL subband images to achieve the image enhancement. In Stage-2, the original input image is split into four (Red, Green, Blue and Gray) single-channel images, each of which trains the independent adversarial networks. The trained adversarial network models are used to extract the color foreground information from the images. In Stage-3, in order to combine global and local features, the output image from Stage-2 and the original input image are used to train the independent adversarial networks for document binarization. The experimental results demonstrate that our proposed method outperforms many classical and state-of-the-art (SOTA) methods on the Document Image Binarization Contest (DIBCO) dataset. We release our implementation code at https://github.com/abcpp12383/ThreeStageBinarization.

READ FULL TEXT

page 4

page 6

page 8

page 13

page 15

research
05/27/2023

CCDWT-GAN: Generative Adversarial Networks Based on Color Channel Using Discrete Wavelet Transform for Document Image Binarization

To efficiently extract the textual information from color degraded docum...
research
10/20/2020

Two-Stage Generative Adversarial Networks for Document Image Binarization with Color Noise and Background Removal

Document image enhancement and binarization methods are often used to im...
research
09/19/2019

Deeply Matting-based Dual Generative Adversarial Network for Image and Document Label Supervision

Although many methods have been proposed to deal with nature image super...
research
05/26/2021

Enhance to Read Better: An Improved Generative Adversarial Network for Handwritten Document Image Enhancement

Handwritten document images can be highly affected by degradation for di...
research
04/11/2021

SIGAN: A Novel Image Generation Method for Solar Cell Defect Segmentation and Augmentation

Solar cell electroluminescence (EL) defect segmentation is an interestin...
research
01/18/2019

DeepOtsu: Document Enhancement and Binarization using Iterative Deep Learning

This paper presents a novel iterative deep learning framework and apply ...

Please sign up or login with your details

Forgot password? Click here to reset