CCDWT-GAN: Generative Adversarial Networks Based on Color Channel Using Discrete Wavelet Transform for Document Image Binarization

by Rui-Yang Ju, et al.

Efficiently extracting textual information from degraded color document images is an important research topic. Long-term imperfect preservation of ancient documents has led to various types of degradation, such as page staining, paper yellowing, and ink bleeding, and these degradations severely hinder image processing for information extraction. In this paper, we present CCDWT-GAN, a generative adversarial network (GAN) that applies the discrete wavelet transform (DWT) to images split into their RGB (red, green, blue) channels. The proposed method comprises three stages: image preprocessing, image enhancement, and image binarization. We conduct comparative experiments in the image preprocessing stage to determine the optimal choice of DWT with normalization. Additionally, we perform an ablation study on the results of the image enhancement and image binarization stages to validate their positive effect on model performance. We compare the proposed method with other state-of-the-art (SOTA) methods on the DIBCO and H-DIBCO ((Handwritten) Document Image Binarization Competition) datasets. The experimental results demonstrate that CCDWT-GAN achieves top-two performance on multiple benchmark datasets and outperforms other SOTA methods.
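To make the preprocessing idea concrete, the following is a minimal sketch of splitting an image into its RGB channels and applying a single-level 2D DWT with normalization to each one. The abstract does not state which wavelet, which subband, or which normalization the authors use, so the Haar wavelet, the approximation (LL) subband, and min-max scaling to [0, 255] are all assumptions made here for illustration. The function names `haar_ll`, `normalize`, and `preprocess` are hypothetical and are not taken from the paper.

```python
import numpy as np


def haar_ll(channel: np.ndarray) -> np.ndarray:
    """Approximation (LL) subband of a single-level 2D orthonormal Haar DWT.

    Assumption: the Haar wavelet is used only as the simplest example.
    """
    h, w = channel.shape
    # Crop to even dimensions so the image tiles exactly into 2x2 blocks.
    x = channel[: h - h % 2, : w - w % 2].astype(np.float64)
    # Low-pass along rows and columns: each 1D step is (a + b) / sqrt(2),
    # so the 2D result is the sum of each 2x2 block divided by 2.
    return (x[0::2, 0::2] + x[0::2, 1::2] + x[1::2, 0::2] + x[1::2, 1::2]) / 2.0


def normalize(band: np.ndarray) -> np.ndarray:
    """Min-max scale a subband to [0, 255] (assumed normalization scheme)."""
    lo, hi = band.min(), band.max()
    if hi == lo:
        # A constant subband carries no contrast; map it to zeros.
        return np.zeros_like(band)
    return (band - lo) / (hi - lo) * 255.0


def preprocess(rgb: np.ndarray) -> dict:
    """Split an H x W x 3 image into channels and transform each separately."""
    names = ("red", "green", "blue")
    return {name: normalize(haar_ll(rgb[..., i])) for i, name in enumerate(names)}


if __name__ == "__main__":
    # Synthetic 4 x 6 RGB image standing in for a scanned document.
    image = np.arange(4 * 6 * 3, dtype=np.uint8).reshape(4, 6, 3)
    for name, band in preprocess(image).items():
        print(name, band.shape, band.min(), band.max())
```

Each channel yields a half-resolution image. In a pipeline like the one described, such per-channel outputs would feed the later enhancement and binarization stages, though how they are consumed there is not specified in the abstract.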


Three-stage binarization of color document images based on discrete wavelet transform and generative adversarial networks

The efficient segmentation of foreground text information from the backg...

Two-Stage Generative Adversarial Networks for Document Image Binarization with Color Noise and Background Removal

Document image enhancement and binarization methods are often used to im...

DE-GAN: A Conditional Generative Adversarial Network for Document Enhancement

Documents often exhibit various forms of degradation, which make it hard...

Enhance to Read Better: An Improved Generative Adversarial Network for Handwritten Document Image Enhancement

Handwritten document images can be highly affected by degradation for di...

Painting Style-Aware Manga Colorization Based on Generative Adversarial Networks

Japanese comics (called manga) are traditionally created in monochrome f...

Classifying Fonts and Calligraphy Styles Using Complex Wavelet Transform

Recognizing fonts has become an important task in document analysis, due...
