Learning to Clean: A GAN Perspective

01/28/2019
by   Monika Sharma, et al.
0

In the big data era, the impetus to digitize the vast reservoirs of data trapped in unstructured scanned documents such as invoices, bank documents and courier receipts has gained fresh momentum. The scanning process often results in the introduction of artifacts such as background noise, blur due to camera motion, watermarkings, coffee stains, or faded text. These artifacts pose many readability challenges to current text recognition algorithms and significantly degrade their performance. Existing learning based denoising techniques require a dataset comprising of noisy documents paired with cleaned versions. In such scenarios, a model can be trained to generate clean documents from noisy versions. However, very often in the real world such a paired dataset is not available, and all we have for training our denoising model are unpaired sets of noisy and clean images. This paper explores the use of GANs to generate denoised versions of the noisy documents. In particular, where paired information is available, we formulate the problem as an image-to-image translation task i.e, translating a document from noisy domain ( i.e., background noise, blurred, faded, watermarked ) to a target clean document using Generative Adversarial Networks (GAN). However, in the absence of paired images for training, we employed CycleGAN which is known to learn a mapping between the distributions of the noisy images to the denoised images using unpaired data to achieve image-to-image translation for cleaning the noisy documents. We compare the performance of CycleGAN for document cleaning tasks using unpaired images with a Conditional GAN trained on paired data from the same dataset. Experiments were performed on a public document dataset on which different types of noise were artificially induced, results demonstrate that CycleGAN learns a more robust mapping from the space of noisy to clean documents.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/03/2019

Noise2Blur: Online Noise Extraction and Denoising

We propose a new framework called Noise2Blur (N2B) for training robust i...
research
08/25/2021

Automatic Feature Highlighting in Noisy RES Data With CycleGAN

Radio echo sounding (RES) is a common technique used in subsurface glaci...
research
11/28/2018

Cartoon-to-real: An Approach to Translate Cartoon to Realistic Images using GAN

We propose a method to translate cartoon images to real world images usi...
research
05/06/2019

Label-Noise Robust Multi-Domain Image-to-Image Translation

Multi-domain image-to-image translation is a problem where the goal is t...
research
11/30/2020

Adaptive noise imitation for image denoising

The effectiveness of existing denoising algorithms typically relies on a...
research
02/08/2020

Bone Suppression on Chest Radiographs With Adversarial Learning

Dual-energy (DE) chest radiography provides the capability of selectivel...
research
10/25/2018

Improving Document Binarization via Adversarial Noise-Texture Augmentation

Binarization of degraded document images is an elementary step in most o...

Please sign up or login with your details

Forgot password? Click here to reset