Light-weight Document Image Cleanup using Perceptual Loss

05/19/2021
by   Soumyadeep Dey, et al.
0

Smartphones have enabled effortless capturing and sharing of documents in digital form. The documents, however, often undergo various types of degradation due to aging, stains, or shortcoming of capturing environment such as shadow, non-uniform lighting, etc., which reduces the comprehensibility of the document images. In this work, we consider the problem of document image cleanup on embedded applications such as smartphone apps, which usually have memory, energy, and latency limitations due to the device and/or for best human user experience. We propose a light-weight encoder decoder based convolutional neural network architecture for removing the noisy elements from document images. To compensate for generalization performance with a low network capacity, we incorporate the perceptual loss for knowledge transfer from pre-trained deep CNN network in our loss function. In terms of the number of parameters and product-sum operations, our models are 65-1030 and 3-27 times, respectively, smaller than existing state-of-the-art document enhancement models. Overall, the proposed models offer a favorable resource versus accuracy trade-off and we empirically illustrate the efficacy of our approach on several real-world benchmark datasets.

READ FULL TEXT

page 2

page 4

page 9

research
07/31/2020

Exploring Image Enhancement for Salient Object Detection in Low Light Images

Low light images captured in a non-uniform illumination environment usua...
research
11/13/2019

BiNet: Degraded-Manuscript Binarization in Diverse Document Textures and Layouts using Deep Encoder-Decoder Networks

Handwritten document-image binarization is a semantic segmentation proce...
research
04/14/2021

Dewarping Document Image By Displacement Flow Estimation with Fully Convolutional Network

As camera-based documents are increasingly used, the rectification of di...
research
06/18/2021

Advanced Hough-based method for on-device document localization

The demand for on-device document recognition systems increases in conju...
research
08/17/2018

First Steps Toward CNN based Source Classification of Document Images Shared Over Messaging App

Knowledge of source smartphone corresponding to a document image can be ...
research
12/14/2020

DSM Refinement with Deep Encoder-Decoder Networks

3D city models can be generated from aerial images. However, the calcula...
research
01/05/2021

Domain Generalization for Document Authentication against Practical Recapturing Attacks

Recapturing attack can be employed as a simple but effective anti-forens...

Please sign up or login with your details

Forgot password? Click here to reset