Dewarping Document Image By Displacement Flow Estimation with Fully Convolutional Network

04/14/2021
by   Guo-Wang Xie, et al.
0

As camera-based documents are increasingly used, the rectification of distorted document images becomes a need to improve the recognition performance. In this paper, we propose a novel framework for both rectifying distorted document image and removing background finely, by estimating pixel-wise displacements using a fully convolutional network (FCN). The document image is rectified by transformation according to the displacements of pixels. The FCN is trained by regressing displacements of synthesized distorted documents, and to control the smoothness of displacements, we propose a Local Smooth Constraint (LSC) in regularization. Our approach is easy to implement and consumes moderate computing resource. Experiments proved that our approach can dewarp document images effectively under various geometric distortions, and has achieved the state-of-the-art performance in terms of local details and overall effect.

READ FULL TEXT

page 5

page 8

page 11

page 12

research
06/07/2017

Learning to Extract Semantic Structure from Documents Using Multimodal Fully Convolutional Neural Network

We present an end-to-end, multimodal, fully convolutional network for ex...
research
08/10/2017

Document Image Binarization with Fully Convolutional Neural Networks

Binarization of degraded historical manuscript images is an important pr...
research
06/08/2018

PatchFCN for Intracranial Hemorrhage Detection

This paper studies the problem of detecting acute intracranial hemorrhag...
research
03/20/2022

Document Dewarping with Control Points

Document images are now widely captured by handheld devices such as mobi...
research
03/18/2022

Fourier Document Restoration for Robust Document Dewarping and Recognition

State-of-the-art document dewarping techniques learn to predict 3-dimens...
research
05/19/2021

Light-weight Document Image Cleanup using Perceptual Loss

Smartphones have enabled effortless capturing and sharing of documents i...
research
01/26/2018

PDNet: Semantic Segmentation integrated with a Primal-Dual Network for Document binarization

Binarization of digital documents is the task of classifying each pixel ...

Please sign up or login with your details

Forgot password? Click here to reset