Fourier Document Restoration for Robust Document Dewarping and Recognition

03/18/2022
by   Chuhui Xue, et al.
0

State-of-the-art document dewarping techniques learn to predict 3-dimensional information of documents which are prone to errors while dealing with documents with irregular distortions or large variations in depth. This paper presents FDRNet, a Fourier Document Restoration Network that can restore documents with different distortions and improve document recognition in a reliable and simpler manner. FDRNet focuses on high-frequency components in the Fourier space that capture most structural information but are largely free of degradation in appearance. It dewarps documents by a flexible Thin-Plate Spline transformation which can handle various deformations effectively without requiring deformation annotations in training. These features allow FDRNet to learn from a small amount of simply labeled training images, and the learned model can dewarp documents with complex geometric distortion and recognize the restored texts accurately. To facilitate document restoration research, we create a benchmark dataset consisting of over one thousand camera documents with different types of geometric and photometric distortion. Extensive experiments show that FDRNet outperforms the state-of-the-art by large margins on both dewarping and text recognition tasks. In addition, FDRNet requires a small amount of simply labeled training data and is easy to deploy.

READ FULL TEXT

page 1

page 3

page 4

page 5

page 7

research
10/28/2021

DocScanner: Robust Document Image Rectification with Progressive Learning

Compared to flatbed scanners, portable smartphones are much more conveni...
research
12/16/2022

Geometric Rectification of Creased Document Images based on Isometric Mapping

Geometric rectification of images of distorted documents finds wide appl...
research
04/14/2021

Dewarping Document Image By Displacement Flow Estimation with Fully Convolutional Network

As camera-based documents are increasingly used, the rectification of di...
research
08/06/2021

Lights, Camera, Action! A Framework to Improve NLP Accuracy over OCR documents

Document digitization is essential for the digital transformation of our...
research
02/01/2019

Dating Documents using Graph Convolution Networks

Document date is essential for many important tasks, such as document re...
research
06/20/2017

Passive Classification of Source Printer using Text-line-level Geometric Distortion Signatures from Scanned Images of Printed Documents

In this digital era, one thing that still holds the convention is a prin...
research
10/03/2022

EraseNet: A Recurrent Residual Network for Supervised Document Cleaning

Document denoising is considered one of the most challenging tasks in co...

Please sign up or login with your details

Forgot password? Click here to reset