RectiNet-v2: A stacked network architecture for document image dewarping

02/01/2021
by   Hmrishav Bandyopadhyay, et al.
0

With the advent of mobile and hand-held cameras, document images have found their way into almost every domain. Dewarping of these images for the removal of perspective distortions and folds is essential so that they can be understood by document recognition algorithms. For this, we propose an end-to-end CNN architecture that can produce distortion free document images from warped documents it takes as input. We train this model on warped document images simulated synthetically to compensate for lack of enough natural data. Our method is novel in the use of a bifurcated decoder with shared weights to prevent intermingling of grid coordinates, in the use of residual networks in the U-Net skip connections to allow flow of data from different receptive fields in the model, and in the use of a gated network to help the model focus on structure and line level detail of the document image. We evaluate our method on the DocUNet dataset, a benchmark in this domain, and obtain results comparable to state-of-the-art methods.

READ FULL TEXT

page 2

page 5

research
07/20/2020

A Gated and Bifurcated Stacked U-Net Module for Document Image Dewarping

Capturing images of documents is one of the easiest and most used method...
research
09/11/2017

Recovering Homography from Camera Captured Documents using Convolutional Neural Networks

Removing perspective distortion from hand held camera captured document ...
research
01/27/2022

DocSegTr: An Instance-Level End-to-End Document Image Segmentation Transformer

Understanding documents with rich layouts is an essential step towards i...
research
01/25/2022

DocEnTr: An End-to-End Document Image Enhancement Transformer

Document images can be affected by many degradation scenarios, which cau...
research
11/30/2021

Donut: Document Understanding Transformer without OCR

Understanding document images (e.g., invoices) has been an important res...
research
08/04/2023

CTP-Net: Character Texture Perception Network for Document Image Forgery Localization

Due to the progression of information technology in recent years, docume...
research
02/06/2023

Neural Document Unwarping using Coupled Grids

Restoring the original, flat appearance of a printed document from casua...

Please sign up or login with your details

Forgot password? Click here to reset