Document Structure Extraction for Forms using Very High Resolution Semantic Segmentation

11/27/2019
by Mausoom Sarkar, et al.

In this work, we address the problem of structure extraction from document images, with a specific focus on forms. Forms as a document class have received little attention, even though they make up a significant fraction of documents and enable several applications. Forms possess a rich, complex, hierarchical, and high-density semantic structure that poses several challenges to semantic segmentation methods. We propose a prior-based deep CNN-RNN hierarchical network architecture that enables document structure extraction using very high resolution (1800 x 1000) images. We divide the document image into overlapping horizontal strips such that the network segments a strip and uses its prediction mask as a prior while predicting the segmentation for the subsequent strip. We perform experiments that establish the effectiveness of our strip-based network architecture through ablation studies and comparison with low-resolution variations. We introduce our new, rich, human-annotated forms dataset, and we show that our method significantly outperforms other segmentation baselines in extracting several hierarchical structures on this dataset. We also outperform other baselines in the table detection task on the Marmot dataset. Our method is currently being used in a world-leading customer experience management software suite for automated conversion of paper and PDF forms to modern HTML-based forms.
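The strip-wise inference described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the names `strip_bounds`, `segment_by_strips`, and `predict_strip` are hypothetical, the strip height and overlap values are placeholders, and the way the prior is aligned (the rows of the already-predicted mask that fall inside the current strip, zeros elsewhere) is an assumption made only for illustration. The network itself is stood in for by any callable that takes a strip and a prior and returns a mask.

```python
from typing import Callable, List, Tuple

Row = List[float]
Strip = List[Row]


def strip_bounds(height: int, strip_height: int, overlap: int) -> List[Tuple[int, int]]:
    """Return (top, bottom) row ranges of overlapping horizontal strips."""
    if height <= 0:
        raise ValueError("height must be positive")
    if not 0 <= overlap < strip_height:
        raise ValueError("overlap must be non-negative and smaller than strip_height")
    step = strip_height - overlap
    bounds = []
    top = 0
    while True:
        bottom = min(top + strip_height, height)
        bounds.append((top, bottom))
        if bottom == height:
            break
        top += step
    return bounds


def segment_by_strips(
    image: Strip,
    predict_strip: Callable[[Strip, Strip], Strip],
    strip_height: int,
    overlap: int,
) -> Strip:
    """Segment an image strip by strip, feeding earlier predictions as a prior.

    `predict_strip(strip, prior)` is a hypothetical stand-in for the network.
    Assumption: the prior is the crop of the mask predicted so far, with
    zeros in rows that no earlier strip has covered.
    """
    height, width = len(image), len(image[0])
    full_mask = [[0] * width for _ in range(height)]
    for top, bottom in strip_bounds(height, strip_height, overlap):
        prior = [row[:] for row in full_mask[top:bottom]]
        mask = predict_strip(image[top:bottom], prior)
        # Assumption: later strips overwrite the overlapping rows.
        full_mask[top:bottom] = [list(row) for row in mask]
    return full_mask


if __name__ == "__main__":
    # Toy predictor: marks a pixel if it is bright or the prior already marked it.
    def toy_predict(strip: Strip, prior: Strip) -> Strip:
        return [
            [1 if (px > 0.5 or pr == 1) else 0 for px, pr in zip(s_row, p_row)]
            for s_row, p_row in zip(strip, prior)
        ]

    toy_image = [[0.9 if r % 3 == 0 else 0.1] * 4 for r in range(10)]
    print(segment_by_strips(toy_image, toy_predict, strip_height=4, overlap=2))
```

Processing the strips in order keeps the input to each forward pass small while still passing context from the strip above, which is the property the abstract attributes to the prior. How overlapping predictions are merged is a design choice; overwriting is used here only because it is the simplest option.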


research
07/09/2021

Form2Seq : A Framework for Higher-Order Form Structure Extraction

Document structure extraction has been a widely researched area for deca...
research
07/09/2021

Multi-Modal Association based Grouping for Form Structure Extraction

Document structure extraction has been a widely researched area for deca...
research
10/24/2018

The UAVid Dataset for Video Semantic Segmentation

Video semantic segmentation has been one of the research focus in comput...
research
11/10/2020

MP-ResNet: Multi-path Residual Network for the Semantic segmentation of High-Resolution PolSAR Images

There are limited studies on the semantic segmentation of high-resolutio...
research
07/15/2019

CA-RefineNet: A Dual Input WSI Image Segmentation Algorithm Based on Attention

Due to the high resolution of pathological images, the automated semanti...
research
11/30/2018

TextureNet: Consistent Local Parametrizations for Learning from High-Resolution Signals on Meshes

We introduce, TextureNet, a neural network architecture designed to extr...
research
07/02/2018

Semantic Segmentation with Scarce Data

Semantic segmentation is a challenging vision problem that usually neces...
