Tabular Structure Detection from Document Images for Resource Constrained Devices Using A Row Based Similarity Measure

08/26/2020
by   Soumyadeep Dey, et al.
0

Tabular structures are used to present crucial information in a structured and crisp manner. Detection of such regions is of great importance for proper understanding of a document. Tabular structures can be of various layouts and types. Therefore, detection of these regions is a hard problem. Most of the existing techniques detect tables from a document image by using prior knowledge of the structures of the tables. However, these methods are not applicable for generalized tabular structures. In this work, we propose a similarity measure to find similarities between pairs of rows in a tabular structure. This similarity measure is utilized to identify a tabular region. Since the tabular regions are detected exploiting the similarities among all rows, the method is inherently independent of layouts of the tabular regions present in the training data. Moreover, the proposed similarity measure can be used to identify tabular regions without using large sets of parameters associated with recent deep learning based methods. Thus, the proposed method can easily be used with resource constrained devices such as mobile devices without much of an overhead.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/09/2020

Table Structure Recognition using Top-Down and Bottom-Up Cues

Tables are information-rich structured objects in document images. While...
research
05/07/2015

Shadow Optimization from Structured Deep Edge Detection

Local structures of shadow boundaries as well as complex interactions of...
research
10/06/2021

On Cropped versus Uncropped Training Sets in Tabular Structure Detection

Automated document processing for tabular information extraction is high...
research
03/02/2010

Binarizing Business Card Images for Mobile Devices

Business card images are of multiple natures as these often contain grap...
research
11/30/2018

Document Structure Measure for Hypernym discovery

Hypernym discovery is the problem of finding terms that have is-a relati...
research
12/30/2021

Utilizing Wordnets for Cognate Detection among Indian Languages

Automatic Cognate Detection (ACD) is a challenging task which has been u...
research
05/05/2021

Towards an efficient framework for Data Extraction from Chart Images

In this paper, we fill the research gap by adopting state-of-the-art com...

Please sign up or login with your details

Forgot password? Click here to reset