Table Structure Recognition using Top-Down and Bottom-Up Cues

10/09/2020
by   Sachin Raja, et al.
0

Tables are information-rich structured objects in document images. While significant work has been done in localizing tables as graphic objects in document images, only limited attempts exist on table structure recognition. Most existing literature on structure recognition depends on extraction of meta-features from the PDF document or on the optical character recognition (OCR) models to extract low-level layout features from the image. However, these methods fail to generalize well because of the absence of meta-features or errors made by the OCR when there is a significant variance in table layouts and text organization. In our work, we focus on tables that have complex structures, dense content, and varying layouts with no dependency on meta-features and/or OCR. We present an approach for table structure recognition that combines cell detection and interaction modules to localize the cells and predict their row and column associations with other detected cells. We incorporate structural constraints as additional differential components to the loss function for cell detection. We empirically validate our method on the publicly available real-world datasets - ICDAR-2013, ICDAR-2019 (cTDaR) archival, UNLV, SciTSR, SciTSR-COMP, TableBank, and PubTabNet. Our attempt opens up a new direction for table structure recognition by combining top-down (table cells detection) and bottom-up (structure recognition) cues in visually understanding the tables.

READ FULL TEXT

page 13

page 27

page 29

research
01/08/2020

Table Structure Extraction with Bi-directional Gated Recurrent Unit Networks

Tables present summarized and structured information to the reader, whic...
research
11/13/2021

Visual Understanding of Complex Table Structures from Document Images

Table structure recognition is necessary for a comprehensive understandi...
research
04/29/2021

Current Status and Performance Analysis of Table Recognition in Document Images with Deep Neural Networks

The first phase of table recognition is to detect the tabular area in a ...
research
08/26/2020

Tabular Structure Detection from Document Images for Resource Constrained Devices Using A Row Based Similarity Measure

Tabular structures are used to present crucial information in a structur...
research
05/01/2023

TRACE: Table Reconstruction Aligned to Corner and Edges

A table is an object that captures structured and informative content wi...
research
03/14/2022

TSR-DSAW: Table Structure Recognition via Deep Spatial Association of Words

Existing methods for Table Structure Recognition (TSR) from camera-captu...
research
04/22/2021

Tablext: A Combined Neural Network And Heuristic Based Table Extractor

A significant portion of the data available today is found within tables...

Please sign up or login with your details

Forgot password? Click here to reset