GriTS: Grid table similarity metric for table structure recognition

03/23/2022
by   Brandon Smock, et al.
0

In this paper, we propose a new class of evaluation metric for table structure recognition, grid table similarity (GriTS). Unlike prior metrics, GriTS evaluates the correctness of a predicted table directly in its natural form as a matrix. To create a similarity measure between matrices, we generalize the two-dimensional largest common substructure (2D-LCS) problem, which is NP-hard, to the 2D most similar substructures (2D-MSS) problem and propose a polynomial-time heuristic for solving it. We validate empirically using the PubTables-1M dataset that comparison between matrices exhibits more desirable behavior than alternatives for table structure recognition evaluation. GriTS also unifies all three subtasks of cell topology recognition, cell location recognition, and cell content recognition within the same framework, which simplifies the evaluation and enables more meaningful comparisons across different types of structure recognition approaches. Code will be released at https://github.com/microsoft/table-transformer.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/30/2021

Scientific evidence extraction

Recently, interest has grown in applying machine learning to the problem...
research
07/31/2022

Evaluating Table Structure Recognition: A New Perspective

Existing metrics used to evaluate table structure recognition algorithms...
research
03/07/2023

LORE: Logical Location Regression Network for Table Structure Recognition

Table structure recognition (TSR) aims at extracting tables in images in...
research
03/14/2023

Rethinking Image-based Table Recognition Using Weakly Supervised Methods

Most of the previous methods for table recognition rely on training data...
research
03/03/2023

T360RRD: A dataset for 360 degree rotated rectangular box table detection

To address the problem of scarcity and high annotation costs of rotated ...
research
04/29/2021

TabAug: Data Driven Augmentation for Enhanced Table Structure Recognition

Table Structure Recognition is an essential part of end-to-end tabular d...
research
05/05/2021

PingAn-VCGroup's Solution for ICDAR 2021 Competition on Scientific Literature Parsing Task B: Table Recognition to HTML

This paper presents our solution for ICDAR 2021 competition on scientifi...

Please sign up or login with your details

Forgot password? Click here to reset