A Study on Reproducibility and Replicability of Table Structure Recognition Methods

04/20/2023
by   Kehinde Ajayi, et al.
0

Concerns about reproducibility in artificial intelligence (AI) have emerged, as researchers have reported unsuccessful attempts to directly reproduce published findings in the field. Replicability, the ability to affirm a finding using the same procedures on new data, has not been well studied. In this paper, we examine both reproducibility and replicability of a corpus of 16 papers on table structure recognition (TSR), an AI task aimed at identifying cell locations of tables in digital documents. We attempt to reproduce published results using codes and datasets provided by the original authors. We then examine replicability using a dataset similar to the original as well as a new dataset, GenTSR, consisting of 386 annotated tables extracted from scientific papers. Out of 16 papers studied, we reproduce results consistent with the original in only four. Two of the four papers are identified as replicable using the similar dataset under certain IoU values. No paper is identified as replicable using the new dataset. We offer observations on the causes of irreproducibility and irreplicability. All code and data are available on Codeocean at https://codeocean.com/capsule/6680116/tree.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/28/2022

Automatic Analysis of Available Source Code of Top Artificial Intelligence Conference Papers

Source code is essential for researchers to reproduce the methods and re...
research
11/30/2019

[Re] Learning to Learn By Self-Critique

This work is a reproducibility study of the paper of Antoniou and Storke...
research
05/14/2021

RC2020 Report: Learning De-biased Representations with Biased Representations

As part of the ML Reproducibility Challenge 2020, we investigated the IC...
research
04/13/2018

Exploration of reproducibility issues in scientometric research Part 1: Direct reproducibility

This is the first part of a small-scale explorative study in an effort t...
research
04/13/2018

Exploration of Reproducibility Issues in Scientometric Research Part 2: Conceptual Reproducibility

This is the second part of a small-scale explorative study in an effort ...
research
02/16/2019

An Explorative Study of GitHub Repositories of AI Papers

With the rapid development of AI technologies, thousands of AI papers ar...
research
05/01/2020

Code Replicability in Computer Graphics

Being able to duplicate published research results is an important proce...

Please sign up or login with your details

Forgot password? Click here to reset