TableBank: Table Benchmark for Image-based Table Detection and Recognition

03/05/2019
by   Minghao Li, et al.
0

We present TableBank, a new image-based table detection and recognition dataset built with novel weak supervision from Word and Latex documents on the internet. Existing research for image-based table detection and recognition usually fine-tunes pre-trained models on out-of-domain data with a few thousands human labeled examples, which is difficult to generalize on real world applications. With TableBank that contains 417K high-quality labeled tables, we build several strong baselines using state-of-the-art models with deep neural networks. We make TableBank publicly available (https://github.com/doc-analysis/TableBank) and hope it will empower more deep learning approaches in the table detection and recognition task.

READ FULL TEXT
research
04/27/2020

CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents

An automatic table recognition method for interpretation of tabular data...
research
11/25/2019

Image-based table recognition: data, model, and evaluation

Important information that relates to a specific topic in a document is ...
research
03/03/2023

T360RRD: A dataset for 360 degree rotated rectangular box table detection

To address the problem of scarcity and high annotation costs of rotated ...
research
11/15/2022

Deep learning for table detection and structure recognition: A survey

Tables are everywhere, from scientific journals, papers, websites, and n...
research
02/16/2021

TableLab: An Interactive Table Extraction System with Adaptive Deep Learning

Table extraction from PDF and image documents is a ubiquitous task in th...
research
05/03/2023

Pre-train and Search: Efficient Embedding Table Sharding with Pre-trained Neural Cost Models

Sharding a large machine learning model across multiple devices to balan...
research
02/08/2023

Geometric Perception based Efficient Text Recognition

Every Scene Text Recognition (STR) task consists of text localization & ...

Please sign up or login with your details

Forgot password? Click here to reset