S2abEL: A Dataset for Entity Linking from Scientific Tables

04/30/2023
by   Yuze Lou, et al.
0

Entity linking (EL) is the task of linking a textual mention to its corresponding entry in a knowledge base, and is critical for many knowledge-intensive NLP applications. When applied to tables in scientific papers, EL is a step toward large-scale scientific knowledge bases that could enable advanced scientific question answering and analytics. We present the first dataset for EL in scientific tables. EL for scientific tables is especially challenging because scientific knowledge bases can be very incomplete, and disambiguating table mentions typically requires understanding the papers's tet in addition to the table. Our dataset, S2abEL, focuses on EL in machine learning results tables and includes hand-labeled cell types, attributed sources, and entity links from the PaperswithCode taxonomy for 8,429 cells from 732 tables. We introduce a neural baseline method designed for EL on scientific tables containing many out-of-knowledge-base mentions, and show that it significantly outperforms a state-of-the-art generic table EL method. The best baselines fall below human performance, and our analysis highlights avenues for improvement.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/12/2023

A Practical Entity Linking System for Tables in Scientific Literature

Entity linking is an important step towards constructing knowledge graph...
research
07/28/2021

Tab2Know: Building a Knowledge Base from Tables in Scientific Papers

Tables in scientific papers contain a wealth of valuable knowledge for t...
research
07/05/2022

Entity Linking in Tabular Data Needs the Right Attention

Understanding the semantic meaning of tabular data requires Entity Linki...
research
01/28/2023

ACL-Fig: A Dataset for Scientific Figure Classification

Most existing large-scale academic search engines are built to retrieve ...
research
02/01/2020

Novel Entity Discovery from Web Tables

When working with any sort of knowledge base (KB) one has to make sure i...
research
02/01/2021

Metric-Type Identification for Multi-Level Header Numerical Tables in Scientific Papers

Numerical tables are widely used to present experimental results in scie...
research
08/29/2017

EntiTables: Smart Assistance for Entity-Focused Tables

Tables are among the most powerful and practical tools for organizing an...

Please sign up or login with your details

Forgot password? Click here to reset