SciCo: Hierarchical Cross-Document Coreference for Scientific Concepts

04/18/2021
by Arie Cattan, et al.

Determining whether concept mentions in different documents corefer is fundamental to natural language understanding. Work on cross-document coreference resolution (CDCR) typically considers mentions of events in the news, which rarely involve the abstract technical concepts that are prevalent in science and technology. These complex concepts take diverse or ambiguous forms and have many hierarchical levels of granularity (e.g., tasks and subtasks), posing challenges for CDCR. We present a new task of hierarchical CDCR for concepts in scientific papers, with the goal of jointly inferring coreference clusters and the hierarchy between them. We create SciCo, an expert-annotated dataset for this task, which is three times larger than the prominent ECB+ resource. We find that tackling coreference and hierarchy at once outperforms disjoint models, which we hope will spur the development of joint models for SciCo.
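
To make the target of the task concrete, the sketch below shows one possible way to represent its output: clusters of coreferring mentions plus parent-child links between clusters. It is an illustration only, not the SciCo data format or the authors' model. The names `Mention`, `HierarchicalClustering`, and `build_example`, as well as the example mentions and document ids, are hypothetical.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass(frozen=True)
class Mention:
    """A concept mention: its surface text and the paper it came from."""
    doc_id: str
    text: str


@dataclass
class HierarchicalClustering:
    """Hypothetical container for the two outputs of hierarchical CDCR."""
    # cluster id -> mentions judged to refer to the same concept
    clusters: dict[int, list[Mention]] = field(default_factory=dict)
    # parent cluster id -> ids of its direct child clusters
    children: dict[int, set[int]] = field(default_factory=dict)

    def add_cluster(self, cluster_id: int, mentions: list[Mention]) -> None:
        self.clusters[cluster_id] = list(mentions)

    def add_child(self, parent: int, child: int) -> None:
        if parent not in self.clusters or child not in self.clusters:
            raise KeyError("both clusters must exist before linking them")
        # Reject links that would make a concept its own ancestor.
        if parent == child or parent in self.descendants(child):
            raise ValueError("link would create a cycle in the hierarchy")
        self.children.setdefault(parent, set()).add(child)

    def cluster_of(self, mention: Mention) -> int | None:
        for cluster_id, mentions in self.clusters.items():
            if mention in mentions:
                return cluster_id
        return None

    def corefer(self, a: Mention, b: Mention) -> bool:
        """True when both mentions sit in the same cluster."""
        cluster_a = self.cluster_of(a)
        return cluster_a is not None and cluster_a == self.cluster_of(b)

    def descendants(self, cluster_id: int) -> set[int]:
        """All clusters reachable by following child links."""
        seen: set[int] = set()
        stack = list(self.children.get(cluster_id, ()))
        while stack:
            current = stack.pop()
            if current not in seen:
                seen.add(current)
                stack.extend(self.children.get(current, ()))
        return seen


def build_example() -> HierarchicalClustering:
    """Illustrative task/subtask pair; the mentions are made up."""
    task = [Mention("paper_a", "information extraction"), Mention("paper_b", "IE")]
    subtask = [Mention("paper_b", "named entity recognition"), Mention("paper_c", "NER")]
    result = HierarchicalClustering()
    result.add_cluster(0, task)
    result.add_cluster(1, subtask)
    result.add_child(parent=0, child=1)  # the subtask sits under the task
    return result
```

In this representation, a model that predicts clusters and hierarchy separately would fill `clusters` and `children` in two independent passes, whereas a joint model would produce both together.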

