XCoref: Cross-document Coreference Resolution in the Wild

09/11/2021
by   Anastasia Zhukova, et al.
0

Datasets and methods for cross-document coreference resolution (CDCR) focus on events or entities with strict coreference relations. They lack, however, annotating and resolving coreference mentions with more abstract or loose relations that may occur when news articles report about controversial and polarized events. Bridging and loose coreference relations trigger associations that may lead to exposing news readers to bias by word choice and labeling. For example, coreferential mentions of "direct talks between U.S. President Donald Trump and Kim" such as "an extraordinary meeting following months of heated rhetoric" or "great chance to solve a world problem" form a more positive perception of this event. A step towards bringing awareness of bias by word choice and labeling is the reliable resolution of coreferences with high lexical diversity. We propose an unsupervised method named XCoref, which is a CDCR method that capably resolves not only previously prevalent entities, such as persons, e.g., "Donald Trump," but also abstractly defined concepts, such as groups of persons, "caravan of immigrants," events and actions, e.g., "marching to the U.S. border." In an extensive evaluation, we compare the proposed XCoref to a state-of-the-art CDCR method and a previous method TCA that resolves such complex coreference relations and find that XCoref outperforms these methods. Outperforming an established CDCR model shows that the new CDCR models need to be evaluated on semantically complex mentions with more loose coreference relations to indicate their applicability of models to resolve mentions in the "wild" of political news articles.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/11/2021

Qualitative and Quantitative Analysis of Diversity in Cross-document Coreference Resolution Datasets

Cross-document coreference resolution (CDCR) datasets, such as ECB+, con...
research
07/18/2016

Joint Event Detection and Entity Resolution: a Virtuous Cycle

Clustering web documents has numerous applications, such as aggregating ...
research
04/13/2017

Identity and Granularity of Events in Text

In this paper we describe a method to detect event descrip- tions in dif...
research
01/29/2021

CD2CR: Co-reference Resolution Across Documents and Domains

Cross-document co-reference resolution (CDCR) is the task of identifying...
research
04/18/2019

No Permanent Friends or Enemies: Tracking Relationships between Nations from News

Understanding the dynamics of international politics is important yet ch...
research
04/18/2021

SciCo: Hierarchical Cross-Document Coreference for Scientific Concepts

Determining coreference of concept mentions across multiple documents is...
research
06/30/2020

Segmentation Approach for Coreference Resolution Task

In coreference resolution, it is important to consider all members of a ...

Please sign up or login with your details

Forgot password? Click here to reset