Joint Event Detection and Entity Resolution: a Virtuous Cycle

07/18/2016
by   Matthias Gallé, et al.
0

Clustering web documents has numerous applications, such as aggregating news articles into meaningful events, detecting trends and hot topics on the Web, preserving diversity in search results, etc. At the same time, the importance of named entities and, in particular, the ability to recognize them and to solve the associated co-reference resolution problem are widely recognized as key enabling factors when mining, aggregating and comparing content on the Web. Instead of considering these two problems separately, we propose in this paper a method that tackles jointly the problem of clustering news articles into events and cross-document co-reference resolution of named entities. The co-occurrence of named entities in the same clusters is used as an additional signal to decide whether two referents should be merged into one entity. These refined entities can in turn be used as enhanced features to re-cluster the documents and then be refined again, entering into a virtuous cycle that improves simultaneously the performances of both tasks. We implemented a prototype system and report results using the TDT5 collection of news articles, demonstrating the potential of our approach.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/28/2020

Complex networks for event detection in heterogeneous high volume news streams

Detecting important events in high volume news streams is an important t...
research
12/11/2021

Show and Write: Entity-aware News Generation with Image Information

Automatically writing long articles is a complex and challenging languag...
research
09/11/2021

XCoref: Cross-document Coreference Resolution in the Wild

Datasets and methods for cross-document coreference resolution (CDCR) fo...
research
05/14/2016

Occurrence Statistics of Entities, Relations and Types on the Web

The problem of collecting reliable estimates of occurrence of entities o...
research
12/15/2021

Responsive parallelized architecture for deploying deep learning models in production environments

Recruiters can easily shortlist candidates for jobs via viewing their cu...
research
08/16/2019

CommentsRadar: Dive into Unique Data on All Comments on the Web

We introduce an entity-centric search engineCommentsRadarthatpairs entit...
research
12/31/2020

Understanding Politics via Contextualized Discourse Processing

Politicians often have underlying agendas when reacting to events. Argum...

Please sign up or login with your details

Forgot password? Click here to reset