Presenting a New Dataset for the Timeline Generation Problem

11/07/2016
by   Xavier Holt, et al.
0

The timeline generation task summarises an entity's biography by selecting stories representing key events from a large pool of relevant documents. This paper addresses the lack of a standard dataset and evaluative methodology for the problem. We present and make publicly available a new dataset of 18,793 news articles covering 39 entities. For each entity, we provide a gold standard timeline and a set of entity-related articles. We propose ROUGE as an evaluation metric and validate our dataset by showing that top Google results outperform straw-man baselines.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/15/2021

Tracking entities in technical procedures – a new dataset and baselines

We introduce TechTrack, a new dataset for tracking entities in technical...
research
11/02/2022

Generative Entity-to-Entity Stance Detection with Knowledge Graph Augmentation

Stance detection is typically framed as predicting the sentiment in a gi...
research
04/05/2019

PoMo: Generating Entity-Specific Post-Modifiers in Context

We introduce entity post-modifier generation as an instance of a collabo...
research
04/01/2021

Mitigating Media Bias through Neutral Article Generation

Media bias can lead to increased political polarization, and thus, the n...
research
05/27/2019

FUNSD: A Dataset for Form Understanding in Noisy Scanned Documents

In this paper, we present a new dataset for Form Understanding in Noisy ...
research
04/25/2019

Importance of Copying Mechanism for News Headline Generation

News headline generation is an essential problem of text summarization b...
research
04/05/2021

LAGOS-AND: A Large, Gold Standard Dataset for Scholarly Author Name Disambiguation

In this paper, we present a method to automatically generate a large-sca...

Please sign up or login with your details

Forgot password? Click here to reset