Title2Event: Benchmarking Open Event Extraction with a Large-scale Chinese Title Dataset

11/02/2022
by   Haolin Deng, et al.
0

Event extraction (EE) is crucial to downstream tasks such as new aggregation and event knowledge graph construction. Most existing EE datasets manually define fixed event types and design specific schema for each of them, failing to cover diverse events emerging from the online text. Moreover, news titles, an important source of event mentions, have not gained enough attention in current EE research. In this paper, We present Title2Event, a large-scale sentence-level dataset benchmarking Open Event Extraction without restricting event types. Title2Event contains more than 42,000 news titles in 34 topics collected from Chinese web pages. To the best of our knowledge, it is currently the largest manually-annotated Chinese dataset for open event extraction. We further conduct experiments on Title2Event with different models and show that the characteristics of titles make it challenging for event extraction, addressing the significance of advanced study on this problem. The dataset and baseline codes are available at https://open-event-hub.github.io/title2event.

READ FULL TEXT

page 5

page 8

page 14

research
06/17/2019

Open Domain Event Extraction Using Neural Latent Variable Models

We consider open domain event extraction, the task of extracting unconst...
research
05/25/2022

GENEVA: Pushing the Limit of Generalizability for Event Argument Extraction with 100+ Event Types

Numerous events occur worldwide and are documented in the news, social m...
research
11/25/2022

MUSIED: A Benchmark for Event Detection from Multi-Source Heterogeneous Informal Texts

Event detection (ED) identifies and classifies event triggers from unstr...
research
04/28/2020

MAVEN: A Massive General Domain Event Detection Dataset

Event detection (ED), which identifies event trigger words and classifie...
research
09/20/2021

Modality and Negation in Event Extraction

Language provides speakers with a rich system of modality for expressing...
research
03/16/2023

GLEN: General-Purpose Event Detection for Thousands of Types

The development of event extraction systems has been hindered by the abs...
research
08/19/2019

Tale of tails using rule augmented sequence labeling for event extraction

The problem of event extraction is a relatively difficult task for low r...

Please sign up or login with your details

Forgot password? Click here to reset