Tale of tails using rule augmented sequence labeling for event extraction

08/19/2019
by   Hrishikesh Patel, et al.
0

The problem of event extraction is a relatively difficult task for low resource languages due to the non-availability of sufficient annotated data. Moreover, the task becomes complex for tail (rarely occurring) labels wherein extremely less data is available. In this paper, we present a new dataset (InDEE-2019) in the disaster domain for multiple Indic languages, collected from news websites. Using this dataset, we evaluate several rule-based mechanisms to augment deep learning based models. We formulate our problem of event extraction as a sequence labeling task and perform extensive experiments to study and understand the effectiveness of different approaches. We further show that tail labels can be easily incorporated by creating new rules without the requirement of large annotated data.

READ FULL TEXT

page 3

page 4

page 6

page 7

research
01/15/2022

Extracting Space Situational Awareness Events from News Text

Space situational awareness typically makes use of physical measurements...
research
05/22/2023

MAILEX: Email Event and Argument Extraction

In this work, we present the first dataset, , for performing event extra...
research
12/20/2022

Pre-trained Language Models for Keyphrase Generation: A Thorough Empirical Study

Neural models that do not rely on pre-training have excelled in the keyp...
research
11/02/2022

Title2Event: Benchmarking Open Event Extraction with a Large-scale Chinese Title Dataset

Event extraction (EE) is crucial to downstream tasks such as new aggrega...
research
11/11/2022

MEE: A Novel Multilingual Event Extraction Dataset

Event Extraction (EE) is one of the fundamental tasks in Information Ext...
research
03/18/2022

CaMEL: Case Marker Extraction without Labels

We introduce CaMEL (Case Marker Extraction without Labels), a novel and ...
research
05/04/2017

A Finite State and Rule-based Akshara to Prosodeme (A2P) Converter in Hindi

This article describes a software module called Akshara to Prosodeme (A2...

Please sign up or login with your details

Forgot password? Click here to reset