Scale Up Event Extraction Learning via Automatic Training Data Generation

12/11/2017
by   Ying Zeng, et al.
0

The task of event extraction has long been investigated in a supervised learning paradigm, which is bound by the number and the quality of the training instances. Existing training data must be manually generated through a combination of expert domain knowledge and extensive human involvement. However, due to drastic efforts required in annotating text, the resultant datasets are usually small, which severally affects the quality of the learned model, making it hard to generalize. Our work develops an automatic approach for generating training data for event extraction. Our approach allows us to scale up event extraction training instances from thousands to hundreds of thousands, and it does this at a much lower cost than a manual approach. We achieve this by employing distant supervision to automatically create event annotations from unlabelled text using existing structured knowledge bases or tables.We then develop a neural network model with post inference to transfer the knowledge extracted from structured knowledge bases to automatically annotate typed events with corresponding arguments in text.We evaluate our approach by using the knowledge extracted from Freebase to label texts from Wikipedia articles. Experimental results show that our approach can generate a large number of high quality training instances. We show that this large volume of training data not only leads to a better event extractor, but also allows us to detect multiple typed events.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/16/2023

Boosting Event Extraction with Denoised Structure-to-Text Augmentation

Event extraction aims to recognize pre-defined event triggers and argume...
research
07/07/2017

External Evaluation of Event Extraction Classifiers for Automatic Pathway Curation: An extended study of the mTOR pathway

This paper evaluates the impact of various event extraction systems on a...
research
11/25/2019

Financial Event Extraction Using Wikipedia-Based Weak Supervision

Extraction of financial and economic events from text has previously bee...
research
02/14/2012

Reasoning about RoboCup Soccer Narratives

This paper presents an approach for learning to translate simple narrati...
research
08/26/2018

Semi-Supervised Event Extraction with Paraphrase Clusters

Supervised event extraction systems are limited in their accuracy due to...
research
09/20/2018

Rapid Customization for Event Extraction

We present a system for rapidly customizing event extraction capability ...
research
08/25/2019

Open Event Extraction from Online Text using a Generative Adversarial Network

To extract the structured representations of open-domain events, Bayesia...

Please sign up or login with your details

Forgot password? Click here to reset