STAR: Boosting Low-Resource Event Extraction by Structure-to-Text Data Generation with Large Language Models

05/24/2023
by Mingyu Derek Ma, et al.

Structure prediction tasks such as event extraction require an in-depth understanding of the output structure and sub-task dependencies, so they still rely heavily on task-specific training data to reach reasonable performance. Because human annotation is expensive, low-resource event extraction, which requires minimal human cost, is urgently needed in real-world information extraction applications. To boost low-resource event extraction performance, we synthesize data instances from a limited set of seed demonstrations. We propose STAR, a structure-to-text data generation method that first generates complicated event structures (Y) and then generates input passages (X), both with Large Language Models. We design fine-grained step-by-step instructions, and the error cases and quality issues identified through self-reflection are then self-refined. Our experiments indicate that data generated by STAR can significantly improve low-resource event extraction performance and that, in some cases, the generated data are even more effective than human-curated data points.
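The structure-to-text order described in the abstract (generate the event structure Y first, then a passage X that expresses it, then refine) can be illustrated with a minimal sketch. This is not the authors' implementation: the `LLM` callable type, the `EventStructure` class, the prompt wording, and the function names are all hypothetical, and the string-matching check in `find_issues` is a simple stand-in for the LLM-based self-reflection the abstract mentions.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List

# Hypothetical interface: any function that maps a prompt to a completion.
LLM = Callable[[str], str]


@dataclass
class EventStructure:
    """The output structure Y: an event type, its trigger, and role -> argument."""
    event_type: str
    trigger: str
    arguments: Dict[str, str] = field(default_factory=dict)


def generate_structure(llm: LLM, event_type: str, roles: List[str]) -> EventStructure:
    """Step 1: generate Y. The trigger comes first, then one argument per role."""
    trigger = llm(f"Give one trigger word for a '{event_type}' event.").strip()
    arguments = {
        role: llm(
            f"Give a '{role}' argument for a '{event_type}' event "
            f"triggered by '{trigger}'."
        ).strip()
        for role in roles
    }
    return EventStructure(event_type, trigger, arguments)


def find_issues(structure: EventStructure, passage: str) -> List[str]:
    """Stand-in for self-reflection: list structure spans missing from the passage."""
    required = [structure.trigger, *structure.arguments.values()]
    return [span for span in required if span.lower() not in passage.lower()]


def generate_passage(llm: LLM, structure: EventStructure, max_refinements: int = 2) -> str:
    """Step 2: generate X from Y, then refine while issues remain."""
    passage = llm(
        f"Write a passage describing a '{structure.event_type}' event with "
        f"trigger '{structure.trigger}' and arguments {structure.arguments}."
    )
    for _ in range(max_refinements):
        missing = find_issues(structure, passage)
        if not missing:
            break
        # Feed the identified issues back so the model can revise its own output.
        passage = llm(
            f"Revise the passage so it mentions {missing} verbatim.\n"
            f"Passage: {passage}"
        )
    return passage
```

Each resulting (X, Y) pair would serve as one synthetic training instance. Generating Y before X means the label is fixed up front, so the refinement loop only has to check that the passage agrees with it.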

research
05/16/2023

Boosting Event Extraction with Denoised Structure-to-Text Augmentation

Event extraction aims to recognize pre-defined event triggers and argume...
research
12/30/2020

DEER: A Data Efficient Language Model for Event Temporal Reasoning

Pretrained language models (LMs) such as BERT, RoBERTa, and ELECTRA are ...
research
08/16/2022

DICE: Data-Efficient Clinical Event Extraction with Generative Models

Event extraction in the clinical domain is an under-explored research ar...
research
03/23/2022

Unified Structure Generation for Universal Information Extraction

Information extraction suffers from its varying targets, heterogeneous s...
research
03/07/2023

Exploring the Feasibility of ChatGPT for Event Extraction

Event extraction is a fundamental task in natural language processing th...
research
01/06/2023

Mask-then-Fill: A Flexible and Effective Data Augmentation Framework for Event Extraction

We present Mask-then-Fill, a flexible and effective data augmentation fr...
research
09/11/2023

From Artificially Real to Real: Leveraging Pseudo Data from Large Language Models for Low-Resource Molecule Discovery

Molecule discovery serves as a cornerstone in numerous scientific domain...