ArgFuse: A Weakly-Supervised Framework for Document-Level Event Argument Aggregation

06/21/2021
by   Debanjana Kar, et al.
0

Most of the existing information extraction frameworks (Wadden et al., 2019; Veysehet al., 2020) focus on sentence-level tasks and are hardly able to capture the consolidated information from a given document. In our endeavour to generate precise document-level information frames from lengthy textual records, we introduce the task of Information Aggregation or Argument Aggregation. More specifically, our aim is to filter irrelevant and redundant argument mentions that were extracted at a sentence level and render a document level information frame. Majority of the existing works have been observed to resolve related tasks of document-level event argument extraction (Yang et al., 2018a; Zheng et al., 2019a) and salient entity identification (Jain et al.,2020) using supervised techniques. To remove dependency from large amounts of labelled data, we explore the task of information aggregation using weakly-supervised techniques. In particular, we present an extractive algorithm with multiple sieves which adopts active learning strategies to work efficiently in low-resource settings. For this task, we have annotated our own test dataset comprising of 131 document information frames and have released the code and dataset to further research prospects in this new domain. To the best of our knowledge, we are the first to establish baseline results for this task in English. Our data and code are publicly available at https://github.com/DebanjanaKar/ArgFuse.

READ FULL TEXT
research
09/06/2022

Few-Shot Document-Level Event Argument Extraction

Event argument extraction (EAE) has been well studied at the sentence le...
research
10/14/2021

Making Document-Level Information Extraction Right for the Right Reasons

Document-level information extraction is a flexible framework compatible...
research
05/01/2020

SciREX: A Challenge Dataset for Document-Level Information Extraction

Extracting information from full documents is an important problem in ma...
research
01/31/2020

Similarità per la ricerca del dominio di una frase

English. This document aims to study the best algorithms to verify the b...
research
01/16/2021

Weakly-Supervised Hierarchical Models for Predicting Persuasive Strategies in Good-faith Textual Requests

Modeling persuasive language has the potential to better facilitate our ...
research
09/18/2022

Dynamic Global Memory for Document-level Argument Extraction

Extracting informative arguments of events from news articles is a chall...
research
06/29/2021

SDL: New data generation tools for full-level annotated document layout

We present a novel data generation tool for document processing. The too...

Please sign up or login with your details

Forgot password? Click here to reset