Extracting Victim Counts from Text

02/23/2023
by   Mian Zhong, et al.
0

Decision-makers in the humanitarian sector rely on timely and exact information during crisis events. Knowing how many civilians were injured during an earthquake is vital to allocate aids properly. Information about such victim counts is often only available within full-text event descriptions from newspapers and other reports. Extracting numbers from text is challenging: numbers have different formats and may require numeric reasoning. This renders purely string matching-based approaches insufficient. As a consequence, fine-grained counts of injured, displaced, or abused victims beyond fatalities are often not extracted and remain unseen. We cast victim count extraction as a question answering (QA) task with a regression or classification objective. We compare regex, dependency parsing, semantic role labeling-based approaches, and advanced text-to-text models. Beyond model accuracy, we analyze extraction reliability and robustness which are key for this sensitive task. In particular, we discuss model calibration and investigate few-shot and out-of-distribution performance. Ultimately, we make a comprehensive recommendation on which model to select for different desiderata and data domains. Our work is among the first to apply numeracy-focused large language models in a real-world use case with a positive impact.

READ FULL TEXT
research
03/08/2023

Comprehensive Event Representations using Event Knowledge Graphs and Natural Language Processing

Recent work has utilised knowledge-aware approaches to natural language ...
research
03/07/2023

Exploring the Feasibility of ChatGPT for Event Extraction

Event extraction is a fundamental task in natural language processing th...
research
07/10/2023

Event Extraction as Question Generation and Answering

Recent work on Event Extraction has reframed the task as Question Answer...
research
12/01/2016

On Coreferring Text-extracted Event Descriptions with the aid of Ontological Reasoning

Systems for automatic extraction of semantic information about events fr...
research
07/25/2023

GPT-3 Models are Few-Shot Financial Reasoners

Financial analysis is an important tool for evaluating company performan...
research
05/11/2023

Long-Tailed Question Answering in an Open World

Real-world data often have an open long-tailed distribution, and buildin...

Please sign up or login with your details

Forgot password? Click here to reset