Twitter as a Lifeline: Human-annotated Twitter Corpora for NLP of Crisis-related Messages

05/19/2016
by   Muhammad Imran, et al.
0

Microblogging platforms such as Twitter provide active communication channels during mass convergence and emergency events such as earthquakes, typhoons. During the sudden onset of a crisis situation, affected people post useful information on Twitter that can be used for situational awareness and other humanitarian disaster response efforts, if processed timely and effectively. Processing social media information pose multiple challenges such as parsing noisy, brief and informal messages, learning information categories from the incoming stream of messages and classifying them into different classes among others. One of the basic necessities of many of these tasks is the availability of data, in particular human-annotated data. In this paper, we present human-annotated Twitter corpora collected during 19 different crises that took place between 2013 and 2015. To demonstrate the utility of the annotations, we train machine learning classifiers. Moreover, we publish first largest word2vec word embeddings trained on 52 million crisis-related tweets. To deal with tweets language issues, we present human-annotated normalized lexical resources for different lexical variations.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/02/2021

A multi-modal approach towards mining social media data during natural disasters – a case study of Hurricane Irma

Streaming social media provides a real-time glimpse of extreme weather i...
research
01/18/2018

Unsupervised Hashtag Retrieval and Visualization for Crisis Informatics

In social media like Twitter, hashtags carry a lot of semantic informati...
research
10/04/2016

Applications of Online Deep Learning for Crisis Response Using Social Media Information

During natural or man-made disasters, humanitarian response organization...
research
04/26/2021

Continual Distributed Learning for Crisis Management

Social media platforms such as Twitter provide an excellent resource for...
research
06/16/2017

Active learning in annotating micro-blogs dealing with e-reputation

Elections unleash strong political views on Twitter, but what do people ...
research
01/28/2017

Feature Studies to Inform the Classification of Depressive Symptoms from Twitter Data for Population Health

The utility of Twitter data as a medium to support population-level ment...
research
06/02/2020

An Empirical Methodology for Detecting and Prioritizing Needs during Crisis Events

In times of crisis, identifying the essential needs is a crucial step to...

Please sign up or login with your details

Forgot password? Click here to reset