Bridging the gap between supervised classification and unsupervised topic modelling for social-media assisted crisis management

03/22/2021
by Mikael Brunila, et al.

Social media such as Twitter provide valuable information to crisis managers and affected people during natural disasters. Machine learning can help structure and extract information from the large volume of messages shared during a crisis; however, the constantly evolving nature of crises makes effective domain adaptation essential. Supervised classification is limited by fixed class labels that may not be relevant to new events, and unsupervised topic modelling is limited by insufficient prior knowledge. In this paper, we bridge the gap between the two and show that BERT embeddings finetuned on crisis-related tweet classification can effectively be used to adapt to a new crisis, discovering novel topics while preserving relevant classes from supervised training, and leveraging bidirectional self-attention to extract topic keywords. We create a dataset of tweets from a snowstorm to evaluate our method's transferability to new crises, and find that it outperforms traditional topic models in both automatic and human evaluations grounded in the needs of crisis managers. More broadly, our method can be used for textual domain adaptation where the latent classes are unknown but overlap with known classes from other domains.

research
08/08/2016

Topic Modelling and Event Identification from Twitter Textual Data

The tremendous growth of social media content on the Internet has inspir...
research
09/17/2019

Two Computational Models for Analyzing Political Attention in Social Media

Understanding how political attention is divided and over what subjects ...
research
10/21/2019

Using machine learning and information visualisation for discovering latent topics in Twitter news

We propose a method to discover latent topics and visualise large collec...
research
09/27/2020

How do people describe locations during a natural disaster: an analysis of tweets from Hurricane Harvey

Social media platforms, such as Twitter, have been increasingly used by ...
research
05/26/2023

Coping with low data availability for social media crisis message categorisation

During crisis situations, social media allows people to quickly share in...
research
05/03/2022

CTM – A Model for Large-Scale Multi-View Tweet Topic Classification

Automatically associating social media posts with topics is an important...
research
06/01/2023

Topic-Guided Sampling For Data-Efficient Multi-Domain Stance Detection

Stance Detection is concerned with identifying the attitudes expressed b...
