Vax-Culture: A Dataset for Studying Vaccine Discourse on Twitter

04/13/2023
by   Mohammad Reza Zarei, et al.
0

Vaccine hesitancy continues to be a main challenge for public health officials during the COVID-19 pandemic. As this hesitancy undermines vaccine campaigns, many researchers have sought to identify its root causes, finding that the increasing volume of anti-vaccine misinformation on social media platforms is a key element of this problem. We explored Twitter as a source of misleading content with the goal of extracting overlapping cultural and political beliefs that motivate the spread of vaccine misinformation. To do this, we have collected a data set of vaccine-related Tweets and annotated them with the help of a team of annotators with a background in communications and journalism. Ultimately we hope this can lead to effective and targeted public health communication strategies for reaching individuals with anti-vaccine beliefs. Moreover, this information helps with developing Machine Learning models to automatically detect vaccine misinformation posts and combat their negative impacts. In this paper, we present Vax-Culture, a novel Twitter COVID-19 dataset consisting of 6373 vaccine-related tweets accompanied by an extensive set of human-provided annotations including vaccine-hesitancy stance, indication of any misinformation in tweets, the entities criticized and supported in each tweet and the communicated message of each tweet. Moreover, we define five baseline tasks including four classification and one sequence generation tasks, and report the results of a set of recent transformer-based models for them. The dataset and code are publicly available at https://github.com/mrzarei5/Vax-Culture.

READ FULL TEXT
research
04/05/2022

The COVMis-Stance dataset: Stance Detection on Twitter for COVID-19 Misinformation

During the COVID-19 pandemic, large amounts of COVID-19 misinformation a...
research
05/24/2022

VoynaSlov: A Data Set of Russian Social Media Activity during the 2022 Ukraine-Russia War

In this report, we describe a new data set called VoynaSlov which contai...
research
08/02/2021

Changes in European Solidarity Before and During COVID-19: Evidence from a Large Crowd- and Expert-Annotated Twitter Dataset

We introduce the well-established social scientific concept of social so...
research
10/21/2021

A Python Package to Detect Anti-Vaccine Users on Twitter

Vaccine hesitancy has a long history but has been recently driven by the...
research
08/03/2021

Predicting Zip Code-Level Vaccine Hesitancy in US Metropolitan Areas Using Machine Learning Models on Public Tweets

Although the recent rise and uptake of COVID-19 vaccines in the United S...
research
05/22/2022

TWEET-FID: An Annotated Dataset for Multiple Foodborne Illness Detection Tasks

Foodborne illness is a serious but preventable public health problem – w...

Please sign up or login with your details

Forgot password? Click here to reset