Predicting Zip Code-Level Vaccine Hesitancy in US Metropolitan Areas Using Machine Learning Models on Public Tweets

08/03/2021
by Sara Melotte, et al.

Although the recent uptake of COVID-19 vaccines in the United States has been encouraging, significant vaccine hesitancy persists in various geographic and demographic clusters of the adult population. Surveys, such as the one conducted by Gallup over the past year, can be useful in determining vaccine hesitancy, but they can be expensive to conduct and do not provide real-time data. At the same time, the advent of social media suggests that it may be possible to obtain vaccine hesitancy signals at an aggregate level (such as at the level of zip codes) by using machine learning models and socioeconomic (and other) features from publicly available sources. It is an open question at present whether such an endeavor is feasible, and how it compares to baselines that only use constant priors. To our knowledge, a proper methodology and evaluation results using real data have also not been presented. In this article, we present such a methodology and experimental study, using publicly available Twitter data collected over the last year. Our goal is not to devise novel machine learning algorithms, but to evaluate existing and established models in a comparative framework. We show that the best models significantly outperform constant priors, and can be set up using open-source tools.
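
To make the comparison against a constant prior concrete, the following is a minimal sketch, not the authors' pipeline. It assumes a constant-prior baseline means predicting the training-set mean hesitancy for every zip code, and it uses a one-feature least-squares fit as a stand-in for the models evaluated in the study. The feature, the numbers, the function names, and the choice of mean absolute error as the metric are all hypothetical and chosen only for illustration.

```python
from statistics import mean

def constant_prior_predict(train_y, n):
    """Predict the training-set mean for each of n test items (assumed baseline)."""
    prior = mean(train_y)
    return [prior] * n

def fit_simple_linear(train_x, train_y):
    """Ordinary least squares on a single feature; returns (slope, intercept)."""
    mx, my = mean(train_x), mean(train_y)
    sxx = sum((x - mx) ** 2 for x in train_x)
    sxy = sum((x - mx) * (y - my) for x, y in zip(train_x, train_y))
    slope = sxy / sxx
    return slope, my - slope * mx

def mean_absolute_error(y_true, y_pred):
    """Average absolute difference between true and predicted values."""
    return mean(abs(t - p) for t, p in zip(y_true, y_pred))

# Synthetic, made-up data: one hypothetical per-zip-code feature (x)
# and a hypothetical hesitancy fraction (y). Not taken from the paper.
train_x = [0.1, 0.2, 0.3, 0.4, 0.5, 0.6]
train_y = [0.12, 0.15, 0.21, 0.24, 0.31, 0.33]
test_x = [0.15, 0.35, 0.55]
test_y = [0.14, 0.22, 0.32]

slope, intercept = fit_simple_linear(train_x, train_y)
model_pred = [slope * x + intercept for x in test_x]
baseline_pred = constant_prior_predict(train_y, len(test_y))

baseline_mae = mean_absolute_error(test_y, baseline_pred)
model_mae = mean_absolute_error(test_y, model_pred)
print(f"constant-prior MAE: {baseline_mae:.4f}")
print(f"linear-model MAE:   {model_mae:.4f}")
```

On this toy data the fitted model has a lower error than the constant prior, which is the kind of gap the comparative evaluation is meant to measure. On real data the size of the gap would depend on the features and models used.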

research
03/31/2020

A large-scale Twitter dataset for drug safety applications mined from publicly existing resources

With the increase in popularity of deep learning models for natural lang...
research
11/10/2021

Understanding COVID-19 Vaccine Reaction through Comparative Analysis on Twitter

Although multiple COVID-19 vaccines have been available for several mont...
research
02/19/2020

Descriptive and Predictive Analysis of Euroleague Basketball Games and the Wisdom of Basketball Crowds

In this study we focus on the prediction of basketball games in the Euro...
research
04/13/2023

Vax-Culture: A Dataset for Studying Vaccine Discourse on Twitter

Vaccine hesitancy continues to be a main challenge for public health off...
research
10/22/2018

Automatically Detecting Self-Reported Birth Defect Outcomes on Twitter for Large-scale Epidemiological Research

In recent work, we identified and studied a small cohort of Twitter user...
research
03/03/2023

Early Warning Signals of Social Instabilities in Twitter Data

The goal of this project is to create and study novel techniques to iden...
research
04/07/2022

Collecting, Classifying, Analyzing, and Using Real-World Elections

We present a collection of 7582 real-world elections divided into 25 dat...
