Forecasting COVID-19 Caseloads Using Unsupervised Embedding Clusters of Social Media Posts

05/20/2022
by   Felix Drinkall, et al.
0

We present a novel approach incorporating transformer-based language models into infectious disease modelling. Text-derived features are quantified by tracking high-density clusters of sentence-level representations of Reddit posts within specific US states' COVID-19 subreddits. We benchmark these clustered embedding features against features extracted from other high-quality datasets. In a threshold-classification task, we show that they outperform all other feature types at predicting upward trend signals, a significant result for infectious disease modelling in areas where epidemiological data is unreliable. Subsequently, in a time-series forecasting task we fully utilise the predictive power of the caseload and compare the relative strengths of using different supplementary datasets as covariate feature sets in a transformer-based time-series model.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/20/2021

Analyzing COVID-19 Tweets with Transformer-based Language Models

This paper describes a method for using Transformer-based Language Model...
research
09/22/2022

Adaptation of domain-specific transformer models with text oversampling for sentiment analysis of social media posts on Covid-19 vaccines

Covid-19 has spread across the world and several vaccines have been deve...
research
11/09/2021

Deep diffusion-based forecasting of COVID-19 by incorporating network-level mobility information

Modeling the spatiotemporal nature of the spread of infectious diseases ...
research
07/03/2023

A novel approach for predicting epidemiological forecasting parameters based on real-time signals and Data Assimilation

This paper proposes a novel approach to predict epidemiological paramete...
research
10/25/2020

Inter-Series Attention Model for COVID-19 Forecasting

COVID-19 pandemic has an unprecedented impact all over the world since e...
research
07/04/2019

Application of Transfer Learning for Automatic Triage of Social Media Posts

Mental illness affects a significant portion of the worldwide population...
research
04/19/2021

UVCE-IIITT@DravidianLangTech-EACL2021: Tamil Troll Meme Classification: You need to Pay more Attention

Tamil is a Dravidian language that is commonly used and spoken in the so...

Please sign up or login with your details

Forgot password? Click here to reset