Correlating Twitter Language with Community-Level Health Outcomes

06/13/2019
by Arno Schneuwly, et al.

We study how language on social media is linked to diseases such as atherosclerotic heart disease (AHD), diabetes and various types of cancer. Our proposed model leverages state-of-the-art sentence embeddings, followed by a regression model and clustering, without the need for additional labelled data. It allows us to predict community-level medical outcomes from language, and thereby potentially to translate these predictions to the individual level. The method is applicable to a wide range of target variables and allows us to discover known and potentially novel correlations of medical outcomes with lifestyle aspects and other socioeconomic risk factors.
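
The pipeline outlined in the abstract (sentence embeddings, then a regression model, then clustering) can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration, not the authors' implementation: the embeddings are random stand-ins for the output of a pretrained sentence encoder, mean pooling is used to aggregate tweets per community, ridge regression stands in for "a regression model", a basic k-means stands in for "clustering", and the names `fit_ridge` and `kmeans` are hypothetical helpers.

```python
import numpy as np

rng = np.random.default_rng(0)

# Assumed toy data: 40 communities, 50 tweets each, 16-dim embeddings.
# Real embeddings would come from a pretrained sentence encoder.
n_communities, tweets_per_community, dim = 40, 50, 16
tweet_embeddings = rng.normal(size=(n_communities, tweets_per_community, dim))

# Step 1: aggregate tweets to one feature vector per community (mean pooling).
community_features = tweet_embeddings.mean(axis=1)

# Synthetic community-level outcome (placeholder for a real health statistic).
true_weights = rng.normal(size=dim)
outcome = community_features @ true_weights + 0.01 * rng.normal(size=n_communities)


def fit_ridge(X, y, lam=1e-3):
    """Closed-form ridge regression: w = (X^T X + lam * I)^{-1} X^T y."""
    return np.linalg.solve(X.T @ X + lam * np.eye(X.shape[1]), X.T @ y)


def kmeans(X, k, n_iter=20, seed=0):
    """Plain Lloyd's k-means; returns labels and cluster centers."""
    r = np.random.default_rng(seed)
    centers = X[r.choice(len(X), size=k, replace=False)]
    labels = np.zeros(len(X), dtype=int)
    for _ in range(n_iter):
        dists = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = dists.argmin(axis=1)
        for j in range(k):
            members = X[labels == j]
            if len(members) > 0:  # keep the old center if a cluster empties
                centers[j] = members.mean(axis=0)
    return labels, centers


# Step 2: fit the regression on community-level features.
weights = fit_ridge(community_features, outcome)

# Step 3: score individual tweets with the community-level model,
# then cluster the tweets and inspect the mean predicted score per cluster.
flat_embeddings = tweet_embeddings.reshape(-1, dim)
tweet_scores = flat_embeddings @ weights
n_clusters = 5
labels, centers = kmeans(flat_embeddings, n_clusters)
cluster_mean_score = np.array([
    tweet_scores[labels == j].mean() if np.any(labels == j) else np.nan
    for j in range(n_clusters)
])
```

In this sketch, clusters with high or low mean predicted scores would be the candidates to inspect for topics associated with the outcome. No additional labels are needed beyond the community-level target, which matches the setting the abstract describes.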


research
08/29/2018

The Remarkable Benefit of User-Level Aggregation for Lexical-based Population-Level Predictions

Nowcasting based on social media text promises to provide unobtrusive an...
research
08/28/2018

Residualized Factor Adaptation for Community Social Media Prediction Tasks

Predictive models over social media language have shown promise in captu...
research
03/01/2021

The Healthy States of America: Creating a Health Taxonomy with Social Media

Since the uptake of social media, researchers have mined online discussi...
research
03/11/2016

Towards using social media to identify individuals at risk for preventable chronic illness

We describe a strategy for the acquisition of training data necessary to...
research
04/14/2020

Quantifying Community Characteristics of Maternal Mortality Using Social Media

While most mortality rates have decreased in the US, maternal mortality ...
research
05/23/2020

From Witch's Shot to Music Making Bones – Resources for Medical Laymen to Technical Language and Vice Versa

Many people share information in social media or forums, like food they ...
