Tracing State-Level Obesity Prevalence from Sentence Embeddings of Tweets: A Feasibility Study

11/26/2019
by   Xiaoyi Zhang, et al.
0

Twitter data has been shown broadly applicable for public health surveillance. Previous public heath studies based on Twitter data have largely relied on keyword-matching or topic models for clustering relevant tweets. However, both methods suffer from the short-length of texts and unpredictable noise that naturally occurs in user-generated contexts. In response, we introduce a deep learning approach that uses hashtags as a form of supervision and learns tweet embeddings for extracting informative textual features. In this case study, we address the specific task of estimating state-level obesity from dietary-related textual features. Our approach yields an estimation that strongly correlates the textual features to government data and outperforms the keyword-matching baseline. The results also demonstrate the potential of discovering risk factors using the textual features. This method is general-purpose and can be applied to a wide range of Twitter-based public health studies.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/22/2017

Computational Content Analysis of Negative Tweets for Obesity, Diet, Diabetes, and Exercise

Social media based digital epidemiology has the potential to support fas...
research
07/05/2020

Detecting Community Depression Dynamics Due to COVID-19 Pandemic in Australia

The recent COVID-19 pandemic has caused unprecedented impact across the ...
research
09/22/2022

Active Keyword Selection to Track Evolving Topics on Twitter

How can we study social interactions on evolving topics at a mass scale?...
research
10/17/2019

Keyphrase Extraction from Disaster-related Tweets

While keyphrase extraction has received considerable attention in recent...
research
12/25/2018

Deep Representation Learning for Clustering of Health Tweets

Twitter has been a prominent social media platform for mining population...
research
10/05/2021

Exploiting Twitter as Source of Large Corpora of Weakly Similar Pairs for Semantic Sentence Embeddings

Semantic sentence embeddings are usually supervisedly built minimizing d...
research
01/28/2017

Feature Studies to Inform the Classification of Depressive Symptoms from Twitter Data for Population Health

The utility of Twitter data as a medium to support population-level ment...

Please sign up or login with your details

Forgot password? Click here to reset