Deep Representation Learning for Clustering of Health Tweets

12/25/2018
by Oguzhan Gencoglu, et al.

Twitter has been a prominent social media platform for mining population-level health data, and accurate clustering of health-related tweets into topics is important for extracting relevant health insights. In this work, we propose deep convolutional autoencoders for learning compact representations of health-related tweets, which are then used for clustering. We compare our method to several conventional tweet representation methods, including bag-of-words, term frequency-inverse document frequency, Latent Dirichlet Allocation and Non-negative Matrix Factorization, using 3 different clustering algorithms. Our results show that clustering performance with the proposed representation learning scheme significantly outperforms that of the conventional methods in all experiments, across different numbers of clusters. In addition, we propose a constraint on the learned representations during neural network training to further enhance clustering performance. Overall, this study introduces the use of deep neural network-based architectures, i.e., deep convolutional autoencoders, for learning informative representations of health-related tweets.
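
To make the kind of conventional baseline named in the abstract concrete, the following is a minimal sketch of a term frequency-inverse document frequency representation followed by clustering. It is not the authors' pipeline: the toy tweets, the whitespace tokenisation, the smoothed IDF formula, the helper names `tfidf_vectors` and `kmeans`, and the choice of k-means as the clustering algorithm are all assumptions made for illustration, since the abstract does not name the 3 clustering algorithms.

```python
import math
import random
from collections import Counter


def tfidf_vectors(docs):
    """Return (vocabulary, L2-normalised TF-IDF vectors) for raw text docs.

    Assumption: whitespace tokenisation and a smoothed IDF; the paper's
    actual preprocessing is not described in the abstract.
    """
    tokenised = [doc.lower().split() for doc in docs]
    vocab = sorted({tok for toks in tokenised for tok in toks})
    n_docs = len(tokenised)
    # Document frequency: number of documents that contain each term.
    df = Counter(tok for toks in tokenised for tok in set(toks))
    idf = {tok: math.log((1 + n_docs) / (1 + df[tok])) + 1.0 for tok in vocab}
    vectors = []
    for toks in tokenised:
        tf = Counter(toks)
        vec = [tf[tok] * idf[tok] for tok in vocab]
        norm = math.sqrt(sum(v * v for v in vec)) or 1.0  # guard empty docs
        vectors.append([v / norm for v in vec])
    return vocab, vectors


def kmeans(vectors, k, n_iter=20, seed=0):
    """Plain Lloyd's k-means; returns one cluster label per vector."""
    rng = random.Random(seed)
    centroids = [list(v) for v in rng.sample(vectors, k)]
    labels = [0] * len(vectors)
    for _ in range(n_iter):
        # Assignment step: nearest centroid by squared Euclidean distance.
        for i, v in enumerate(vectors):
            labels[i] = min(
                range(k),
                key=lambda c: sum((a - b) ** 2 for a, b in zip(v, centroids[c])),
            )
        # Update step: move each centroid to the mean of its members.
        for c in range(k):
            members = [v for v, lab in zip(vectors, labels) if lab == c]
            if members:
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels


# Hypothetical toy tweets, invented for this sketch only.
docs = [
    "flu shot clinic open today",
    "flu shot clinic open today",
    "new diet and exercise tips",
    "exercise tips for a healthy diet",
]
vocab, vectors = tfidf_vectors(docs)
labels = kmeans(vectors, k=2)
print(labels)
```

In the proposed approach, the hand-crafted `tfidf_vectors` step would be replaced by the representation produced by a trained convolutional autoencoder, while the clustering step stays the same.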

research
09/22/2017

Computational Content Analysis of Negative Tweets for Obesity, Diet, Diabetes, and Exercise

Social media based digital epidemiology has the potential to support fas...
research
09/27/2021

Evaluation of Non-Negative Matrix Factorization and n-stage Latent Dirichlet Allocation for Emotion Analysis in Turkish Tweets

With the development of technology, the use of social media has become q...
research
11/19/2019

Event detection in Colombian security Twitter news using fine-grained latent topic analysis

Cultural and social dynamics are important concepts that must be underst...
research
11/03/2020

Mixing Consistent Deep Clustering

Finding well-defined clusters in data represents a fundamental challenge...
research
10/24/2019

Learning eating environments through scene clustering

It is well known that dietary habits have a significant influence on hea...
research
11/26/2019

Tracing State-Level Obesity Prevalence from Sentence Embeddings of Tweets: A Feasibility Study

Twitter data has been shown broadly applicable for public health surveil...
research
06/12/2023

Deep denoising autoencoder-based non-invasive blood flow detection for arteriovenous fistula

Clinical guidelines underscore the importance of regularly monitoring an...
