Detecting Potentially Harmful and Protective Suicide-related Content on Twitter: A Machine Learning Approach

12/09/2021
by   Hannah Metzler, et al.
34

Research shows that exposure to suicide-related news media content is associated with suicide rates, with some content characteristics likely having harmful and others potentially protective effects. Although good evidence exists for a few selected characteristics, systematic large scale investigations are missing in general, and in particular for social media data. We apply machine learning methods to automatically label large quantities of Twitter data. We developed a novel annotation scheme that classifies suicide-related tweets into different message types and problem- vs. solution-focused perspectives. We then trained a benchmark of machine learning models including a majority classifier, an approach based on word frequency (TF-IDF with a linear SVM) and two state-of-the-art deep learning models (BERT, XLNet). The two deep learning models achieved the best performance in two classification tasks: First, we classified six main content categories, including personal stories about either suicidal ideation and attempts or coping, calls for action intending to spread either problem awareness or prevention-related information, reportings of suicide cases, and other suicide-related and off-topic tweets. The deep learning models reach accuracy scores above 73 69 Second, in separating postings referring to actual suicide from off-topic tweets, they correctly labelled around 88 F1-scores of 93 performances are comparable to the state-of-the-art on similar tasks. By making data labeling more efficient, this work enables future large-scale investigations on harmful and protective effects of various kinds of social media content on suicide rates and on help-seeking behavior.

READ FULL TEXT

page 13

page 17

page 30

page 31

page 32

page 33

page 34

research
09/01/2023

Detecting Suicidality in Arabic Tweets Using Machine Learning and Deep Learning Techniques

Social media platforms have revolutionized traditional communication tec...
research
10/03/2019

Mapping (Dis-)Information Flow about the MH17 Plane Crash

Digital media enables not only fast sharing of information, but also dis...
research
01/09/2021

Eating Garlic Prevents COVID-19 Infection: Detecting Misinformation on the Arabic Content of Twitter

The rapid growth of social media content during the current pandemic pro...
research
07/12/2023

Detecting the Presence of COVID-19 Vaccination Hesitancy from South African Twitter Data Using Machine Learning

Very few social media studies have been done on South African user-gener...
research
07/06/2023

A Novel Site-Agnostic Multimodal Deep Learning Model to Identify Pro-Eating Disorder Content on Social Media

Over the last decade, there has been a vast increase in eating disorder ...
research
02/23/2022

MuMiN: A Large-Scale Multilingual Multimodal Fact-Checked Misinformation Social Network Dataset

Misinformation is becoming increasingly prevalent on social media and in...
research
03/27/2019

Sensing Social Media Signals for Cryptocurrency News

The ability to track and monitor relevant and important news in real-tim...

Please sign up or login with your details

Forgot password? Click here to reset