Explainable and High-Performance Hate and Offensive Speech Detection

06/26/2022
by   Marzieh Babaeianjelodar, et al.
0

The spread of information through social media platforms can create environments possibly hostile to vulnerable communities and silence certain groups in society. To mitigate such instances, several models have been developed to detect hate and offensive speech. Since detecting hate and offensive speech in social media platforms could incorrectly exclude individuals from social media platforms, which can reduce trust, there is a need to create explainable and interpretable models. Thus, we build an explainable and interpretable high performance model based on the XGBoost algorithm, trained on Twitter data. For unbalanced Twitter data, XGboost outperformed the LSTM, AutoGluon, and ULMFiT models on hate speech detection with an F1 score of 0.75 compared to 0.38 and 0.37, and 0.38 respectively. When we down-sampled the data to three separate classes of approximately 5000 tweets, XGBoost performed better than LSTM, AutoGluon, and ULMFiT; with F1 scores for hate speech detection of 0.79 vs 0.69, 0.77, and 0.66 respectively. XGBoost also performed better than LSTM, AutoGluon, and ULMFiT in the down-sampled version for offensive speech detection with F1 score of 0.83 vs 0.88, 0.82, and 0.79 respectively. We use Shapley Additive Explanations (SHAP) on our XGBoost models' outputs to makes it explainable and interpretable compared to LSTM, AutoGluon and ULMFiT that are black-box models.

READ FULL TEXT
research
04/07/2023

SSS at SemEval-2023 Task 10: Explainable Detection of Online Sexism using Majority Voted Fine-Tuned Transformers

This paper describes our submission to Task 10 at SemEval 2023-Explainab...
research
03/20/2022

Explainable Misinformation Detection Across Multiple Social Media Platforms

In this work, the integration of two machine learning approaches, namely...
research
04/26/2020

Ensemble Deep Learning on Time-Series Representation of Tweets for Rumor Detection in Social Media

Social media is a popular platform for timely information sharing. One o...
research
05/18/2023

What Symptoms and How Long? An Interpretable AI Approach for Depression Detection in Social Media

Depression is the most prevalent and serious mental illness, which induc...
research
12/28/2020

DeepHateExplainer: Explainable Hate Speech Detection in Under-resourced Bengali Language

Exponential growths of social media and micro-blogging sites not only pr...
research
01/31/2022

Detecting False Rumors from Retweet Dynamics on Social Media

False rumors are known to have detrimental effects on society. To preven...

Please sign up or login with your details

Forgot password? Click here to reset