Statistical Analysis of Perspective Scores on Hate Speech Detection

06/22/2021
by   Hadi Mansourifar, et al.
0

Hate speech detection has become a hot topic in recent years due to the exponential growth of offensive language in social media. It has proven that, state-of-the-art hate speech classifiers are efficient only when tested on the data with the same feature distribution as training data. As a consequence, model architecture plays the second role to improve the current results. In such a diverse data distribution relying on low level features is the main cause of deficiency due to natural bias in data. That's why we need to use high level features to avoid a biased judgement. In this paper, we statistically analyze the Perspective Scores and their impact on hate speech detection. We show that, different hate speech datasets are very similar when it comes to extract their Perspective Scores. Eventually, we prove that, over-sampling the Perspective Scores of a hate speech dataset can significantly improve the generalization performance when it comes to be tested on other hate speech datasets.

READ FULL TEXT
research
06/24/2021

Hate Speech Detection in Clubhouse

With the rise of voice chat rooms, a gigantic resource of data can be ex...
research
08/28/2018

All You Need is "Love": Evading Hate-speech Detection

With the spread of social networks and their unfortunate use for hate sp...
research
09/24/2022

Joint Speech Activity and Overlap Detection with Multi-Exit Architecture

Overlapped speech detection (OSD) is critical for speech applications in...
research
07/04/2023

Robust Hate Speech Detection in Social Media: A Cross-Dataset Empirical Evaluation

The automatic detection of hate speech online is an active research area...
research
06/18/2020

Understanding Anomaly Detection with Deep Invertible Networks through Hierarchies of Distributions and Features

Deep generative networks trained via maximum likelihood on a natural ima...
research
11/27/2019

High- and Low-level image component decomposition using VAEs for improved reconstruction and anomaly detection

Variational Auto-Encoders have often been used for unsupervised pretrain...
research
06/30/2021

Whose Opinions Matter? Perspective-aware Models to Identify Opinions of Hate Speech Victims in Abusive Language Detection

Social media platforms provide users the freedom of expression and a med...

Please sign up or login with your details

Forgot password? Click here to reset