ExtremeBB: Enabling Large-Scale Research into Extremism, the Manosphere and Their Correlation by Online Forum Data

11/08/2021
by   Anh V. Vu, et al.
0

Online extremism is a growing and pernicious problem, and increasingly linked to real-world violence. We introduce a new resource to help research and understand it: ExtremeBB is a structured textual dataset containing nearly 44M posts made by more than 300K registered members on 12 different online extremist forums, enabling both qualitative and quantitative large-scale analyses of historical trends going back two decades. It enables us to trace the evolution of different strands of extremist ideology; to measure levels of toxicity while exploring and developing the tools to do so better; to track the relationships between online subcultures and external political movements such as MAGA and to explore links with misogyny and violence, including radicalisation and recruitment. To illustrate a few potential uses, we apply statistical and data-mining techniques to analyse the online extremist landscape in a variety of ways, from posting patterns through topic modelling to toxicity and the membership overlap across different communities. A picture emerges of communities working as support networks, with complex discussions over a wide variety of topics. The discussions of many topics show a level of disagreement which challenges the perception of homogeneity among these groups. These two features of mutual support and a wide range of attitudes lead us to suggest a more nuanced policy approach than simply shutting down these websites. Censorship might remove the support that lonely and troubled individuals are receiving, and fuel paranoid perceptions that the world is against them, though this must be balanced with other benefits of de-platforming. ExtremeBB can help develop a better understanding of these sub-cultures which may lead to more effective interventions; it also opens up the prospect of research to monitor the effectiveness of any interventions that are undertaken.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/27/2019

To Act or React: Investigating Proactive Strategies For Online Community Moderation

Reddit administrators have generally struggled to prevent or contain suc...
research
07/18/2023

With Flying Colors: Predicting Community Success in Large-scale Collaborative Campaigns

Online communities develop unique characteristics, establish social norm...
research
09/05/2018

A Quantitative Approach to Understanding Online Antisemitism

A new wave of growing antisemitism, driven by fringe Web communities, is...
research
01/21/2020

From Pick-Up Artists to Incels: A Data-Driven Sketch of the Manosphere

Over the past few years, a number of "fringe" online communities have be...
research
03/23/2022

Author Multidisciplinarity and Disciplinary Roles in Field of Study Networks

When studying large research corpora, "distant reading" methods are vita...
research
06/06/2020

StackOverflow vs Kaggle: A Study of Developer Discussions About Data Science

Software developers are increasingly required to understand fundamental ...

Please sign up or login with your details

Forgot password? Click here to reset