Toxicity in the Decentralized Web and the Potential for Model Sharing

04/27/2022
by   Haris Bin Zia, et al.
0

The "Decentralised Web" (DW) is an evolving concept, which encompasses technologies aimed at providing greater transparency and openness on the web. The DW relies on independent servers (aka instances) that mesh together in a peer-to-peer fashion to deliver a range of services (e.g. micro-blogs, image sharing, video streaming). However, toxic content moderation in this decentralised context is challenging. This is because there is no central entity that can define toxicity, nor a large central pool of data that can be used to build universal classifiers. It is therefore unsurprising that there have been several high-profile cases of the DW being misused to coordinate and disseminate harmful material. Using a dataset of 9.9M posts from 117K users on Pleroma (a popular DW microblogging service), we quantify the presence of toxic content. We find that toxic content is prevalent and spreads rapidly between instances. We show that automating per-instance content moderation is challenging due to the lack of sufficient training data available and the effort required in labelling. We therefore propose and evaluate ModPair, a model sharing system that effectively detects toxic content, gaining an average per-instance macro-F1 score 0.89.

READ FULL TEXT

page 15

page 25

research
03/02/2021

Spam Prevention Using zk-SNARKs for Anonymous Peer-to-Peer Content Sharing Systems

Decentralized unpermissioned peer-to-peer networks are inherently vulner...
research
09/12/2019

Challenges in the Decentralised Web: The Mastodon Case

The Decentralised Web (DW) has recently seen a renewed momentum, with a ...
research
08/11/2022

Design and Evaluation of IPFS: A Storage Layer for the Decentralized Web

Recent years have witnessed growing consolidation of web operations. For...
research
12/17/2022

Farm Environmental Data Analyzer using a Decentralised system and R

Data/Web Hosting is a service that lets enterprises or selves present th...
research
06/22/2019

Efficient Peer-to-Peer Content Sharing for Learning in Virtual Worlds

Virtual world technologies provide new and immersive space for learning,...
research
06/11/2019

Measuring and exploiting the cloud consolidation of the Web

We present measurements showing that the top one million most popular We...
research
03/31/2020

Merkle-CRDTs: Merkle-DAGs meet CRDTs

We study Merkle-DAGs as a transport and persistence layer for Conflict-F...

Please sign up or login with your details

Forgot password? Click here to reset