Reliable Decision from Multiple Subtasks through Threshold Optimization: Content Moderation in the Wild

08/16/2022
by   Donghyun Son, et al.
0

Social media platforms struggle to protect users from harmful content through content moderation. These platforms have recently leveraged machine learning models to cope with the vast amount of user-generated content daily. Since moderation policies vary depending on countries and types of products, it is common to train and deploy the models per policy. However, this approach is highly inefficient, especially when the policies change, requiring dataset re-labeling and model re-training on the shifted data distribution. To alleviate this cost inefficiency, social media platforms often employ third-party content moderation services that provide prediction scores of multiple subtasks, such as predicting the existence of underage personnel, rude gestures, or weapons, instead of directly providing final moderation decisions. However, making a reliable automated moderation decision from the prediction scores of the multiple subtasks for a specific target policy has not been widely explored yet. In this study, we formulate real-world scenarios of content moderation and introduce a simple yet effective threshold optimization method that searches the optimal thresholds of the multiple subtasks to make a reliable moderation decision in a cost-effective way. Extensive experiments demonstrate that our approach shows better performance in content moderation compared to existing threshold optimization methods and heuristics.

READ FULL TEXT
research
11/27/2021

Abusive and Threatening Language Detection in Urdu using Boosting based and BERT based models: A Comparative Approach

Online hatred is a growing concern on many social media platforms. To ad...
research
10/16/2021

DFW-PP: Dynamic Feature Weighting based Popularity Prediction for Social Media Content

The increasing popularity of social media platforms makes it important t...
research
06/01/2022

In the Eye of the Beholder: Robust Prediction with Causal User Modeling

Accurately predicting the relevance of items to users is crucial to the ...
research
05/28/2020

Deceptive Deletions for Protecting Withdrawn Posts on Social Platforms

Over-sharing poorly-worded thoughts and personal information is prevalen...
research
11/11/2022

Bandits for Online Calibration: An Application to Content Moderation on Social Media Platforms

We describe the current content moderation strategy employed by Meta to ...
research
07/01/2021

When Curation Becomes Creation: Algorithms, Microcontent, and the Vanishing Distinction between Platforms and Creators

Ever since social activity on the Internet began migrating from the wild...
research
04/17/2023

Designing Policies for Truth: Combating Misinformation with Transparency and Information Design

Misinformation has become a growing issue on online social platforms (OS...

Please sign up or login with your details

Forgot password? Click here to reset