Automated Content Moderation Increases Adherence to Community Guidelines

by Manoel Horta Ribeiro, et al.

Online social media platforms use automated moderation systems to remove or reduce the visibility of rule-breaking content. While previous work has documented the importance of manual content moderation, the effects of automated content moderation remain largely unknown, in part due to the technical and ethical challenges in assessing their impact using randomized experiments. Here, in a large study of Facebook comments (n=412M), we used a fuzzy regression discontinuity design to measure the impact of automated content moderation on subsequent rule-breaking behavior (number of comments hidden or deleted) and engagement (number of additional comments posted). We found that comment deletion decreased subsequent rule-breaking behavior in shorter threads (20 or fewer comments), even among other participants, suggesting that the intervention prevented conversations from derailing. Further, the effect of deletion on the affected user's subsequent rule-breaking behavior was longer-lived than its effect on reducing commenting in general, suggesting that users were deterred from rule-breaking but not from continuing to comment. However, hiding (rather than deleting) content had small and statistically insignificant effects. Overall, our results suggest that automated content moderation can increase adherence to community guidelines.




