Towards Ethics by Design in Online Abusive Content Detection

10/28/2020
by   Svetlana Kiritchenko, et al.
0

To support safety and inclusion in online communications, significant efforts in NLP research have been put towards addressing the problem of abusive content detection, commonly defined as a supervised classification task. The research effort has spread out across several closely related sub-areas, such as detection of hate speech, toxicity, cyberbullying, etc. There is a pressing need to consolidate the field under a common framework for task formulation, dataset design and performance evaluation. Further, despite current technologies achieving high classification accuracies, several ethical issues have been revealed. We bring ethical issues to forefront and propose a unified framework as a two-step process. First, online content is categorized around personal and identity-related subject matters. Second, severity of abuse is identified through comparative annotation within each category. The novel framework is guided by the Ethics by Design principle and is a step towards building more accurate and trusted models.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/22/2020

Confronting Abusive Language Online: A Survey from the Ethical and Human Rights Perspective

The pervasiveness of abusive content on the internet can lead to severe ...
research
06/27/2022

Which one is more toxic? Findings from Jigsaw Rate Severity of Toxic Comments

The proliferation of online hate speech has necessitated the creation of...
research
06/06/2023

Applying Standards to Advance Upstream Downstream Ethics in Large Language Models

This paper explores how AI-owners can develop safeguards for AI-generate...
research
06/02/2021

Use of Formal Ethical Reviews in NLP Literature: Historical Trends and Current Practices

Ethical aspects of research in language technologies have received much ...
research
03/27/2023

Beyond Toxic: Toxicity Detection Datasets are Not Enough for Brand Safety

The rapid growth in user generated content on social media has resulted ...
research
07/15/2020

The Moral-IT Deck: A Tool for Ethics by Design

This paper presents the design process and empirical evaluation of a new...
research
06/09/2022

CrowdWorkSheets: Accounting for Individual and Collective Identities Underlying Crowdsourced Dataset Annotation

Human annotated data plays a crucial role in machine learning (ML) resea...

Please sign up or login with your details

Forgot password? Click here to reset