Independent Ethical Assessment of Text Classification Models: A Hate Speech Detection Case Study

07/19/2021
by   Amitoj Singh, et al.
0

An independent ethical assessment of an artificial intelligence system is an impartial examination of the system's development, deployment, and use in alignment with ethical values. System-level qualitative frameworks that describe high-level requirements and component-level quantitative metrics that measure individual ethical dimensions have been developed over the past few years. However, there exists a gap between the two, which hinders the execution of independent ethical assessments in practice. This study bridges this gap and designs a holistic independent ethical assessment process for a text classification model with a special focus on the task of hate speech detection. The assessment is further augmented with protected attributes mining and counterfactual-based analysis to enhance bias assessment. It covers assessments of technical performance, data bias, embedding bias, classification bias, and interpretability. The proposed process is demonstrated through an assessment of a deep hate speech detection model.

READ FULL TEXT
research
07/31/2020

RoboTed: a case study in Ethical Risk Assessment

Risk Assessment is a well known and powerful method for discovering and ...
research
10/09/2020

Case Study: Deontological Ethics in NLP

Recent work in natural language processing (NLP) has focused on ethical ...
research
05/04/2019

Ethically Aligned Design: An empirical evaluation of the RESOLVEDD-strategy in Software and Systems development context

Use of artificial intelligence (AI) in human contexts calls for ethical ...
research
04/26/2023

Towards ethical multimodal systems

The impact of artificial intelligence systems on our society is increasi...
research
11/11/2021

Implementation of Ethically Aligned Design with Ethical User stories in SMART terminal Digitalization project: Use case Passenger Flow

Digitalization and Smart systems are part of our everyday lives today. S...
research
08/03/2023

NBIAS: A Natural Language Processing Framework for Bias Identification in Text

Bias in textual data can lead to skewed interpretations and outcomes whe...
research
05/09/2022

Towards a multi-stakeholder value-based assessment framework for algorithmic systems

In an effort to regulate Machine Learning-driven (ML) systems, current a...

Please sign up or login with your details

Forgot password? Click here to reset