Minimum Levels of Interpretability for Artificial Moral Agents

07/02/2023
by   Avish Vijayaraghavan, et al.
0

As artificial intelligence (AI) models continue to scale up, they are becoming more capable and integrated into various forms of decision-making systems. For models involved in moral decision-making, also known as artificial moral agents (AMA), interpretability provides a way to trust and understand the agent's internal reasoning mechanisms for effective use and error correction. In this paper, we provide an overview of this rapidly-evolving sub-field of AI interpretability, introduce the concept of the Minimum Level of Interpretability (MLI) and recommend an MLI for various types of agents, to aid their safe deployment in real-world settings.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/09/2019

On the Semantic Interpretability of Artificial Intelligence Models

Artificial Intelligence models are becoming increasingly more powerful a...
research
12/21/2022

Circumventing interpretability: How to defeat mind-readers

The increasing capabilities of artificial intelligence (AI) systems make...
research
02/01/2021

Human Perceptions on Moral Responsibility of AI: A Case Study in AI-Assisted Bail Decision-Making

How to attribute responsibility for autonomous artificial intelligence (...
research
08/13/2021

TDM: Trustworthy Decision-Making via Interpretability Enhancement

Human-robot interactive decision-making is increasingly becoming ubiquit...
research
10/07/2022

1st ICLR International Workshop on Privacy, Accountability, Interpretability, Robustness, Reasoning on Structured Data (PAIR^2Struct)

Recent years have seen advances on principles and guidance relating to a...
research
09/05/2020

Generalization on the Enhancement of Layerwise Relevance Interpretability of Deep Neural Network

The practical application of deep neural networks are still limited by t...
research
05/08/2022

Introduction to Soar

This paper is the recommended initial reading for a functional overview ...

Please sign up or login with your details

Forgot password? Click here to reset