Artificial Intelligence, Values and Alignment

01/13/2020
by   Iason Gabriel, et al.
0

This paper looks at philosophical questions that arise in the context of AI alignment. It defends three propositions. First, normative and technical aspects of the AI alignment problem are interrelated, creating space for productive engagement between people working in both domains. Second, it is important to be clear about the goal of alignment. There are significant differences between AI that aligns with instructions, intentions, revealed preferences, ideal preferences, interests and values. A principle-based approach to AI alignment, which combines these elements in a systematic way, has considerable advantages in this context. Third, the central challenge for theorists is not to identify 'true' moral principles for AI; rather, it is to identify fair principles for alignment, that receive reflective endorsement despite widespread variation in people's moral beliefs. The final part of the paper explores three ways in which fair principles for AI alignment could potentially be identified.

READ FULL TEXT
research
01/10/2023

A Multi-Level Framework for the AI Alignment Problem

AI alignment considers how we can encode AI systems in a way that is com...
research
09/10/2023

Decolonial AI Alignment: Viśesadharma, Argument, and Artistic Expression

Prior work has explicated the coloniality of artificial intelligence (AI...
research
12/22/2022

Methodological reflections for AI alignment research using human feedback

The field of artificial intelligence (AI) alignment aims to investigate ...
research
12/09/2022

FAIR AI Models in High Energy Physics

The findable, accessible, interoperable, and reusable (FAIR) data princi...
research
07/27/2023

Designing Fiduciary Artificial Intelligence

A fiduciary is a trusted agent that has the legal duty to act with loyal...
research
03/07/2018

Value Alignment, Fair Play, and the Rights of Service Robots

Ethics and safety research in artificial intelligence is increasingly fr...
research
12/19/2021

Demanding and Designing Aligned Cognitive Architectures

With AI systems becoming more powerful and pervasive, there is increasin...

Please sign up or login with your details

Forgot password? Click here to reset