Value alignment: a formal approach

10/18/2021
by   Carles Sierra, et al.
0

principles that should govern autonomous AI systems. It essentially states that a system's goals and behaviour should be aligned with human values. But how to ensure value alignment? In this paper we first provide a formal model to represent values through preferences and ways to compute value aggregations; i.e. preferences with respect to a group of agents and/or preferences with respect to sets of values. Value alignment is then defined, and computed, for a given norm with respect to a given value through the increase/decrease that it results in the preferences of future states of the world. We focus on norms as it is norms that govern behaviour, and as such, the alignment of a given system with a given value will be dictated by the norms the system follows.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/16/2020

Value Alignment Equilibrium in Multiagent Systems

Value alignment has emerged in recent years as a basic principle to prod...
research
05/12/2023

Multi-Value Alignment in Normative Multi-Agent System: Evolutionary Optimisation Approach

Value-alignment in normative multi-agent systems is used to promote a ce...
research
02/17/2023

Value Engineering for Autonomous Agents

Machine Ethics (ME) is concerned with the design of Artificial Moral Age...
research
12/02/2020

Value Alignment Verification

As humans interact with autonomous agents to perform increasingly compli...
research
12/07/2019

Learning Norms from Stories: A Prior for Value Aligned Agents

Value alignment is a property of an intelligent agent indicating that it...
research
08/01/2023

Collaborative filtering to capture AI user's preferences as norms

Customising AI technologies to each user's preferences is fundamental to...
research
07/13/2023

Towards a resolution of the spin alignment problem

Consider minimizing the entropy of a mixture of states by choosing each ...

Please sign up or login with your details

Forgot password? Click here to reset