AI Alignment Dialogues: An Interactive Approach to AI Alignment in Support Agents

01/16/2023
by   Pei-Yu Chen, et al.
0

AI alignment is about ensuring AI systems only pursue goals and activities that are beneficial to humans. Most of the current approach to AI alignment is to learn what humans value from their behavioural data. This paper proposes a different way of looking at the notion of alignment, namely by introducing AI Alignment Dialogues: dialogues with which users and agents try to achieve and maintain alignment via interaction. We argue that alignment dialogues have a number of advantages in comparison to data-driven approaches, especially for behaviour support agents, which aim to support users in achieving their desired future behaviours rather than their current behaviours. The advantages of alignment dialogues include allowing the users to directly convey higher-level concepts to the agent, and making the agent more transparent and trustworthy. In this paper we outline the concept and high-level structure of alignment dialogues. Moreover, we conducted a qualitative focus group user study from which we developed a model that describes how alignment dialogues affect users, and created design suggestions for AI alignment dialogues. Through this we establish foundations for AI alignment dialogues and shed light on what requires further development and research.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/10/2023

A Multi-Level Framework for the AI Alignment Problem

AI alignment considers how we can encode AI systems in a way that is com...
research
05/09/2022

Aligned with Whom? Direct and social goals for AI systems

As artificial intelligence (AI) becomes more powerful and widespread, th...
research
04/03/2017

Brief Notes on Hard Takeoff, Value Alignment, and Coherent Extrapolated Volition

I make some basic observations about hard takeoff, value alignment, and ...
research
08/03/2023

VisAlign: Dataset for Measuring the Degree of Alignment between AI and Humans in Visual Perception

AI alignment refers to models acting towards human-intended goals, prefe...
research
10/22/2020

Migratable AI : Investigating users' affect on identity and information migration of a conversational AI agent

Conversational AI agents are becoming ubiquitous and provide assistance ...
research
06/23/2023

Exploring Qualitative Research Using LLMs

The advent of AI driven large language models (LLMs) have stirred discus...
research
01/12/2022

The Concept of Criticality in AI Safety

When AI agents don't align their actions with human values they may caus...

Please sign up or login with your details

Forgot password? Click here to reset