Learning Norms from Stories: A Prior for Value Aligned Agents

12/07/2019
by Spencer Frazier, et al.

Value alignment is a property of an intelligent agent indicating that it can only pursue goals and activities that are beneficial to humans. Traditional approaches to value alignment use imitation learning or preference learning to infer the values of humans by observing their behavior. We introduce a complementary technique in which a value-aligned prior is learned from naturally occurring stories which encode societal norms. Training data is sourced from the children's educational comic strip, Goofus and Gallant. In this work, we train multiple machine learning models to classify natural language descriptions of situations found in the comic strip as normative or non-normative by identifying if they align with the main characters' behavior. We also report the models' performance when transferring to two unrelated tasks with little to no additional training on the new task.
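To make the classification task in the abstract concrete, here is a minimal sketch of labeling a situation description as normative or non-normative. It is not one of the models trained in the paper: a small bag-of-words naive Bayes classifier stands in for them. The training sentences are hypothetical, invented for illustration, and are not taken from Goofus and Gallant or from the paper's dataset. The names `TRAIN`, `tokenize`, `train`, and `classify` are likewise assumptions of this sketch.

```python
import math
from collections import Counter

# Hypothetical toy examples, invented for illustration only.
# They are NOT drawn from the comic strip or the paper's data.
TRAIN = [
    ("shares his toys with his friends", "normative"),
    ("helps his sister clean the kitchen", "normative"),
    ("says thank you after dinner", "normative"),
    ("grabs toys from his friends", "non-normative"),
    ("leaves a mess in the kitchen", "non-normative"),
    ("yells at his sister during dinner", "non-normative"),
]


def tokenize(text):
    """Lowercase and split on whitespace."""
    return text.lower().split()


def train(examples):
    """Count words per label and collect the vocabulary."""
    word_counts = {}
    label_counts = Counter()
    vocab = set()
    for text, label in examples:
        label_counts[label] += 1
        counts = word_counts.setdefault(label, Counter())
        for token in tokenize(text):
            counts[token] += 1
            vocab.add(token)
    return word_counts, label_counts, vocab


def classify(text, model):
    """Return the label with the highest naive Bayes log-score."""
    word_counts, label_counts, vocab = model
    total_examples = sum(label_counts.values())
    best_label, best_score = None, -math.inf
    for label in sorted(label_counts):
        # Log prior for the label.
        score = math.log(label_counts[label] / total_examples)
        label_total = sum(word_counts[label].values())
        for token in tokenize(text):
            if token not in vocab:
                continue  # ignore words never seen in training
            # Laplace (add-one) smoothed log-likelihood.
            score += math.log(
                (word_counts[label][token] + 1) / (label_total + len(vocab))
            )
        if score > best_score:
            best_label, best_score = label, score
    return best_label


model = train(TRAIN)
print(classify("shares his toys", model))
print(classify("yells at his friends", model))
```

A classifier of this kind only illustrates the input and output of the task, a situation description in and a normative or non-normative label out. It says nothing about the accuracy or transfer behavior reported in the paper.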

research
04/02/2020

Improving Confidence in the Estimation of Values and Norms

Autonomous agents (AA) will increasingly be interacting with us in our d...
research
10/18/2021

Value alignment: a formal approach

principles that should govern autonomous AI systems. It essentially stat...
research
05/12/2023

Multi-Value Alignment in Normative Multi-Agent System: Evolutionary Optimisation Approach

Value-alignment in normative multi-agent systems is used to promote a ce...
research
12/02/2020

Value Alignment Verification

As humans interact with autonomous agents to perform increasingly compli...
research
09/01/2022

In conversation with Artificial Intelligence: aligning language models with human values

Large-scale language technologies are increasingly used in various forms...
research
12/01/2021

A General Language Assistant as a Laboratory for Alignment

Given the broad capabilities of large language models, it should be poss...
research
08/23/2023

From Instructions to Intrinsic Human Values – A Survey of Alignment Goals for Big Models

Big models, exemplified by Large Language Models (LLMs), are models typi...
