Ad hoc Teamwork and Moral Feedback as a Framework for Safe Agent Behavior

09/20/2018
by Shani Alkoby, et al.

As technology develops, it is only a matter of time before agents are capable of long-term autonomy, i.e., will need to choose their own actions over long periods of time. In many cases, such agents cannot be coordinated in advance with every other agent they may interact with. Instead, they will need to cooperate to accomplish unanticipated joint goals without pre-coordination. As a result, the "ad hoc teamwork" problem, in which teammates must work together toward a common goal without any prior agreement on how to do so, has emerged as a recent area of study in the AI literature. To date, however, no attention has been given to the moral aspect of the agents' behavior, which is required to ensure that the effects of their actions on other agents conform to social norms. In this research, we introduce the M-TAMER framework (a novel variant of TAMER), which is used to teach agents to act in accordance with human morality with respect to their teammates. In a hybrid team (agents and people), an agent that takes an action considered morally unacceptable receives negative feedback from its human teammate(s). Using M-TAMER, agents learn to act more consistently with human morality.

