Selective imitation on the basis of reward function similarity

05/12/2023
by   Max Taylor-Davies, et al.
0

Imitation is a key component of human social behavior, and is widely used by both children and adults as a way to navigate uncertain or unfamiliar situations. But in an environment populated by multiple heterogeneous agents pursuing different goals or objectives, indiscriminate imitation is unlikely to be an effective strategy – the imitator must instead determine who is most useful to copy. There are likely many factors that play into these judgements, depending on context and availability of information. Here we investigate the hypothesis that these decisions involve inferences about other agents' reward functions. We suggest that people preferentially imitate the behavior of others they deem to have similar reward functions to their own. We further argue that these inferences can be made on the basis of very sparse or indirect data, by leveraging an inductive bias toward positing the existence of different groups or types of people with similar reward functions, allowing learners to select imitation targets without direct evidence of alignment.

READ FULL TEXT

page 3

page 5

research
09/20/2020

Addressing reward bias in Adversarial Imitation Learning with neutral reward functions

Generative Adversarial Imitation Learning suffers from the fundamental p...
research
05/25/2021

Hyperparameter Selection for Imitation Learning

We address the issue of tuning hyperparameters (HPs) for imitation learn...
research
04/14/2021

Reward function shape exploration in adversarial imitation learning: an empirical study

For adversarial imitation learning algorithms (AILs), no true rewards ar...
research
02/20/2017

Parent Oriented Teacher Selection Causes Language Diversity

An evolutionary model for emergence of diversity in language is develope...
research
03/25/2023

Embedding Contextual Information through Reward Shaping in Multi-Agent Learning: A Case Study from Google Football

Artificial Intelligence has been used to help human complete difficult t...
research
09/02/2022

Co-Imitation: Learning Design and Behaviour by Imitation

The co-adaptation of robots has been a long-standing research endeavour ...

Please sign up or login with your details

Forgot password? Click here to reset