Exploring the Impact of Tunable Agents in Sequential Social Dilemmas

01/28/2021
by David O'Callaghan, et al.

When developing reinforcement learning agents, the standard approach is to train an agent to converge to a fixed policy that is as close to optimal as possible for a single fixed reward function. If different behaviour is required later, an agent trained in this way must normally be fully or partially retrained, which costs valuable time and resources. In this study, we leverage multi-objective reinforcement learning to create tunable agents, i.e. agents that can adopt a range of different behaviours according to the designer's preferences, without the need for retraining. We apply this technique to sequential social dilemmas, settings where there is an inherent tension between individual and collective rationality. Learning a single fixed policy in such settings leaves an agent at a significant disadvantage if the opponents' strategies change after learning is complete. We demonstrate empirically that the tunable agents framework allows easy adaptation between cooperative and competitive behaviours in sequential social dilemmas, so that a single trained agent model can be adjusted to cater for a wide range of behaviours and opponent strategies.
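The idea of tuning behaviour without retraining can be illustrated with a minimal sketch. It assumes a vector-valued action value that is collapsed to a scalar by a weighted sum over a preference vector; the abstract does not state which scalarisation is used, so this is only one plausible reading. The names `scalarise` and `greedy_action`, the two-component value layout (own payoff, opponent payoff), and all numbers are hypothetical.

```python
def scalarise(value_vector, preference_weights):
    """Collapse a vector-valued quantity into a scalar via a weighted sum."""
    if len(value_vector) != len(preference_weights):
        raise ValueError("value and weight vectors must have the same length")
    return sum(w * v for w, v in zip(preference_weights, value_vector))


def greedy_action(q_vectors, preference_weights):
    """Pick the action whose scalarised value is highest.

    q_vectors maps each action name to a per-objective value vector
    (hypothetical layout: [own payoff, opponent payoff]).
    """
    return max(q_vectors, key=lambda a: scalarise(q_vectors[a], preference_weights))


# Hypothetical learned values, held fixed: only the weights change below.
q_vectors = {
    "cooperate": [2.0, 2.0],
    "defect": [3.0, 0.0],
}

competitive = [1.0, 0.0]   # care only about own payoff
cooperative = [0.5, 0.5]   # weigh both payoffs equally

print(greedy_action(q_vectors, competitive))  # defect
print(greedy_action(q_vectors, cooperative))  # cooperate
```

The same fixed values yield different behaviour purely through the choice of weights, which is the sense in which such an agent is tunable after training.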


