Discovering Individual Rewards in Collective Behavior through Inverse Multi-Agent Reinforcement Learning

05/17/2023
by   Daniel Waelchli, et al.
0

The discovery of individual objectives in collective behavior of complex dynamical systems such as fish schools and bacteria colonies is a long-standing challenge. Inverse reinforcement learning is a potent approach for addressing this challenge but its applicability to dynamical systems, involving continuous state-action spaces and multiple interacting agents, has been limited. In this study, we tackle this challenge by introducing an off-policy inverse multi-agent reinforcement learning algorithm (IMARL). Our approach combines the ReF-ER techniques with guided cost learning. By leveraging demonstrations, our algorithm automatically uncovers the reward function and learns an effective policy for the agents. Through extensive experimentation, we demonstrate that the proposed policy captures the behavior observed in the provided data, and achieves promising results across problem domains including single agent models in the OpenAI gym and multi-agent models of schooling behavior. The present study shows that the proposed IMARL algorithm is a significant step towards understanding collective dynamics from the perspective of its constituents, and showcases its value as a tool for studying complex physical systems exhibiting collective behaviour.

READ FULL TEXT
research
01/27/2020

Regret Bounds for Decentralized Learning in Cooperative Multi-Agent Dynamical Systems

Regret analysis is challenging in Multi-Agent Reinforcement Learning (MA...
research
02/21/2018

Learning to Gather without Communication

A standard belief on emerging collective behavior is that it emerges fro...
research
02/17/2016

Inverse Reinforcement Learning in Swarm Systems

Inverse reinforcement learning (IRL) has become a useful tool for learni...
research
05/13/2019

Physically-interpretable classification of network dynamics for complex collective motions

Understanding complex network dynamics is a fundamental issue in various...
research
12/11/2022

Random Feature Models for Learning Interacting Dynamical Systems

Particle dynamics and multi-agent systems provide accurate dynamical mod...
research
01/17/2023

Show me what you want: Inverse reinforcement learning to automatically design robot swarms by demonstration

Automatic design is a promising approach to generating control software ...
research
04/15/2021

Collective Iterative Learning Control: Exploiting Diversity in Multi-Agent Systems for Reference Tracking Tasks

This paper considers a group of autonomous agents learning to track the ...

Please sign up or login with your details

Forgot password? Click here to reset