Beyond Rewards: a Hierarchical Perspective on Offline Multiagent Behavioral Analysis

06/17/2022
by Shayegan Omidshafiei, et al.

Each year, expert-level performance is attained in increasingly complex multiagent domains; notable examples include Go, Poker, and StarCraft II. This rapid progression is accompanied by a commensurate need to better understand how such agents attain this performance, in order to enable their safe deployment, identify limitations, and reveal potential means of improving them. In this paper we take a step back from performance-focused multiagent learning and instead turn our attention towards agent behavior analysis. We introduce a model-agnostic method for discovery of behavior clusters in multiagent domains, using variational inference to learn a hierarchy of behaviors at the joint and local agent levels. Our framework makes no assumption about agents' underlying learning algorithms, does not require access to their latent states or policies, and is trained using only offline observational data. We illustrate the effectiveness of our method for enabling the coupled understanding of behaviors at the joint and local agent levels, detecting behavior changepoints throughout training, and discovering core behavioral concepts. We also demonstrate the approach's scalability to a high-dimensional multiagent MuJoCo control domain, and show that it can disentangle previously trained policies in OpenAI's hide-and-seek domain.


