ERMAS: Becoming Robust to Reward Function Sim-to-Real Gaps in Multi-Agent Simulations

06/10/2021
by   Eric Zhao, et al.
0

Multi-agent simulations provide a scalable environment for learning policies that interact with rational agents. However, such policies may fail to generalize to the real-world where agents may differ from simulated counterparts due to unmodeled irrationality and misspecified reward functions. We introduce Epsilon-Robust Multi-Agent Simulation (ERMAS), a robust optimization framework for learning AI policies that are robust to such multiagent sim-to-real gaps. While existing notions of multi-agent robustness concern perturbations in the actions of agents, we address a novel robustness objective concerning perturbations in the reward functions of agents. ERMAS provides this robustness by anticipating suboptimal behaviors from other agents, formalized as the worst-case epsilon-equilibrium. We show empirically that ERMAS yields robust policies for repeated bimatrix games and optimal taxation problems in economic simulations. In particular, in the two-level RL problem posed by the AI Economist (Zheng et al., 2020) ERMAS learns tax policies that are robust to changes in agent risk aversion, improving social welfare by up to 15

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/09/2023

Robustness Testing for Multi-Agent Reinforcement Learning: State Perturbations on Critical Agents

Multi-Agent Reinforcement Learning (MARL) has been widely applied in man...
research
12/06/2022

What is the Solution for State-Adversarial Multi-Agent Reinforcement Learning?

Various methods for Multi-Agent Reinforcement Learning (MARL) have been ...
research
06/09/2021

Deception in Social Learning: A Multi-Agent Reinforcement Learning Perspective

Within the framework of Multi-Agent Reinforcement Learning, Social Learn...
research
06/23/2022

A Fast Algorithm for Robust Action Selection in Multi-Agent Systems

In this paper, we consider a robust action selection problem in multi-ag...
research
09/16/2016

A Formal Solution to the Grain of Truth Problem

A Bayesian agent acting in a multi-agent environment learns to predict t...
research
10/31/2019

Learning Fairness in Multi-Agent Systems

Fairness is essential for human society, contributing to stability and p...
research
04/28/2020

The AI Economist: Improving Equality and Productivity with AI-Driven Tax Policies

Tackling real-world socio-economic challenges requires designing and tes...

Please sign up or login with your details

Forgot password? Click here to reset