Decentralized Reinforcement Learning: Global Decision-Making via Local Economic Transactions

07/05/2020
by   Michael Chang, et al.
0

This paper seeks to establish a framework for directing a society of simple, specialized, self-interested agents to solve what traditionally are posed as monolithic single-agent sequential decision problems. What makes it challenging to use a decentralized approach to collectively optimize a central objective is the difficulty in characterizing the equilibrium strategy profile of non-cooperative games. To overcome this challenge, we design a mechanism for defining the learning environment of each agent for which we know that the optimal solution for the global objective coincides with a Nash equilibrium strategy profile of the agents optimizing their own local objectives. The society functions as an economy of agents that learn the credit assignment process itself by buying and selling to each other the right to operate on the environment state. We derive a class of decentralized reinforcement learning algorithms that are broadly applicable not only to standard reinforcement learning but also for selecting options in semi-MDPs and dynamically composing computation graphs. Lastly, we demonstrate the potential advantages of a society's inherent modular structure for more efficient transfer learning.

READ FULL TEXT

page 9

page 16

research
12/15/2021

Finite-Sample Analysis of Decentralized Q-Learning for Stochastic Games

Learning in stochastic games is arguably the most standard and fundament...
research
09/17/2022

MA2QL: A Minimalist Approach to Fully Decentralized Multi-Agent Reinforcement Learning

Decentralized learning has shown great promise for cooperative multi-age...
research
07/07/2022

For Learning in Symmetric Teams, Local Optima are Global Nash Equilibria

Although it has been known since the 1970s that a globally optimal strat...
research
04/20/2023

Inducing Stackelberg Equilibrium through Spatio-Temporal Sequential Decision-Making in Multi-Agent Reinforcement Learning

In multi-agent reinforcement learning (MARL), self-interested agents att...
research
09/29/2021

Adversarial Linear-Quadratic Mean-Field Games over Multigraphs

In this paper, we propose a game between an exogenous adversary and a ne...
research
03/31/2021

Solving Heterogeneous General Equilibrium Economic Models with Deep Reinforcement Learning

General equilibrium macroeconomic models are a core tool used by policym...
research
06/06/2021

Unbiased Self-Play

We present a general optimization framework for emergent belief-state re...

Please sign up or login with your details

Forgot password? Click here to reset