Game-theoretical control with continuous action sets

12/01/2014
by   Steven Perkins, et al.
0

Motivated by the recent applications of game-theoretical learning techniques to the design of distributed control systems, we study a class of control problems that can be formulated as potential games with continuous action sets, and we propose an actor-critic reinforcement learning algorithm that provably converges to equilibrium in this class of problems. The method employed is to analyse the learning process under study through a mean-field dynamical system that evolves in an infinite-dimensional function space (the space of probability distributions over the players' continuous controls). To do so, we extend the theory of finite-dimensional two-timescale stochastic approximation to an infinite-dimensional, Banach space setting, and we prove that the continuous dynamics of the process converge to equilibrium in the case of potential games. These results combine to give a provably-convergent learning algorithm in which players do not need to keep track of the controls selected by the other agents.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/19/2023

Deep Reinforcement Learning for Infinite Horizon Mean Field Problems in Continuous Spaces

We present the development and analysis of a reinforcement learning (RL)...
research
06/29/2018

Learning with minimal information in continuous games

We introduce a stochastic learning process called the dampened gradient ...
research
02/10/2020

Q-Learning for Mean-Field Controls

Multi-agent reinforcement learning (MARL) has been applied to many chall...
research
05/30/2019

Reinforcement Learning for Mean Field Game

Stochastic games provide a framework for interactions among multi-agents...
research
06/12/2020

Continuous Control for Searching and Planning with a Learned Model

Decision-making agents with planning capabilities have achieved huge suc...
research
09/13/2022

Independent Learning in Mean-Field Games: Satisficing Paths and Convergence to Subjective Equilibria

Independent learners are learning agents that naively employ single-agen...
research
07/25/2022

Flowsheet synthesis through hierarchical reinforcement learning and graph neural networks

Process synthesis experiences a disruptive transformation accelerated by...

Please sign up or login with your details

Forgot password? Click here to reset