Cooperative Control of Mobile Robots with Stackelberg Learning

08/03/2020
by Joewie J. Koh, et al.

Multi-robot cooperation requires agents to make decisions that are consistent with the shared goal without disregarding action-specific preferences that might arise from asymmetry in capabilities and individual objectives. To this end, we propose a method named SLiCC: Stackelberg Learning in Cooperative Control. SLiCC models the problem as a partially observable stochastic game composed of Stackelberg bimatrix games, and uses deep reinforcement learning to obtain the payoff matrices associated with these games. Appropriate cooperative actions are then selected using the Stackelberg equilibria derived from these matrices. Using a bi-robot cooperative object transportation problem, we validate the performance of SLiCC against centralized multi-agent Q-learning and demonstrate that SLiCC achieves higher combined utility.
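
To make the idea of selecting actions from a Stackelberg bimatrix game concrete, the following is a minimal sketch and not the authors' implementation. It assumes pure strategies only, a hard-coded pair of hypothetical payoff matrices (in SLiCC these are obtained with deep reinforcement learning), and that the follower breaks ties in the leader's favor. The function name `stackelberg_pure` and the example values are illustrative assumptions.

```python
from typing import Sequence, Tuple


def stackelberg_pure(
    leader_payoff: Sequence[Sequence[float]],
    follower_payoff: Sequence[Sequence[float]],
) -> Tuple[int, int]:
    """Return (leader_action, follower_action) for a pure-strategy
    Stackelberg equilibrium of a bimatrix game.

    Rows index the leader's actions, columns the follower's actions.
    Assumption: the follower breaks ties in the leader's favor.
    """
    best = None
    for i, (a_row, b_row) in enumerate(zip(leader_payoff, follower_payoff)):
        # The follower observes leader action i and best-responds to it.
        top = max(b_row)
        responses = [j for j, b in enumerate(b_row) if b == top]
        # Among the follower's best responses, pick the leader-preferred one.
        j = max(responses, key=lambda col: a_row[col])
        # The leader commits to the action whose induced outcome is best.
        if best is None or a_row[j] > leader_payoff[best[0]][best[1]]:
            best = (i, j)
    return best


# Hypothetical payoffs for two robots with two actions each.
A = [[2, 4], [1, 3]]  # leader payoffs
B = [[1, 0], [0, 2]]  # follower payoffs

leader_action, follower_action = stackelberg_pure(A, B)
print(leader_action, follower_action)
```

In this hypothetical game the leader's first action dominates under simultaneous play, yet committing to the second action induces a follower response that leaves both players better off, which illustrates why a leader-follower structure can help in cooperative settings.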

research
03/14/2020

Deep Multi-Agent Reinforcement Learning for Decentralized Continuous Cooperative Control

Deep multi-agent reinforcement learning (MARL) holds the promise of auto...
research
11/18/2019

Inducing Cooperation via Team Regret Minimization based Multi-Agent Deep Reinforcement Learning

Existing value-factorized based Multi-Agent deep Reinforcement Learning...
research
07/17/2020

Multi-robot Cooperative Object Transportation using Decentralized Deep Reinforcement Learning

Object transportation could be a challenging problem for a single robot ...
research
09/18/2018

SCC-rFMQ Learning in Cooperative Markov Games with Continuous Actions

Although many reinforcement learning methods have been proposed for lear...
research
03/01/2018

Q-CP: Learning Action Values for Cooperative Planning

Research on multi-robot systems has demonstrated promising results in ma...
research
07/19/2020

Multi-Principal Assistance Games

Assistance games (also known as cooperative inverse reinforcement learni...
research
04/22/2021

Independent Reinforcement Learning for Weakly Cooperative Multiagent Traffic Control Problem

The adaptive traffic signal control (ATSC) problem can be modeled as a m...
