Value-based CTDE Methods in Symmetric Two-team Markov Game: from Cooperation to Team Competition

11/21/2022
by Pascal Leroy, et al.

In this paper, we identify the best learning scenario for training a team of agents to compete against multiple possible strategies of opposing teams. We evaluate cooperative value-based methods in a mixed cooperative-competitive environment. We restrict ourselves to the case of a symmetric, partially observable, two-team Markov game. We selected three training methods based on the centralised training and decentralised execution (CTDE) paradigm: QMIX, MAVEN and QVMix. For each method, we considered three learning scenarios differentiated by the variety of team policies encountered during training. For our experiments, we modified the StarCraft Multi-Agent Challenge environment to create competitive environments where both teams could learn and compete simultaneously. Our results suggest that training against multiple evolving strategies achieves the best results when teams are scored against several opposing strategies.

research
10/29/2021

Mixed Cooperative-Competitive Communication Using Multi-Agent Reinforcement Learning

By using communication between multiple agents in multi-agent environmen...
research
11/18/2019

Inducing Cooperation via Team Regret Minimization based Multi-Agent Deep Reinforcement Learning

Existing value-factorized based Multi-Agent deep Reinforcement Learning...
research
07/14/2023

Optimal Symmetric Strategies in Multi-Agent Systems with Decentralized Information

We consider a cooperative multi-agent system consisting of a team of age...
research
06/16/2020

Evaluating and Rewarding Teamwork Using Cooperative Game Abstractions

Can we predict how well a team of individuals will perform together? How...
research
04/05/2011

Evolving Pacing Strategies for Team Pursuit Track Cycling

Team pursuit track cycling is a bicycle racing sport held on velodromes ...
research
06/04/2022

Estimating the Effect of Team Hitting Strategies Using Counterfactual Virtual Simulation in Baseball

In baseball, every play on the field is quantitatively evaluated and has...
research
10/29/2010

Analysing the behaviour of robot teams through relational sequential pattern mining

This report outlines the use of a relational representation in a Multi-A...
