Learning to Play Soccer From Scratch: Sample-Efficient Emergent Coordination through Curriculum-Learning and Competition

03/09/2021
by Pavan Samtani, et al.

This work proposes a scheme for learning complex multi-agent behaviors in a sample-efficient manner, applied to 2v2 soccer. The problem is formulated as a Markov game and solved using deep reinforcement learning. We propose a basic multi-agent extension of TD3 for learning the policy of each player in a decentralized manner. To ease learning, the task of 2v2 soccer is divided into three stages: 1v0, 1v1 and 2v2. Learning in the multi-agent stages (1v1 and 2v2) uses agents trained in a previous stage as fixed opponents. In addition, we propose two techniques that raise performance significantly: experience sharing, which uses experience from a fixed opponent trained in a previous stage to train the agent currently learning, and a form of frame-skipping. Our results show that high-quality soccer play can be obtained with our approach in just under 40M interactions. A summarized video of the resulting gameplay can be found at https://youtu.be/f25l1j1U9RM.
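The three ingredients named in the abstract (a staged curriculum with fixed opponents, experience sharing, and frame-skipping) can be illustrated with a minimal Python sketch. This is not the authors' implementation: `ToyEnv`, `ReplayBuffer`, `frame_skip_step`, `collect`, `run_curriculum`, the placeholder reward, and the skip length of 4 are all hypothetical names and values chosen for illustration. The TD3 update is omitted entirely, and the way the opponent's transitions are shared is one plausible reading of the abstract.

```python
import random
from collections import deque

STAGES = ["1v0", "1v1", "2v2"]  # stage order taken from the abstract


class ReplayBuffer:
    """Fixed-capacity FIFO store of (obs, action, reward, next_obs) tuples."""

    def __init__(self, capacity=10_000):
        self.data = deque(maxlen=capacity)

    def add(self, transition):
        self.data.append(transition)

    def __len__(self):
        return len(self.data)


class ToyEnv:
    """Hypothetical stand-in for the soccer simulator (one scalar observation per player)."""

    def __init__(self, n_players, seed=0):
        self.n_players = n_players
        self.rng = random.Random(seed)
        self.physics_steps = 0

    def observe(self):
        return [self.rng.random() for _ in range(self.n_players)]

    def step(self, actions):
        self.physics_steps += 1
        rewards = [-abs(a) for a in actions]  # placeholder reward, not the paper's
        return self.observe(), rewards


def frame_skip_step(env, actions, skip):
    """Repeat one joint action for `skip` simulator steps and sum the rewards."""
    totals = [0.0] * len(actions)
    obs = None
    for _ in range(skip):
        obs, rewards = env.step(actions)
        totals = [t + r for t, r in zip(totals, rewards)]
    return obs, totals


def collect(env, learner, opponent, buffer, n_decisions, skip=4, share=True):
    """Gather data for the learner (player 0) against a fixed opponent (player 1)."""
    obs = env.observe()
    for _ in range(n_decisions):
        actions = [learner(obs[0]), opponent(obs[1])]
        next_obs, rewards = frame_skip_step(env, actions, skip)
        buffer.add((obs[0], actions[0], rewards[0], next_obs[0]))
        if share:
            # Assumed form of experience sharing: the fixed opponent's
            # transition is stored in the learner's buffer as well.
            buffer.add((obs[1], actions[1], rewards[1], next_obs[1]))
        obs = next_obs
    return buffer


def run_curriculum(train_stage):
    """Train stage by stage; each stage receives the previous stage's frozen policy."""
    frozen = None  # 1v0 has no opponent
    for stage in STAGES:
        frozen = train_stage(stage, frozen)
    return frozen
```

In this sketch, frame-skipping means the policy is queried once per `skip` simulator steps, so fewer agent decisions are needed to cover the same amount of simulated time, and experience sharing doubles the number of transitions stored per decision. Whether the paper counts interactions per decision or per simulator step is not stated in the abstract.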

research
08/18/2020

Learning Complex Multi-Agent Policies in Presence of an Adversary

In recent years, there has been some outstanding work on applying deep r...
research
02/15/2023

TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play

Multi-agent football poses an unsolved challenge in AI research. Existin...
research
09/13/2018

CM3: Cooperative Multi-goal Multi-stage Multi-agent Reinforcement Learning

We propose CM3, a new deep reinforcement learning method for cooperative...
research
02/09/2023

Learning Complex Teamwork Tasks using a Sub-task Curriculum

Training a team to complete a complex task via multi-agent reinforcement...
research
07/13/2023

Layered controller synthesis for dynamic multi-agent systems

In this paper we present a layered approach for multi-agent control prob...
research
03/07/2023

Mastering Strategy Card Game (Legends of Code and Magic) via End-to-End Policy and Optimistic Smooth Fictitious Play

Deep Reinforcement Learning combined with Fictitious Play shows impressi...
research
04/02/2020

Adversarial Reinforcement Learning-based Robust Access Point Coordination Against Uncoordinated Interference

This paper proposes a robust adversarial reinforcement learning (RARL)-b...
