Experience Augmentation: Boosting and Accelerating Off-Policy Multi-Agent Reinforcement Learning

05/19/2020
by Zhenhui Ye, et al.

Exploring the high-dimensional state-action space is one of the biggest challenges in Reinforcement Learning (RL), especially in the multi-agent domain. We present a novel technique called Experience Augmentation, which enables time-efficient and boosted learning based on fast, fair and thorough exploration of the environment. It can be combined with arbitrary off-policy MARL algorithms and is applicable to both homogeneous and heterogeneous environments. We demonstrate our approach by combining it with MADDPG and verifying its performance in two homogeneous environments and one heterogeneous environment. In the best-performing scenario, MADDPG with experience augmentation reaches the convergence reward of vanilla MADDPG in 1/4 of the real (wall-clock) time, and its converged reward exceeds that of the original model by a significant margin. Our ablation studies show that experience augmentation is a crucial ingredient that accelerates the training process and boosts convergence.

research
05/27/2020

Parameter Sharing is Surprisingly Useful for Multi-Agent Deep Reinforcement Learning

"Nonstationarity" is a fundamental problem in cooperative multi-agent re...
research
03/16/2023

SVDE: Scalable Value-Decomposition Exploration for Cooperative Multi-Agent Reinforcement Learning

Value-decomposition methods, which reduce the difficulty of a multi-agen...
research
11/10/2021

PowerGridworld: A Framework for Multi-Agent Reinforcement Learning in Power Systems

We present the PowerGridworld software package to provide users with a l...
research
01/18/2019

WALL-E: An Efficient Reinforcement Learning Research Framework

There are two halves to RL systems: experience collection time and polic...
research
05/04/2023

Simple Noisy Environment Augmentation for Reinforcement Learning

Data augmentation is a widely used technique for improving model perform...
research
07/04/2023

Beyond Conservatism: Diffusion Policies in Offline Multi-agent Reinforcement Learning

We present a novel Diffusion Offline Multi-agent Model (DOM2) for offlin...
research
06/14/2020

Non-local Policy Optimization via Diversity-regularized Collaborative Exploration

Conventional Reinforcement Learning (RL) algorithms usually have one sin...
