Multi-agent Deep Reinforcement Learning with Extremely Noisy Observations

12/03/2018
by   Ozsel Kilinc, et al.
8

Multi-agent reinforcement learning systems aim to provide interacting agents with the ability to collaboratively learn and adapt to the behaviour of other agents. In many real-world applications, the agents can only acquire a partial view of the world. Here we consider a setting whereby most agents' observations are also extremely noisy, hence only weakly correlated to the true state of the environment. Under these circumstances, learning an optimal policy becomes particularly challenging, even in the unrealistic case that an agent's policy can be made conditional upon all other agents' observations. To overcome these difficulties, we propose a multi-agent deep deterministic policy gradient algorithm enhanced by a communication medium (MADDPG-M), which implements a two-level, concurrent learning mechanism. An agent's policy depends on its own private observations as well as those explicitly shared by others through a communication medium. At any given point in time, an agent must decide whether its private observations are sufficiently informative to be shared with others. However, our environments provide no explicit feedback informing an agent whether a communication action is beneficial, rather the communication policies must also be learned through experience concurrently to the main policies. Our experimental results demonstrate that the algorithm performs well in six highly non-stationary environments of progressively higher complexity, and offers substantial performance gains compared to the baselines.

READ FULL TEXT

page 7

page 13

research
10/05/2019

Attention-based Fault-tolerant Approach for Multi-agent Reinforcement Learning Systems

The aim of multi-agent reinforcement learning systems is to provide inte...
research
01/12/2019

Improving Coordination in Multi-Agent Deep Reinforcement Learning through Memory-driven Communication

Deep reinforcement learning algorithms have recently been used to train ...
research
09/14/2021

GALOPP: Multi-Agent Deep Reinforcement Learning For Persistent Monitoring With Localization Constraints

Persistently monitoring a region under localization and communication co...
research
12/21/2017

A Deep Policy Inference Q-Network for Multi-Agent Systems

We present DPIQN, a deep policy inference Q-network that targets multi-a...
research
02/17/2021

Distributed Fair Scheduling for Information Exchange in Multi-Agent Systems

Information exchange is a crucial component of many real-world multi-age...
research
03/06/2017

Context-Based Concurrent Experience Sharing in Multiagent Systems

One of the key challenges for multi-agent learning is scalability. In th...
research
05/10/2023

Fast Teammate Adaptation in the Presence of Sudden Policy Change

In cooperative multi-agent reinforcement learning (MARL), where an agent...

Please sign up or login with your details

Forgot password? Click here to reset