Frequency-Based Patrolling with Heterogeneous Agents and Limited Communication

02/07/2014
by Tao Mao, et al.

This paper investigates multi-agent, frequency-based patrolling of intersecting circle graphs under conditions where graph nodes have non-uniform visitation requirements and agents have limited ability to communicate. The task is modeled as a partially observable Markov decision process, and a reinforcement learning solution is developed. Each agent generates its own policy from Markov chains, and policies are exchanged only when agents occupy the same or adjacent nodes. This constraint on policy exchange models sparse communication conditions over large, unstructured environments. Empirical results provide perspectives on convergence properties, agent cooperation, and generalization of learned patrolling policies to new instances of the task. The emergent behavior indicates that heterogeneous agents learn coordination strategies for patrolling large, unstructured regions and that the learned policies generalize to dynamic variation in node visitation requirements.
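The communication constraint described above (policies exchanged only when agents occupy the same or adjacent nodes) can be illustrated with a minimal sketch. It assumes a simplification that is not taken from the paper: a single cycle whose nodes are numbered 0 to n_nodes - 1, rather than intersecting circle graphs. The function name `can_exchange` and its parameters are hypothetical and chosen only for illustration.

```python
def can_exchange(node_a: int, node_b: int, n_nodes: int) -> bool:
    """Return True if two agents may exchange policies.

    Assumption (not from the paper): both agents live on one cycle with
    nodes 0..n_nodes-1, and "adjacent" means one step apart on that cycle.
    """
    # Distance going one way around the cycle.
    gap = abs(node_a - node_b) % n_nodes
    # The shorter of the two ways around the cycle.
    hop_distance = min(gap, n_nodes - gap)
    # Same node (0 hops) or neighboring node (1 hop) permits an exchange.
    return hop_distance <= 1


if __name__ == "__main__":
    # Agents at nodes 0 and 5 on a 6-node cycle are neighbors via wrap-around.
    print(can_exchange(0, 5, 6))
    # Agents at nodes 0 and 3 are on opposite sides of the cycle.
    print(can_exchange(0, 3, 6))
```

Under this check, agents that rarely meet rarely share policies, which is one way to picture the sparse communication conditions the abstract describes.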


