Inferring and Learning Multi-Robot Policies by Observing an Expert

09/17/2019
by   Pietro Pierpaoli, et al.
0

In this paper we present a technique for learning how to solve a multi-robot mission that requires interaction with an external environment by repeatedly observing an expert system executing the same mission. We define the expert system as a team of robots equipped with a library of controllers, each designed to solve a specific task, supervised by an expert policy that appropriately selects controllers based on the states of robots and environment. The objective is for an un-trained team of robots equipped with the same library of controllers, but agnostic to the expert policy, to execute the mission, with performances comparable to those of the expert system. From observations of the expert system, the Interactive Multiple Model technique is used to estimate individual controllers executed by the expert policy. Then, the history of estimated controllers and environmental state is used to learn a policy for the un-trained robots. Considering a perimeter protection scenario on a team of simulated differential-drive robots, we show that the learned policy endows the un-trained team with performances comparable to those of the expert system.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/12/2019

A Reinforcement Learning Framework for Sequencing Multi-Robot Behaviors

Given a list of behaviors and associated parameterized controllers for s...
research
02/14/2019

Multi-Objective Policy Generation for Multi-Robot Systems Using Riemannian Motion Policies

In the multi-robot systems literature, control policies are typically ob...
research
05/03/2023

Decentralised Active Perception in Continuous Action Spaces for the Coordinated Escort Problem

We consider the coordinated escort problem, where a decentralised team o...
research
09/28/2018

Using Deep Reinforcement Learning to Learn High-Level Policies on the ATRIAS Biped

Learning controllers for bipedal robots is a challenging problem, often ...
research
09/03/2020

Fast Byzantine Gathering with Visibility in Graphs

We consider the gathering task by a team of m synchronous mobile robots ...
research
01/30/2018

Learning to Emulate an Expert Projective Cone Scheduler

Projective cone scheduling defines a large class of rate-stabilizing pol...
research
11/14/2014

Learning Fuzzy Controllers in Mobile Robotics with Embedded Preprocessing

The automatic design of controllers for mobile robots usually requires t...

Please sign up or login with your details

Forgot password? Click here to reset