Learning Observation-Based Certifiable Safe Policy for Decentralized Multi-Robot Navigation

09/16/2021
by   Yuxiang Cui, et al.
0

Safety is of great importance in multi-robot navigation problems. In this paper, we propose a control barrier function (CBF) based optimizer that ensures robot safety with both high probability and flexibility, using only sensor measurement. The optimizer takes action commands from the policy network as initial values and then provides refinement to drive the potentially dangerous ones back into safe regions. With the help of a deep transition model that predicts the evolution of surrounding dynamics and the consequences of different actions, the CBF module can guide the optimization in a reasonable time horizon. We also present a novel joint training framework that improves the cooperation between the Reinforcement Learning (RL) based policy and the CBF-based optimizer both in training and inference procedures by utilizing reward feedback from the CBF module. We observe that the policy using our method can achieve a higher success rate while maintaining the safety of multiple robots in significantly fewer episodes compared with other methods. Experiments are conducted in multiple scenarios both in simulation and the real world, the results demonstrate the effectiveness of our method in maintaining the safety of multi-robot navigation. Code is available at <https://github.com/YuxiangCui/MARL-OCBF>

READ FULL TEXT

page 1

page 6

research
11/08/2020

Learning World Transition Model for Socially Aware Robot Navigation

Moving in dynamic pedestrian environments is one of the important requir...
research
09/17/2021

Decentralized Global Connectivity Maintenance for Multi-Robot Navigation: A Reinforcement Learning Approach

The problem of multi-robot navigation of connectivity maintenance is cha...
research
12/06/2022

Safe Inverse Reinforcement Learning via Control Barrier Function

Learning from Demonstration (LfD) is a powerful method for enabling robo...
research
11/08/2019

Mapless Navigation among Dynamics with Social-safety-awareness: a reinforcement learning approach from 2D laser scans

We propose a method to tackle the problem of mapless collision-avoidance...
research
11/15/2018

Intervention Aided Reinforcement Learning for Safe and Practical Policy Optimization in Navigation

Combining deep neural networks with reinforcement learning has shown gre...
research
03/11/2020

Multiplicative Controller Fusion: A Hybrid Navigation Strategy For Deployment in Unknown Environments

Learning-based approaches often outperform hand-coded algorithmic soluti...
research
01/24/2023

Constrained Reinforcement Learning for Dexterous Manipulation

Existing learning approaches to dexterous manipulation use demonstration...

Please sign up or login with your details

Forgot password? Click here to reset