Safe Multi-Agent Reinforcement Learning through Decentralized Multiple Control Barrier Functions

03/23/2021
by   Zhiyuan Cai, et al.
0

Multi-Agent Reinforcement Learning (MARL) algorithms show amazing performance in simulation in recent years, but placing MARL in real-world applications may suffer safety problems. MARL with centralized shields was proposed and verified in safety games recently. However, centralized shielding approaches can be infeasible in several real-world multi-agent applications that involve non-cooperative agents or communication delay. Thus, we propose to combine MARL with decentralized Control Barrier Function (CBF) shields based on available local information. We establish a safe MARL framework with decentralized multiple CBFs and develop Multi-Agent Deep Deterministic Policy Gradient (MADDPG) to Multi-Agent Deep Deterministic Policy Gradient with decentralized multiple Control Barrier Functions (MADDPG-CBF). Based on a collision-avoidance problem that includes not only cooperative agents but obstacles, we demonstrate the construction of multiple CBFs with safety guarantees in theory. Experiments are conducted and experiment results verify that the proposed safe MARL framework can guarantee the safety of agents included in MARL.

READ FULL TEXT

page 1

page 3

research
01/14/2021

Learning Safe Multi-Agent Control with Decentralized Neural Barrier Certificates

We study the multi-agent safe control problem where agents should avoid ...
research
01/27/2021

Safe Multi-Agent Reinforcement Learning via Shielding

Multi-agent reinforcement learning (MARL) has been increasingly used in ...
research
10/25/2019

MAMPS: Safe Multi-Agent Reinforcement Learning via Model Predictive Shielding

Reinforcement learning is a promising approach to learning control polic...
research
04/09/2022

Trust-based Rate-Tunable Control Barrier Functions for Non-Cooperative Multi-Agent Systems

For efficient and robust task accomplishment in multi-agent systems, an ...
research
04/11/2020

Safe Multi-Agent Interaction through Robust Control Barrier Functions with Learned Uncertainties

Robots operating in real world settings must navigate and maintain safet...
research
06/17/2022

Responsibility-associated Multi-agent Collision Avoidance with Social Preferences

This paper introduces a novel social preference-aware decentralized safe...
research
09/14/2021

Reactive and Safe Road User Simulations using Neural Barrier Certificates

Reactive and safe agent modelings are important for nowadays traffic sim...

Please sign up or login with your details

Forgot password? Click here to reset