Safe Policy Synthesis in Multi-Agent POMDPs via Discrete-Time Barrier Functions

03/19/2019
by   Mohamadreza Ahmadi, et al.
0

A multi-agent partially observable Markov decision process (MPOMDP) is a modeling paradigm used for high-level planning of heterogeneous autonomous agents subject to uncertainty and partial observation. Despite their modeling efficiency, MPOMDPs have not received significant attention in safety-critical settings. In this paper, we use barrier functions to design policies for MPOMDPs that ensure safety. Notably, our method does not rely on discretization of the belief space, or finite memory. To this end, we formulate sufficient and necessary conditions for the safety of a given set based on discrete-time barrier functions (DTBFs) and we demonstrate that our formulation also allows for Boolean compositions of DTBFs for representing more complicated safe sets. We show that the proposed method can be implemented online by a sequence of one-step greedy algorithms as a standalone safe controller or as a safety-filter given a nominal planning policy. We illustrate the efficiency of the proposed methodology based on DTBFs using a high-fidelity simulation of heterogeneous robots.

READ FULL TEXT

page 1

page 6

page 7

research
03/19/2020

Barrier Functions for Multiagent-POMDPs with DTL Specifications

Multi-agent partially observable Markov decision processes (MPOMDPs) pro...
research
09/19/2021

Model-Free Safety-Critical Control for Robotic Systems

This paper presents a framework for the safety-critical control of robot...
research
09/14/2021

Reactive and Safe Road User Simulations using Neural Barrier Certificates

Reactive and safe agent modelings are important for nowadays traffic sim...
research
04/11/2020

Safe Multi-Agent Interaction through Robust Control Barrier Functions with Learned Uncertainties

Robots operating in real world settings must navigate and maintain safet...
research
07/13/2023

CaRT: Certified Safety and Robust Tracking in Learning-based Motion Planning for Multi-Agent Systems

The key innovation of our analytical method, CaRT, lies in establishing ...
research
11/03/2020

Risk-Sensitive Path Planning via CVaR Barrier Functions: Application to Bipedal Locomotion

Enforcing safety of robotic systems in the presence of stochastic uncert...
research
03/02/2021

Multi-robot task allocation for safe planning under dynamic uncertainties

This paper considers the problem of multi-robot safe mission planning in...

Please sign up or login with your details

Forgot password? Click here to reset