SAT-MARL: Specification Aware Training in Multi-Agent Reinforcement Learning

12/14/2020
by   Fabian Ritz, et al.
6

A characteristic of reinforcement learning is the ability to develop unforeseen strategies when solving problems. While such strategies sometimes yield superior performance, they may also result in undesired or even dangerous behavior. In industrial scenarios, a system's behavior also needs to be predictable and lie within defined ranges. To enable the agents to learn (how) to align with a given specification, this paper proposes to explicitly transfer functional and non-functional requirements into shaped rewards. Experiments are carried out on the smart factory, a multi-agent environment modeling an industrial lot-size-one production facility, with up to eight agents and different multi-agent reinforcement learning algorithms. Results indicate that compliance with functional and non-functional constraints can be achieved by the proposed approach.

READ FULL TEXT
research
01/18/2022

K-nearest Multi-agent Deep Reinforcement Learning for Collaborative Tasks with a Variable Number of Agents

Traditionally, the performance of multi-agent deep reinforcement learnin...
research
06/12/2018

Multi-Agent Deep Reinforcement Learning with Human Strategies

Deep learning has enabled traditional reinforcement learning methods to ...
research
11/08/2022

Policy-Based Reinforcement Learning for Assortative Matching in Human Behavior Modeling

This paper explores human behavior in virtual networked communities, spe...
research
07/14/2021

Scalable Evaluation of Multi-Agent Reinforcement Learning with Melting Pot

Existing evaluation suites for multi-agent reinforcement learning (MARL)...
research
11/17/2018

Parameter Sharing Reinforcement Learning Architecture for Multi Agent Driving Behaviors

Multi-agent learning provides a potential framework for learning and sim...
research
02/25/2023

Hierarchical Needs-driven Agent Learning Systems: From Deep Reinforcement Learning To Diverse Strategies

The needs describe the necessities for a system to survive and evolve, w...
research
04/07/2013

A General Framework for Interacting Bayes-Optimally with Self-Interested Agents using Arbitrary Parametric Model and Model Prior

Recent advances in Bayesian reinforcement learning (BRL) have shown that...

Please sign up or login with your details

Forgot password? Click here to reset