Logic-based Reward Shaping for Multi-Agent Reinforcement Learning

06/17/2022
by   Ingy Elsayed-Aly, et al.
0

Reinforcement learning (RL) relies heavily on exploration to learn from its environment and maximize observed rewards. Therefore, it is essential to design a reward function that guarantees optimal learning from the received experience. Previous work has combined automata and logic based reward shaping with environment assumptions to provide an automatic mechanism to synthesize the reward function based on the task. However, there is limited work on how to expand logic-based reward shaping to Multi-Agent Reinforcement Learning (MARL). The environment will need to consider the joint state in order to keep track of other agents if the task requires cooperation, thus suffering from the curse of dimensionality with respect to the number of agents. This project explores how logic-based reward shaping for MARL can be designed for different scenarios and tasks. We present a novel method for semi-centralized logic-based MARL reward shaping that is scalable in the number of agents and evaluate it in multiple scenarios.

READ FULL TEXT
research
10/06/2020

Reward Machines: Exploiting Reward Function Structure in Reinforcement Learning

Reinforcement learning (RL) methods usually treat reward functions as bl...
research
06/23/2023

Reinforcement Learning with Temporal-Logic-Based Causal Diagrams

We study a class of reinforcement learning (RL) tasks where the objectiv...
research
05/23/2017

Reinforcement Learning with a Corrupted Reward Channel

No real-world reward function is perfect. Sensory errors and software bu...
research
05/02/2021

Curious Exploration and Return-based Memory Restoration for Deep Reinforcement Learning

Reward engineering and designing an incentive reward function are non-tr...
research
05/30/2022

Designing Rewards for Fast Learning

To convey desired behavior to a Reinforcement Learning (RL) agent, a des...
research
07/05/2022

The StarCraft Multi-Agent Challenges+ : Learning of Multi-Stage Tasks and Environmental Factors without Precise Reward Functions

In this paper, we propose a novel benchmark called the StarCraft Multi-A...
research
11/02/2020

Interpreting Graph Drawing with Multi-Agent Reinforcement Learning

Applying machine learning techniques to graph drawing has become an emer...

Please sign up or login with your details

Forgot password? Click here to reset