Reinforcement Learning for Location-Aware Scheduling

03/07/2022
by   Stelios Stavroulakis, et al.
0

Recent techniques in dynamical scheduling and resource management have found applications in warehouse environments due to their ability to organize and prioritize tasks in a higher temporal resolution. The rise of deep reinforcement learning, as a learning paradigm, has enabled decentralized agent populations to discover complex coordination strategies. However, training multiple agents simultaneously introduce many obstacles in training as observation and action spaces become exponentially large. In our work, we experimentally quantify how various aspects of the warehouse environment (e.g., floor plan complexity, information about agents' live location, level of task parallelizability) affect performance and execution priority. To achieve efficiency, we propose a compact representation of the state and action space for location-aware multi-agent systems, wherein each agent has knowledge of only self and task coordinates, hence only partial observability of the underlying Markov Decision Process. Finally, we show how agents trained in certain environments maintain performance in completely unseen settings and also correlate performance degradation with floor plan geometry.

READ FULL TEXT

page 2

page 3

research
02/14/2020

Resource Management in Wireless Networks via Multi-Agent Deep Reinforcement Learning

We propose a mechanism for distributed radio resource management using m...
research
05/11/2020

Delay-Aware Multi-Agent Reinforcement Learning for Cooperative and Competitive Environments

Action and observation delays exist prevalently in the real-world cyber-...
research
03/06/2022

Hierarchically Structured Scheduling and Execution of Tasks in a Multi-Agent Environment

In a warehouse environment, tasks appear dynamically. Consequently, a ta...
research
10/22/2018

Multi-Agent Actor-Critic with Generative Cooperative Policy Network

We propose an efficient multi-agent reinforcement learning approach to d...
research
10/16/2020

PRIMAL2: Pathfinding via Reinforcement and Imitation Multi-Agent Learning – Lifelong

Multi-agent path finding (MAPF) is an indispensable component of large-s...
research
11/28/2021

Evaluating Generalization and Transfer Capacity of Multi-Agent Reinforcement Learning Across Variable Number of Agents

Multi-agent Reinforcement Learning (MARL) problems often require coopera...
research
01/24/2021

Learning Synthetic Environments for Reinforcement Learning with Evolution Strategies

This work explores learning agent-agnostic synthetic environments (SEs) ...

Please sign up or login with your details

Forgot password? Click here to reset