Decentralized Multi-AGV Task Allocation based on Multi-Agent Reinforcement Learning with Information Potential Field Rewards

08/16/2021
by   Mengyuan Li, et al.
0

Automated Guided Vehicles (AGVs) have been widely used for material handling in flexible shop floors. Each product requires various raw materials to complete the assembly in production process. AGVs are used to realize the automatic handling of raw materials in different locations. Efficient AGVs task allocation strategy can reduce transportation costs and improve distribution efficiency. However, the traditional centralized approaches make high demands on the control center's computing power and real-time capability. In this paper, we present decentralized solutions to achieve flexible and self-organized AGVs task allocation. In particular, we propose two improved multi-agent reinforcement learning algorithms, MADDPG-IPF (Information Potential Field) and BiCNet-IPF, to realize the coordination among AGVs adapting to different scenarios. To address the reward-sparsity issue, we propose a reward shaping strategy based on information potential field, which provides stepwise rewards and implicitly guides the AGVs to different material targets. We conduct experiments under different settings (3 AGVs and 6 AGVs), and the experiment results indicate that, compared with baseline methods, our work obtains up to 47% task response improvement and 22% training iterations reduction.

READ FULL TEXT

page 1

page 4

research
10/09/2022

ELIGN: Expectation Alignment as a Multi-Agent Intrinsic Reward

Modern multi-agent reinforcement learning frameworks rely on centralized...
research
05/23/2023

Constrained Reinforcement Learning for Dynamic Material Handling

As one of the core parts of flexible manufacturing systems, material han...
research
06/23/2020

Online Multi-agent Reinforcement Learning for Decentralized Inverter-based Volt-VAR Control

The distributed Volt/Var control (VVC) methods have been widely studied ...
research
08/11/2023

The Impact of Overall Optimization on Warehouse Automation

In this study, we propose a novel approach for investigating optimizatio...
research
04/27/2018

Routing Driverless Transport Vehicles in Car Assembly with Answer Set Programming

Automated storage and retrieval systems are principal components of mode...
research
09/08/2013

Regret-Based Multi-Agent Coordination with Uncertain Task Rewards

Many multi-agent coordination problems can be represented as DCOPs. Moti...

Please sign up or login with your details

Forgot password? Click here to reset