Large-scale Traffic Signal Control Using a Novel Multi-Agent Reinforcement Learning

08/10/2019
by Xiaoqiang Wang, et al.

Finding the optimal signal timing strategy for large-scale traffic signal control (TSC) is a difficult task. Multi-Agent Reinforcement Learning (MARL) is a promising approach to this problem. However, existing methods leave room for improvement in scaling to large problems and in modeling, for each individual agent, the behaviors of the other agents. In this paper, we propose a new MARL algorithm, called Cooperative double Q-learning (Co-DQL), which has several prominent features. It uses a highly scalable independent double Q-learning method based on double estimators and the upper confidence bound (UCB) policy, which can eliminate the over-estimation problem of traditional independent Q-learning while ensuring exploration. It uses a mean field approximation to model the interaction among agents, which helps agents learn a better cooperative strategy. To improve the stability and robustness of the learning process, we introduce a new reward allocation mechanism and a local state sharing method. In addition, we analyze the convergence properties of the proposed algorithm. Co-DQL is applied to TSC and tested on a multi-traffic signal simulator. In several traffic scenarios, Co-DQL outperforms several state-of-the-art decentralized MARL algorithms and effectively shortens the average waiting time of vehicles across the whole road system.
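To make the "double estimators plus UCB" idea concrete, here is a minimal tabular sketch of a single independent agent. It is not the authors' Co-DQL implementation: the class name `DoubleQAgent`, its parameters (`alpha`, `gamma`, `c`), and the tabular state representation are all assumptions made for illustration, and the mean field approximation, reward allocation, and local state sharing described above are left out.

```python
import math
import random


class DoubleQAgent:
    """Hypothetical tabular double Q-learning agent with UCB action selection."""

    def __init__(self, n_states, n_actions, alpha=0.1, gamma=0.95, c=1.0, seed=0):
        self.n_actions = n_actions
        self.alpha, self.gamma, self.c = alpha, gamma, c
        # Two independent estimators; keeping both is what reduces over-estimation.
        self.qa = [[0.0] * n_actions for _ in range(n_states)]
        self.qb = [[0.0] * n_actions for _ in range(n_states)]
        # Visit counts per (state, action), used by the UCB bonus.
        self.counts = [[0] * n_actions for _ in range(n_states)]
        self.rng = random.Random(seed)

    def select_action(self, state):
        # Try every action once before the UCB bonus is well defined.
        for a in range(self.n_actions):
            if self.counts[state][a] == 0:
                return a
        total = sum(self.counts[state])

        def score(a):
            mean_q = 0.5 * (self.qa[state][a] + self.qb[state][a])
            bonus = self.c * math.sqrt(math.log(total) / self.counts[state][a])
            return mean_q + bonus

        return max(range(self.n_actions), key=score)

    def update(self, state, action, reward, next_state):
        self.counts[state][action] += 1
        # Randomly choose which estimator learns on this step.
        if self.rng.random() < 0.5:
            learn, evaluate = self.qa, self.qb
        else:
            learn, evaluate = self.qb, self.qa
        # The learning estimator picks the greedy next action,
        # the other estimator supplies its value.
        best = max(range(self.n_actions), key=lambda a: learn[next_state][a])
        target = reward + self.gamma * evaluate[next_state][best]
        learn[state][action] += self.alpha * (target - learn[state][action])
```

In a multi-intersection setting, one such agent would be created per traffic signal. Selecting the greedy action with one estimator and evaluating it with the other is the standard double Q-learning device for reducing the over-estimation that a single max operator introduces.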

research
09/11/2022

Graphon Mean-Field Control for Cooperative Multi-Agent Reinforcement Learning

The marriage between mean-field theory and reinforcement learning has sh...
research
04/22/2021

Independent Reinforcement Learning for Weakly Cooperative Multiagent Traffic Control Problem

The adaptive traffic signal control (ATSC) problem can be modeled as a m...
research
02/27/2020

Gamma-Reward: A Novel Multi-Agent Reinforcement Learning Method for Traffic Signal Control

The intelligent control of traffic signal is critical to the optimizatio...
research
11/20/2018

Brain-Inspired Stigmergy Learning

Stigmergy has proved its great superiority in terms of distributed contr...
research
02/27/2020

Learning Scalable Multi-Agent Coordination by Spatial Differential for Traffic Signal Control

The intelligent control of the traffic signal is critical to the optimiz...
research
02/04/2022

Analysis of Independent Learning in Network Agents: A Packet Forwarding Use Case

Multi-Agent Reinforcement Learning (MARL) is nowadays widely used to sol...
research
01/04/2021

Variationally and Intrinsically motivated reinforcement learning for decentralized traffic signal control

One of the biggest challenges in multi-agent reinforcement learning is c...
