Reinforcement learning for traffic signal control in hybrid action space

11/23/2022
by   Haoqing Luo, et al.
0

The prevailing reinforcement-learning-based traffic signal control methods are typically staging-optimizable or duration-optimizable, depending on the action spaces. In this paper, we propose a novel control architecture, TBO, which is based on hybrid proximal policy optimization. To the best of our knowledge, TBO is the first RL-based algorithm to implement synchronous optimization of the staging and duration. Compared to discrete and continuous action spaces, hybrid action space is a merged search space, in which TBO better implements the trade-off between frequent switching and unsaturated release. Experiments are given to demonstrate that TBO reduces the queue length and delay by 13.78 existing baselines. Furthermore, we calculate the Gini coefficients of the right-of-way to indicate TBO does not harm fairness while improving efficiency.

READ FULL TEXT

page 10

page 13

research
09/12/2021

HyAR: Addressing Discrete-Continuous Action Reinforcement Learning via Hybrid Action Representation

Discrete-continuous hybrid action space is a natural setting in many pra...
research
10/10/2018

Parametrized Deep Q-Networks Learning: Reinforcement Learning with Discrete-Continuous Hybrid Action Space

Most existing deep reinforcement learning (DRL) frameworks consider eith...
research
01/02/2020

Continuous-Discrete Reinforcement Learning for Hybrid Control in Robotics

Many real-world control problems involve both discrete decision variable...
research
09/29/2021

Improving Safety in Deep Reinforcement Learning using Unsupervised Action Planning

One of the key challenges to deep reinforcement learning (deep RL) is to...
research
04/07/2022

DynLight: Realize dynamic phase duration with multi-level traffic signal control

Adopting reinforcement learning (RL) for traffic signal control is incre...
research
07/22/2022

Learn Continuously, Act Discretely: Hybrid Action-Space Reinforcement Learning For Optimal Execution

Optimal execution is a sequential decision-making problem for cost-savin...
research
12/30/2021

Knowledge intensive state design for traffic signal control

There is a general trend of applying reinforcement learning (RL) techniq...

Please sign up or login with your details

Forgot password? Click here to reset