Cooperative Multi-Objective Reinforcement Learning for Traffic Signal Control and Carbon Emission Reduction

06/16/2023
by   Cheng Ruei Tang, et al.

Existing traffic signal control systems rely on oversimplified rule-based methods, and even methods based on reinforcement learning (RL) are often suboptimal and unstable. To address this, we propose a cooperative multi-objective architecture called Multi-Objective Multi-Agent Deep Deterministic Policy Gradient (MOMA-DDPG), which estimates multiple reward terms for traffic signal control optimization using age-decaying weights. Our approach involves two types of agents: local agents, each optimizing traffic at its own intersection, and a global agent that optimizes overall traffic throughput. Although the architecture includes a global agent, the solution remains decentralized because this agent is no longer needed at the inference stage. We evaluate our method on real-world traffic data collected from an Asian country's traffic cameras. The results show that MOMA-DDPG outperforms state-of-the-art methods across all performance metrics. Additionally, the proposed system minimizes both waiting time and carbon emissions. Notably, this paper is the first to link carbon emissions and global agents in traffic signal control.
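
The abstract does not say how the age-decaying weights are defined, so the following is only a minimal sketch of one possible reading: each reward term (for example waiting time or emissions) gets a weight that decays exponentially with an "age" counter, and the weights are renormalized before the terms are summed. The function names, the exponential form, the decay rates, and the meaning of `age` are all assumptions made for illustration, not the authors' formulation.

```python
import math


def age_decayed_weights(base_weights, decay_rates, age):
    """Hypothetical weighting: w_i * exp(-k_i * age), normalized to sum to 1."""
    raw = [w * math.exp(-k * age) for w, k in zip(base_weights, decay_rates)]
    total = sum(raw)
    return [r / total for r in raw]


def combined_reward(reward_terms, base_weights, decay_rates, age):
    """Scalarize several reward terms with the assumed age-decayed weights."""
    weights = age_decayed_weights(base_weights, decay_rates, age)
    return sum(w * r for w, r in zip(weights, reward_terms))


if __name__ == "__main__":
    # Two illustrative reward terms; the second term's weight decays with age.
    terms = [2.0, 4.0]
    for age in (0, 10, 50):
        print(age, combined_reward(terms, [1.0, 1.0], [0.0, 0.1], age))
```

Under these assumed settings, both terms contribute equally at age 0, and the combined reward drifts toward the non-decaying term as the age grows.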

research
05/23/2022

Cooperative Reinforcement Learning on Traffic Signal Control

Traffic signal control is a challenging real-world problem aiming to min...
research
03/11/2019

Multi-Agent Deep Reinforcement Learning for Large-scale Traffic Signal Control

Reinforcement learning (RL) is a promising data-driven approach for adap...
research
12/09/2019

Intelligent Coordination among Multiple Traffic Intersections Using Multi-Agent Reinforcement Learning

We use Asynchronous Advantage Actor Critic (A3C) for implementing an AI ...
research
09/30/2019

Tensor-based Cooperative Control for Large Scale Multi-intersection Traffic Signal Using Deep Reinforcement Learning and Imitation Learning

Traffic signal control has long been considered as a critical topic in i...
research
03/27/2017

Deep Deterministic Policy Gradient for Urban Traffic Light Control

Traffic light timing optimization is still an active line of research de...
research
04/21/2017

Multi-Objective Deep Q-Learning with Subsumption Architecture

In this work we present a method for using Deep Q-Networks (DQNs) in mul...
research
08/24/2023

Perimeter Control with Heterogeneous Cordon Signal Behaviors: A Semi-Model Dependent Reinforcement Learning Approach

Perimeter Control (PC) strategies have been proposed to address urban ro...
