Value-based Bayesian Meta-reinforcement Learning and Traffic Signal Control

10/01/2020
by   Yayi Zou, et al.
0

Reinforcement learning methods for traffic signal control has gained increasing interests recently and achieved better performances compared with traditional transportation methods. However, reinforcement learning based methods usually requires heavy training data and computational resources which largely limit its application in real-world traffic signal control. This makes meta-learning, which enables data-efficient and fast-adaptation training by leveraging the knowledge of previous learning experiences, catches attentions in traffic signal control. In this paper, we propose a novel value-based Bayesian meta-reinforcement learning framework BM-DQN to robustly speed up the learning process in new scenarios by utilizing well-trained prior knowledge learned from existing scenarios. This framework based on our proposed fast-adaptation variation to Gradient-EM Bayesian Meta-learning and the fast update advantage of DQN, which allows fast adaptation to new scenarios with continual learning ability and robustness to uncertainty. The experiments on 2D navigation and traffic signal control show that our proposed framework adapts more quickly and robustly in new scenarios than previous methods, and specifically, much better continual learning ability in heterogeneous scenarios.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/15/2021

ModelLight: Model-Based Meta-Reinforcement Learning for Traffic Signal Control

Traffic signal control is of critical importance for the effective use o...
research
03/12/2020

Online Fast Adaptation and Knowledge Accumulation: a New Approach to Continual Learning

Learning from non-stationary data remains a great challenge for machine ...
research
06/21/2020

Gradient-EM Bayesian Meta-learning

Bayesian meta-learning enables robust and fast adaptation to new tasks w...
research
02/11/2021

Reproducibility Report: La-MAML: Look-ahead Meta Learning for Continual Learning

The Continual Learning (CL) problem involves performing well on a sequen...
research
08/13/2020

Offline Meta-Reinforcement Learning with Advantage Weighting

Massive datasets have proven critical to successfully applying deep lear...
research
09/25/2019

Data Valuation using Reinforcement Learning

Quantifying the value of data is a fundamental problem in machine learni...
research
11/02/2021

Robust Dynamic Bus Control: A Distributional Multi-agent Reinforcement Learning Approach

Bus system is a critical component of sustainable urban transportation. ...

Please sign up or login with your details

Forgot password? Click here to reset