COVID-19 Pandemic Cyclic Lockdown Optimization Using Reinforcement Learning

09/10/2020
by   Mauricio Arango, et al.
0

This work examines the use of reinforcement learning (RL) to optimize cyclic lockdowns, which is one of the methods available for control of the COVID-19 pandemic. The problem is structured as an optimal control system for tracking a reference value, corresponding to the maximum usage level of a critical resource, such as ICU beds. However, instead of using conventional optimal control methods, RL is used to find optimal control policies. A framework was developed to calculate optimal cyclic lockdown timings using an RL-based on-off controller. The RL-based controller is implemented as an RL agent that interacts with an epidemic simulator, implemented as an extended SEIR epidemic model. The RL agent learns a policy function that produces an optimal sequence of open/lockdown decisions such that goals specified in the RL reward function are optimized. Two concurrent goals were used: the first one is a public health goal that minimizes overshoots of ICU bed usage above an ICU bed threshold, and the second one is a socio-economic goal that minimizes the time spent under lockdowns. It is assumed that cyclic lockdowns are considered as a temporary alternative to extended lockdowns when a region faces imminent danger of overpassing resource capacity limits and when imposing an extended lockdown would cause severe social and economic consequences due to lack of necessary economic resources to support its affected population during an extended lockdown.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/22/2019

Jet grooming through reinforcement learning

We introduce a novel implementation of a reinforcement learning (RL) alg...
research
11/14/2020

RL-QN: A Reinforcement Learning Framework for Optimal Control of Queueing Systems

With the rapid advance of information technology, network systems have b...
research
06/16/2023

CAJun: Continuous Adaptive Jumping using a Learned Centroidal Controller

We present CAJun, a novel hierarchical learning and control framework th...
research
10/20/2020

Reinforcement Learning for Optimization of COVID-19 Mitigation policies

The year 2020 has seen the COVID-19 virus lead to one of the worst globa...
research
08/03/2023

Aligning Agent Policy with Externalities: Reward Design via Bilevel RL

In reinforcement learning (RL), a reward function is often assumed at th...
research
03/24/2021

Pyfectious: An individual-level simulator to discover optimal containment polices for epidemic diseases

Simulating the spread of infectious diseases in human communities is cri...
research
03/31/2020

Optimising Lockdown Policies for Epidemic Control using Reinforcement Learning

In the context of the ongoing Covid-19 pandemic, several reports and stu...

Please sign up or login with your details

Forgot password? Click here to reset