Planning Multiple Epidemic Interventions with Reinforcement Learning

01/30/2023
by   Anh Mai, et al.
0

Combating an epidemic entails finding a plan that describes when and how to apply different interventions, such as mask-wearing mandates, vaccinations, school or workplace closures. An optimal plan will curb an epidemic with minimal loss of life, disease burden, and economic cost. Finding an optimal plan is an intractable computational problem in realistic settings. Policy-makers, however, would greatly benefit from tools that can efficiently search for plans that minimize disease and economic costs especially when considering multiple possible interventions over a continuous and complex action space given a continuous and equally complex state space. We formulate this problem as a Markov decision process. Our formulation is unique in its ability to represent multiple continuous interventions over any disease model defined by ordinary differential equations. We illustrate how to effectively apply state-of-the-art actor-critic reinforcement learning algorithms (PPO and SAC) to search for plans that minimize overall costs. We empirically evaluate the learning performance of these algorithms and compare their performance to hand-crafted baselines that mimic plans constructed by policy-makers. Our method outperforms baselines. Our work confirms the viability of a computational approach to support policy-makers

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/11/2022

Exploring the Pareto front of multi-objective COVID-19 mitigation policies using reinforcement learning

Infectious disease outbreaks can have a disruptive impact on public heal...
research
08/04/2021

Risk Conditioned Neural Motion Planning

Risk-bounded motion planning is an important yet difficult problem for s...
research
10/31/2021

Decentralized Multi-Agent Reinforcement Learning: An Off-Policy Method

We discuss the problem of decentralized multi-agent reinforcement learni...
research
03/15/2019

Policy Distillation and Value Matching in Multiagent Reinforcement Learning

Multiagent reinforcement learning algorithms (MARL) have been demonstrat...
research
07/13/2020

Implicit Distributional Reinforcement Learning

To improve the sample efficiency of policy-gradient based reinforcement ...
research
10/09/2020

EpidemiOptim: A Toolbox for the Optimization of Control Policies in Epidemiological Models

Epidemiologists model the dynamics of epidemics in order to propose cont...
research
10/19/2012

Optimal Limited Contingency Planning

For a given problem, the optimal Markov policy can be considerred as a c...

Please sign up or login with your details

Forgot password? Click here to reset