Optimal Dispatch in Emergency Service System via Reinforcement Learning

10/15/2020
by   Cheng Hua, et al.
0

In the United States, medical responses by fire departments over the last four decades increased by 367 emergency response departments that existing resources are efficiently used. In this paper, we model the ambulance dispatch problem as an average-cost Markov decision process and present a policy iteration approach to find an optimal dispatch policy. We then propose an alternative formulation using post-decision states that is shown to be mathematically equivalent to the original model, but with a much smaller state space. We present a temporal difference learning approach to the dispatch problem based on the post-decision states. In our numerical experiments, we show that our obtained temporal-difference policy outperforms the benchmark myopic policy. Our findings suggest that emergency response departments can improve their performance with minimal to no cost.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/12/2018

Feature-Based Aggregation and Deep Reinforcement Learning: A Survey and Some New Implementations

In this paper we discuss policy iteration methods for approximate soluti...
research
02/25/2019

An Intrusion Using Malware and DDNS

This whitepaper captures the details of the technical alert numbered TA1...
research
09/20/2021

A Reinforcement Learning Approach to the Stochastic Cutting Stock Problem

We propose a formulation of the stochastic cutting stock problem as a di...
research
06/29/2021

Globally Optimal Hierarchical Reinforcement Learning for Linearly-Solvable Markov Decision Processes

In this work we present a novel approach to hierarchical reinforcement l...
research
07/05/2021

The Multi-phase spatial meta-heuristic algorithm for public health emergency transportation

The delivery of Medical Countermeasures(MCMs) for mass prophylaxis in th...
research
12/03/2019

Mo' States Mo' Problems: Emergency Stop Mechanisms from Observation

In many environments, only a relatively small subset of the complete sta...
research
08/17/2021

On the equivalence of holding cost and response time for evaluating performance of queues

This self-contained discussion relates the long-run average holding cost...

Please sign up or login with your details

Forgot password? Click here to reset